{"id":105,"date":"2018-10-15T11:14:46","date_gmt":"2018-10-15T11:14:46","guid":{"rendered":"http:\/\/speechlabs.pl\/?p=105"},"modified":"2018-12-09T18:12:50","modified_gmt":"2018-12-09T18:12:50","slug":"arm-at-ntav-2018","status":"publish","type":"post","link":"https:\/\/speechlabs.pl\/en\/arm-at-ntav-2018\/","title":{"rendered":"ARM at NTAV 2018"},"content":{"rendered":"\t<p>\r\n\t\t<b>Speech recognition results and their application to multimedia content search and retrieval were presented at the <a href=\"http:\/\/aes.org.pl\/ntav2018\/\" target=\"_blank\">XVII Symposium on New Trends in Audio and Video<\/a> (NTAV) in Pozna\u0144. NTAV is a biennial event organized by the <a href=\"http:\/\/aes.org.pl\/\" target=\"_blank\">Polish section of the Audio Engineering Society<\/a>.\r\n\t\t<\/b>\r\n\t<\/p>\r\n<img decoding=\"async\" class=\"product-img\" src=\"http:\/\/speechlabs.pl\/wp-content\/uploads\/2018\/10\/arm_html-1024x455.png\" alt=\"An example radio content transcript with reference text and evaluation results\" \/>\r\n\r\n\t<label  class=\"img-label\">An example radio content transcript with reference text and evaluation results<\/label>\r\n\t<p>\r\nHow do you find a specific piece of AV content among hundreds of thousands of recordings whose total duration exceeds several months or even years?<\/p>\r\n<p>\r\nTo do that, one needs a <b>text description<\/b> of each item in the set and a full-text search mechanism. The description can be generated through <b>sound and image analysis<\/b>, whose objective is to detect and recognize speech in the audio stream and to detect and recognize text in the video stream.<\/p>\r\n<p>\r\nDespite possible inaccuracies in both elements, such a description can be used to find a specific item or information on a given subject with high probability. 
Speech recognition by the <a href=\"?page=8\">ARM engine<\/a> produces not only the best hypothesis but also a set of alternatives that further increase the probability of finding the desired content.\r\n\t<\/p>\r\n\t\t<img decoding=\"async\" class=\"product-img\" src=\"http:\/\/speechlabs.pl\/wp-content\/uploads\/2018\/10\/ticker-1024x672.png\" alt=\"An example of text retrieved from an image\"\/>\r\n\t<label  class=\"img-label\">An example of text retrieved from an image<\/label>","protected":false},"excerpt":{"rendered":"Speech recognition results and their application to multimedia content search and retrieval were presented at the XVII Symposium on New Trends in Audio and Video (NTAV) in Pozna\u0144. NTAV is a biennial event organized by the Polish section of the Audio Engineering Society. An example radio content transcript with reference text and evaluation results How [&hellip;]","protected":false},"author":1,"featured_media":700,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_locale":"en_US","_original_post":"98","footnotes":""},"categories":[1],"tags":[],"class_list":["post-105","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","en-US"],"meta_box":[],"_links":{"self":[{"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/posts\/105","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/comments?post=105"}],"version-history":[{"count":8,"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/posts\/105\/revisions"}],"predecessor-version":[{"id":372,"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/posts\/105\/revisions\/372"}],"wp:featuredmedia":[{"embeddable":
true,"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/media\/700"}],"wp:attachment":[{"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/media?parent=105"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/categories?post=105"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/speechlabs.pl\/wp-json\/wp\/v2\/tags?post=105"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}