{"id":1499,"date":"2024-02-01T10:30:35","date_gmt":"2024-02-01T01:30:35","guid":{"rendered":"https:\/\/slp.cs.tut.ac.jp\/?p=1499"},"modified":"2024-10-02T13:37:49","modified_gmt":"2024-10-02T04:37:49","slug":"apsipa2023%e3%81%ab%e5%8f%82%e5%8a%a0%e3%81%97%e3%80%81%e9%ab%98%e5%9f%8e%e5%b7%bd%e6%88%90%e3%81%95%e3%82%93%e3%83%bb%e5%a1%a9%e6%a0%b9%e5%87%aa%e4%ba%ba%e3%81%95%e3%82%93%e3%83%bb%e5%8c%97%e6%a2%9d","status":"publish","type":"post","link":"https:\/\/slp.cs.tut.ac.jp\/en\/2024\/02\/01\/apsipa2023%e3%81%ab%e5%8f%82%e5%8a%a0%e3%81%97%e3%80%81%e9%ab%98%e5%9f%8e%e5%b7%bd%e6%88%90%e3%81%95%e3%82%93%e3%83%bb%e5%a1%a9%e6%a0%b9%e5%87%aa%e4%ba%ba%e3%81%95%e3%82%93%e3%83%bb%e5%8c%97%e6%a2%9d\/","title":{"rendered":"Tatsunari Takagi, Nagito Shione, Keigo Hojo and Koharu Horii presented at APSIPA 2023."},"content":{"rendered":"\n<p>Tatsunari Takagi, Nagito Shione, Keigo Hojo and Koharu Horii presented at <a href=\"https:\/\/www.apsipa2023.org\/\">APSIPA 2023<\/a>, held Oct. 31 to Nov. 3, 2023.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023takagi-1024x1024.jpg\" alt=\"\" class=\"wp-image-1421\" srcset=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023takagi-1024x1024.jpg 1024w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023takagi-300x300.jpg 300w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023takagi-150x150.jpg 150w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023takagi-768x768.jpg 768w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023takagi.jpg 1200w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Streaming End-to-End ASR<br>Using CTC Decoder and DRA for<br>Linguistic Information Substitution   Takagi<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image 
size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023shione-1024x1024.jpg\" alt=\"\" class=\"wp-image-1422\" srcset=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023shione-1024x1024.jpg 1024w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023shione-300x300.jpg 300w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023shione-150x150.jpg 150w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023shione-768x768.jpg 768w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023shione.jpg 1200w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Construction of Automatic Speech Recognition<br>Model that Recognizes Linguistic Information and<br>Verbal\/Non-verbal Phenomena   Shione<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023hojo-1024x1024.jpg\" alt=\"\" class=\"wp-image-1423\" srcset=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023hojo-1024x1024.jpg 1024w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023hojo-300x300.jpg 300w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023hojo-150x150.jpg 150w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023hojo-768x768.jpg 768w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023hojo.jpg 1200w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Combining multiple end-to-end speech recognition<br>models based on density ratio approach   Hojo<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" 
height=\"1024\" src=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023horii-1024x1024.jpg\" alt=\"\" class=\"wp-image-1424\" srcset=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023horii-1024x1024.jpg 1024w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023horii-300x300.jpg 300w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023horii-150x150.jpg 150w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023horii-768x768.jpg 768w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023horii.jpg 1200w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Language modeling for spontaneous speech<br>recognition based on disfluency labeling and<br>generation of disfluent text   Horii<\/figcaption><\/figure>\n\n\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"768\" src=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023banquet-1024x768.jpg\" alt=\"\" class=\"wp-image-1425\" srcset=\"https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023banquet-1024x768.jpg 1024w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023banquet-300x225.jpg 300w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023banquet-768x576.jpg 768w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023banquet-1536x1152.jpg 1536w, https:\/\/slp.cs.tut.ac.jp\/wp-content\/uploads\/2024\/04\/apsipa2023banquet.jpg 1600w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Break<\/figcaption><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>Oct.31-Nov.3, 2023, At APSIPA2023, Tatsunari Takagi, Nagito Shione, Keigo Hojo and Koharu Horii 
presented.<\/p>\n","protected":false},"author":10,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_locale":"en_US","_original_post":"https:\/\/slp.cs.tut.ac.jp\/?p=1420","footnotes":""},"categories":[12],"tags":[16,19],"class_list":["post-1499","post","type-post","status-publish","format-standard","hentry","category-12","tag-16","tag-19","en-US"],"_links":{"self":[{"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/posts\/1499","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/comments?post=1499"}],"version-history":[{"count":1,"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/posts\/1499\/revisions"}],"predecessor-version":[{"id":1500,"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/posts\/1499\/revisions\/1500"}],"wp:attachment":[{"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/media?parent=1499"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/categories?post=1499"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/slp.cs.tut.ac.jp\/wp-json\/wp\/v2\/tags?post=1499"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}