Click here for Kitaoka’s publication list.
2022 / 2021 / 2020 / 2019 / 2018 / 2017 / 2016 / 2015 / 2014
2023
Journal Papers
- Yukoh Wakabayashi, Kouei Yamaoka, and Nobutaka Ono, “Sound field interpolation for rotation-invariant multichannel array signal processing,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 2286-2298, Jun. 2023. (DOI: 10.1109/TASLP.2023.3282098)
- Katsunori Yokoi, Yurie Iribe, Norihide Kitaoka, Takashi Tsuboi, Keita Hiraga, Yuki Satake, Makoto Hattori, Yasuhiro Tanaka, Maki Sato, Akihiro Hori, Masahisa Katsuno, “Analysis of spontaneous speech in Parkinson’s disease by natural language processing,” Parkinsonism and Related Disorders, April, 2023. (DOI: 10.1016/j.parkreldis.2023.105411)
- Binh Thien Nguyen, Yukoh Wakabayashi, Kenta Iwai, and Takanobu Nishiura, “Inter-frequency phase difference for phase reconstruction using deep neural networks and maximum likelihood,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 1667-1680, Apr. 2023. (DOI: 10.1109/TASLP.2023.3268577)
International Conferences
- Koharu Horii, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “Language Modeling for Spontaneous Speech Recognition Based on Disfluency Labeling and Generation of Disfluent Text,” APSIPA ASC 2023, (to appear), Nov. 2023.
- Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Kengo Ohta, Atsunori Ogawa, Norihide Kitaoka, “Combining Multiple End-To-End Speech Recognition Models Based on Density Ratio Approach,” APSIPA ASC 2023, (to appear), Nov. 2023.
- Nagito Shione, Norihide Kitaoka, “Construction of Automatic Speech Recognition Model That Recognizes Linguistic Information and Verbal/Non-Verbal Phenomena,” APSIPA ASC 2023, (to appear), Nov. 2023.
- Tatsunari Takagi, Norihide Kitaoka, Atsunori Ogawa, Yukoh Wakabayashi, “Streaming End-To-End ASR Using CTC Decoder and DRA for Linguistic Information Substitution,” APSIPA ASC 2023, (to appear), Nov. 2023.
- Ryo Maejima and Norihide Kitaoka, “Speech recognition interface for updating electronic medical records with automatic itemization,” ICAICTA2023, (to appear) Oct., 2023.
- Takahiro Kinouchi, Atsunori Ogawa, Yukoh Wakabayashi and Norihide Kitaoka, “Domain adaptation with a non-parallel target domain corpus,” ICAICTA2023, (to appear) Oct., 2023.
- Tatsunari Takagi, Yukoh Wakabayashi, Atsunori Ogawa and Norihide Kitaoka, “Domain Adaptation Using Density Ratio Approach and CTC Decoder for Streaming Speech Recognition,” ICAICTA2023, (to appear) Oct., 2023.
- Nagito Shione, Yukoh Wakabayashi and Norihide Kitaoka, “Automatic Speech Recognition Using Linguistic and Verbal/Non-verbal Information,” ICAICTA2023, (to appear) Oct., 2023.
- Aito Nakata, Ryota Nishimura, Kengo Ohta, Norihide Kitaoka, “Development of a Model for Predicting Timing of Back-Channel in a Real-Time Spoken Dialog System,” GCCE2023, (to appear), Oct., 2023.
- Kazuya Tsubokura, Yurie Iribe, Norihide Kitaoka, “Relationships Between Gender, Personality Traits and Features of Multi-Modal Data to Responses to Spoken Dialog Systems Breakdown,” INTERSPEECH2023, pp. 2713-2717, Oct., 2023. (DOI: 10.21437/Interspeech.2023-1267)
Domestic Conferences
- 前島亮,北岡教英, “連続音声認識とChatGPTを活用した医療用電子カルテ項目別自動入力インタフェースの構築,” 東海支部連合大会, Aug, 2023.
Ryo Maejima and Norihide Kitaoka, “Construction of an Item-by-Item Automatic Input Interface for Electronic Medical Records Using Continuous Speech Recognition and ChatGPT,” Tokai Section Joint Conference, Aug, 2023.
- 長江勇樹,岡田智哉,入部百合絵,北岡教英,横井克典,勝野雅央, “認知症患者の自由会話音声から抽出した言語的特徴の解析,” 東海支部連合大会, Aug, 2023.
Yuki Nagae, Tomoya Okada, Yurie Iribe, Norihide Kitaoka, Katsunori Yokoi and Masahisa Katsuno, “Analysis of linguistic features extracted from free conversation speech of dementia patients,” Tokai Section Joint Conference, Aug, 2023.
- 高城巽成, 小川厚徳, 北岡教英, 若林佑幸, “暗黙的言語情報を置換するCTCデコーダを用いたストリーミングEnd-to-End音声認識,” 音学シンポジウム, Jun., 2023.
Tatsunari Takagi, Atsunori Ogawa, Norihide Kitaoka, Yukoh Wakabayashi, “Streaming End-to-End Speech Recognition Using a CTC Decoder to Replace Implicit Linguistic Information,” Acoustics Symposium, Jun. 2023.
- 木内貴浩, 小川厚徳, 若林佑幸, 北岡教英, “目標ドメイン音声を用いた自己教師あり学習に基づく音声認識モデルのドメイン適応,” 音学シンポジウム, Jun., 2023.
Takahiro Kinouchi, Atsunori Ogawa, Yukoh Wakabayashi, and Norihide Kitaoka, “Domain adaptation of speech recognition models based on self-supervised learning using target domain speech,” Acoustics Symposium, Jun. 2023.
- 塩根凪人, 若林佑幸, 北岡教英, “言語情報と言語・非言語現象を同時認識する音声認識モデルの構築,” 音学シンポジウム, Jun., 2023.
Nagito Shione, Yukoh Wakabayashi, and Norihide Kitaoka, “Construction of a speech recognition model for simultaneous recognition of linguistic information and verbal and non-verbal phenomena,” Acoustics Symposium, Jun. 2023.
- 前島 亮・森 大輝・若林佑幸・北岡教英, “小規模学習データドメインのための文生成に基づく音声認識用言語モデルの構築,” SPEASIPワークショップ, Mar., 2023.
Ryo Maejima, Daiki Mori, Yukoh Wakabayashi, and Norihide Kitaoka, “Construction of a Language Model for Speech Recognition Based on Sentence Generation for Small Training Data Domains,” SPEASIP Workshop, Mar. 2023.
- 塩根凪人・若林佑幸・北岡教英, “言語・非言語情報タグを付与する音声認識モデルの検討,” SPEASIPワークショップ, Mar., 2023.
Nagito Shione, Yukoh Wakabayashi and Norihide Kitaoka, “A study of speech recognition models with linguistic and non-linguistic information tags,” SPEASIP Workshop, Mar. 2023.
- 髙橋 知宏,木下 裕磨,若林 佑幸,小野 順貴,本多 潤,福馬 誠士,中川 浩, “トラフィックカウンタにより取得した学習データに基づく音による交通モニタリング,” 日本音響学会講演論文集, 1-1-12, Mar., 2023.
Tomohiro Takahashi, Yuma Kinoshita, Yukoh Wakabayashi, Nobutaka Ono, Jun Honda, Seiji Fukuma and Hiroshi Nakagawa, “Traffic Monitoring by Sound Based on Training Data Obtained by Traffic Counter,” Proceedings of the Acoustical Society of Japan, 1-1-12, Mar., 2023.
- 上坂 奏人,河内 秀人,山岡 洸瑛,若林 佑幸,木下 裕磨,小野 順貴,野口 潤,渡邉 惠,一戸 紀孝,ベナー 聖子,山末 英典, “機械学習によるマーモセットの発声分類とそれに基づく発達に伴う発声変化の分析,” 日本音響学会講演論文集, 3-4-5, Mar., 2023.
Kanato Uesaka, Shuto Kawauchi, Kouei Yamaoka, Yukoh Wakabayashi, Yuma Kinoshita, Nobutaka Ono, Jun Noguchi, Kei Watanabe, Noritaka Ichinohe, Seiko Benner and Hidenori Yamasue, “Vocal classification of marmosets using machine learning and analysis of developmental vocal change based on it,” Proceedings of the Acoustical Society of Japan, 3-4-5, Mar., 2023.
- 堀井 こはる,太田 健吾,西村 良太,小川 厚徳,北岡 教英, “自発的発話認識のためのBERTによる非流暢文生成に基づく言語モデリング,” 日本音響学会講演論文集, 1-3-2, Mar., 2023.
Koharu Horii, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “Language modelling based on non-fluent sentence generation by BERT for spontaneous speech recognition,” Proceedings of the Acoustical Society of Japan, 1-3-2, Mar. 2023.
- 伊達 龍斗,太田 健吾,西村 良太,北岡 教英, “深層学習による口唇情報を用いた雑音下での音声認識精度の改善,” 日本音響学会講演論文集, 1-3P-3, Mar., 2023.
Ryuto Date, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “Improvement of speech recognition accuracy in noise using lip information by deep learning,” Proceedings of the Acoustical Society of Japan, 1-3P-3, Mar., 2023.
- 北條 圭悟,森 大輝,若林 佑幸,小川 厚徳,北岡 教英, “複数Encoder-Decoder 音声認識モデルの統合による頑健な音声認識システムの構築,” 日本音響学会講演論文集, 1-3Q-3, Mar., 2023.
Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Atsunori Ogawa and Norihide Kitaoka, “Construction of a robust speech recognition system by integration of multiple Encoder-Decoder speech recognition models,” Proceedings of the Acoustical Society of Japan, 1-3Q-3, Mar., 2023.
- 杉山 雅和,太田 健吾,西村 良太,北岡 教英, “割り込み発話にも対応可能なリアルタイム話者交替システム,” 日本音響学会講演論文集, 2-3P-1, Mar., 2023.
Masakazu Sugiyama, Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “A real-time turn-taking system that also handles interrupting utterances,” Proceedings of the Acoustical Society of Japan, 2-3P-1, Mar. 2023.
- 坪倉和哉, 武田拓也, 入部百合絵, 北岡教英, “音声対話システムの対話破綻に対するユーザの反応と個人特性との関連,” 言語処理学会第29回年次大会, pp. 2002-2006, Mar., 2023.
Kazuya Tsubokura, Takuya Takeda, Yurie Iribe, and Norihide Kitaoka, “Relationship between user responses to dialogue breakdowns in spoken dialogue systems and individual characteristics,” 29th Annual Meeting of the Association for Natural Language Processing, pp. 2002-2006, Mar. 2023.
- 堀田慎,堀井こはる,北岡教英,西崎博光, “日本語音声認識結果の整形に基づく分かりやすい英語字幕の生成,” 情報処理学会第85回全国大会, 1W-01, Mar., 2023.
Makoto Hotta, Koharu Horii, Norihide Kitaoka, Hiromitsu Nishizaki, “Generation of easy-to-understand English subtitles based on shaping of Japanese speech recognition results,” IPSJ 85th National Convention, 1W-01, Mar. 2023.
2022
Journal Papers
- Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Koharu Horii, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka, “A new speech corpus of super-elderly Japanese for acoustic modeling,” Computer Speech & Language, Vol. 77, pp. 1-22, 2022. (DOI: 10.1016/j.csl.2022.101424)
- 西村 良太, 森 雷太, 太田 健吾, 北岡 教英, “音声対話システムのための自由発話に対応した照応解析による入力発話への話題補完手法,” 人工知能学会論文誌, Vol. 37, No. 3, pp.1–13, 2022.
Ryota Nishimura, Raita Mori, Kengo Ohta, and Norihide Kitaoka, “A Topic Complementation Method for Input Utterances Using Anaphora Resolution Adapted to Free Speech for Spoken Dialogue Systems,” Transactions of the Japanese Society for Artificial Intelligence, Vol. 37, No. 3, pp. 1-13, 2022.
Explanation
- 北岡 教英, 西村 良太, 太田 健吾, “フォトリアルCGエージェントとのマルチモーダル対話,” 日本音響学会誌 Vol. 78, No. 5, pp. 257-264, May, 2022.
Norihide Kitaoka, Ryota Nishimura, and Kengo Ohta, “Multimodal dialogue with photorealistic CG agents,” Journal of the Acoustical Society of Japan, Vol. 78, No. 5, pp. 257-264, May, 2022.
- 山本 一公, 坂野 秀樹, 北岡 教英, “小特集「音声対話システムにおける“不気味の谷”を超えるには」にあたって,” 日本音響学会誌 Vol. 78, No. 5, pp. 245-248, May, 2022.
Kazumasa Yamamoto, Hideki Banno, and Norihide Kitaoka, “Introduction to the special section ‘Overcoming the uncanny valley in spoken dialogue systems’,” Journal of the Acoustical Society of Japan, Vol. 78, No. 5, pp. 245-248, May, 2022.
International Conferences
- Binh Thien Nguyen, Yukoh Wakabayashi, Geng Yuting, Kenta Iwai, and Takanobu Nishiura, “Von Mises mixture model-based DNN for sign indetermination problem in phase reconstruction,” Proc. APSIPA ASC 2022, pp. 958-962, Chiang Mai, Nov., 2022.
- Yui Kuriki, Taishi Nakashima, Kouei Yamaoka, Natsuki Ueno, Yukoh Wakabayashi, Nobutaka Ono, and Ryo Sato, “Efficient low-latency convolution with uniform filter partition and its evaluation on real-time blind source separation,” Proc. APSIPA ASC 2022, pp. 766-770, Chiang Mai, Nov., 2022.
- Kenta Yamada, Yoshiki Masuyama, Yukoh Wakabayashi, and Nobutaka Ono, “Simultaneous frequency estimation for three or more sinusoids based on sinusoidal constraint differential equation,” Proc. APSIPA ASC 2022, pp. 976-979, Chiang Mai, Nov., 2022.
- Meiko Fukuda, Masakazu Sugiyama, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka, “A corpus-based analysis of age-related change in the acoustic features of elderly to super elderly speech,” Proc. Oriental-COCOSDA, (6 pages), Nov., 2022.
- Haruki Nammoku, Kouei Yamaoka, Taishi Nakashima, Yukoh Wakabayashi, and Nobutaka Ono, “Analysis and source separation of overlapping speech using corpus of everyday Japanese conversation,” Proc. ICA, Gyeongju, Oct., 2022.
- Kazuya Tsubokura, Yurie Iribe, Norihide Kitaoka, “Dialog Breakdown Detection Using Multimodal Features for Non-Task-Oriented Dialog Systems,” GCCE2022, pp. 359-363, Oct., 2022.
- Shuming Luan, Yukoh Wakabayashi, and Tomoki Toda, “Modified sound field interpolation method for rotation-robust beamforming with unequally spaced circular microphone array,” Proc. EUSIPCO 2022, pp. 344-348, Belgrade, Sep., 2022.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “Implicit language information replace method in Japanese encoder-decoder ASR model,” ICAICTA-2022, Sep., 2022.
- Takahiro Kinouchi, Norihide Kitaoka, “A response generation method of chat-bot system using input formatting and reference resolution,” ICAICTA-2022, Sep., 2022.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “End-to-End Spontaneous Speech Recognition Using Disfluency Labeling,” Proc. INTERSPEECH 2022, (5 pages), Sep., 2022.
- Meiko Fukuda, Maina Umezawa, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka, “Elderly Conversational Speech Corpus with Cognitive Impairment Test and Pilot Dementia Detection Experiment Using Acoustic Characteristics of Speech in Japanese Dialects,” Proc. LREC2022, pp. 1016-1022, Jun, 2022.
- Akio Kobayashi, Junji Onishi, Hiromitsu Nishizaki, Norihide Kitaoka, “End-to-End Speech to Braille Translation in Japanese,” ICCE2021, 2 pages, Jan., 2022.
Domestic Conferences
- 北條 圭悟,森 大輝,若林 佑幸,小川 厚徳,北岡 教英, “Density Ratio Approachに基づく複数Encoder-Decoder音声認識モデル統合手法,” 第24回音声言語および第9回自然言語処理シンポジウム, Dec., 2022.
Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Atsunori Ogawa, Norihide Kitaoka, “Multiple Encoder-Decoder Speech Recognition Model Integration Method Based on Density Ratio Approach,” 24th Symposium on Spoken Language and 9th Natural Language Processing, Dec. 2022.
- 鳥井章宏, 西村良太, 北岡教英, “音声対話システムにおける対話破綻検出器の構築,” 令和4年度 電気・電子・情報関係学会 四国支部連合大会 講演論文集, vol. 15-8, p. 145, 2022.
Akihiro Torii, Ryota Nishimura and Norihide Kitaoka, “Construction of a Dialogue Breakdown Detector for Spoken Dialogue Systems,” Proceedings of the 2022 Shikoku Branch Joint Conference of Institutes of Electrical, Electronics and Information Engineers, vol. 15-8, p. 145, 2022.
- 福村考洋, 西村良太, 北岡教英, “BERT による雑談対話話題拡張,” 令和4年度 電気・電子・情報関係学会 四国支部連合大会 講演論文集, vol. 15-9, p. 146, 2022.
Kohyo Fukumura, Ryota Nishimura, and Norihide Kitaoka, “Extension of Chat Dialogue Topics by BERT,” Proceedings of the 2022 Shikoku Branch Joint Conference of Institutes of Electrical, Electronics and Information Engineers, vol. 15-9, p. 146, 2022.
- Binh Thien Nguyen, Yukoh Wakabayashi, Yuting GENG, Kenta Iwai, and Takanobu Nishiura, “Two-stage phase reconstruction using inter-frequency phase difference,” 日本音響学会講演論文集, 1-Q-11, Sep. 2022.
Binh Thien Nguyen, Yukoh Wakabayashi, Yuting GENG, Kenta Iwai, and Takanobu Nishiura, “Two-stage phase reconstruction using inter-frequency phase difference,” Proceedings of the Acoustical Society of Japan, 1-Q-11, Sep. 2022.
- 連 冠三, 中嶋 大志, 若林 佑幸, 小野 順貴, “補助関数法に基づく円状マイクロホンアレイの自己回転角度推定,” 日本音響学会講演論文集, 1-R-29, Sep. 2022.
Guanzang Ren, Taishi Nakashima, Yukoh Wakabayashi, and Nobutaka Ono, “Self-rotation angle estimation of circular microphone array based on auxiliary function method,” Proceedings of the Acoustical Society of Japan, 1-R-29, Sep. 2022.
- 中嶋 大志, 若林 佑幸, 小野 順貴, “音場補間を用いた円状マイクロホンアレイの回転に頑健なブラインド音源分離,” 日本音響学会講演論文集, 1-Q-23, Sep. 2022.
Taishi Nakashima, Yukoh Wakabayashi, and Nobutaka Ono, “Rotation-robust blind source separation with a circular microphone array using sound field interpolation,” Proceedings of the Acoustical Society of Japan, 1-Q-23, Sep. 2022.
- 堀井こはる, 福田芽衣子, 太田健吾, 西村良太, 小川厚徳, 北岡教英, “End-to-End非流暢整形音声認識システムの対話音声による評価,” 日本音響学会講演論文集, 2-8-5, Sep., 2022.
Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, and Norihide Kitaoka, “Evaluation of an end-to-end speech recognition system with disfluency removal on dialogue speech,” Proceedings of the Acoustical Society of Japan, 2-8-5, Sep. 2022.
- 森大輝, 太田健吾, 西村良太, 北岡教英, “ドメイン外音響情報で補強したEncoder-Decoder音声認識モデルの設計,” 日本音響学会講演論文集, 2-Q-26, Sep., 2022.
Daiki Mori, Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Design of an Encoder-Decoder speech recognition model augmented with out-of-domain acoustic information,” Proceedings of the Acoustical Society of Japan, 2-Q-26, Sep. 2022.
- 坪倉和哉, 入部百合絵,北岡教英, “マルチモーダル対話システムにおける対話破綻時のユーザの個人差,” 日本音響学会講演論文集, 3-Q-13, Sep., 2022.
Kazuya Tsubokura, Yurie Iribe, and Norihide Kitaoka, “Individual user differences during dialogue breakdown in a multimodal dialogue system,” Proceedings of the Acoustical Society of Japan, 3-Q-13, Sep. 2022.
- 岡田智哉, 入部百合絵, 北岡教英, “BERTを用いた雑談対話音声からの認知症疑い検出,” 日本音響学会講演論文集, 3-Q-29, Sep., 2022.
Tomoya Okada, Yurie Iribe, and Norihide Kitaoka, “Detection of suspected dementia from chat dialog speech using BERT,” Proceedings of the Acoustical Society of Japan, 3-Q-29, Sep. 2022.
- 福田 芽衣子,杉山 雅和,西村 良太,入部 百合絵,山本 一公,北岡 教英, “超高齢者コーパスとS-JNASを用いた高齢者音声の音響的特徴の分析,” 日本音響学会講演論文集, 3-Q-32, Sep., 2022.
Meiko Fukuda, Masakazu Sugiyama, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto and Norihide Kitaoka, “Analysis of acoustic features of elderly speech using a corpus of very elderly people and S-JNAS,” Proceedings of the Acoustical Society of Japan, 3-Q-32, Sep. 2022.
- 丸山由華,入部百合絵, 北岡教英, 横井克典,勝野雅央, “パーキンソン病の重症度に基づく音響的特徴量の分析,” 日本音響学会講演論文集, 3-Q-43, Sep., 2022.
Yuka Maruyama, Yurie Iribe, Norihide Kitaoka, Katsunori Yokoi, and Masahisa Katsuno, “Analysis of acoustic features based on severity of Parkinson’s disease,” Proceedings of the Acoustical Society of Japan, 3-Q-43, Sep. 2022.
- 丸山 由華,入部 百合絵,北岡 教英,横井 克典,勝野 雅央, “音響情報を用いた短い発話音声からのパーキンソン病検出,” 日本音響学会講演論文集, 2-3P-10, Mar., 2022.
Yuka Maruyama, Yurie Iribe, Norihide Kitaoka, Katsunori Yokoi, Masahisa Katsuno, “Parkinson’s disease detection from short speech utterances using acoustic information,” Proceedings of the Acoustical Society of Japan, 2-3P-10, Mar. 2022.
- 森 大輝,太田 健吾,西村 良太,小川厚徳, 北岡 教英, “タスク外音響情報を付加したEnd-to-End音声認識モデルの設計,” 日本音響学会講演論文集, 2-3Q-2, Mar., 2022.
Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, and Norihide Kitaoka, “Design of an End-to-End Speech Recognition Model with Extra-Task Acoustic Information,” Proceedings of the Acoustical Society of Japan, 2-3Q-2, Mar. 2022.
- 堀井 こはる,福田 芽衣子,太田 健吾,西村 良太,小川厚徳,北岡 教英, “非流暢ラベルを用いた言い淀み整形End-to-End音声認識,” 日本音響学会講演論文集, 1-3-5, Mar., 2022.
Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “Hesitation-removing end-to-end speech recognition using disfluency labels,” Proceedings of the Acoustical Society of Japan, 1-3-5, Mar. 2022.
2021
Journal Papers
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation,” EURASIP Journal on Audio, Speech, and Music Processing, 2021:42, 20 pages, Dec., 2021. (DOI: 10.1186/s13636-021-00225-4)
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Norihide Kitaoka, “Normalization of Transliterated Mongolian Words Using Seq2Seq Model with Limited Data,” ACM Transactions on Asian and Low-Resource Language Information Processing, No. 103, pp. 1-19, Nov., 2021.
- Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “Response Type Selection for Chat-like Spoken Dialog Systems Based on LSTM and Multi-task Learning,” Speech Communication, vol. 133, pp. 23-30, Oct., 2021.
- 石原颯人,入部百合絵,北岡教英, “係り受け距離に着目した雑談対話からの認知症疑い検出,” 電子情報通信学会論文誌D, Vol.J104-D,No.04, pp. 357-367, Apr. 2021.
Hayato Ishihara, Yurie Iribe and Norihide Kitaoka, “Detection of Suspected Dementia from Chat Dialogues Focusing on Dependency Distance,” IEICE Transactions D, Vol. J104-D, No. 04, pp. 357-367, Apr. 2021.
- Norihide Kitaoka, Bohan Chen, Yuya Obashi, “Dynamic out-of-vocabulary word registration to language model for speech recognition,” EURASIP Journal on Audio, Speech, and Music Processing, 2021:4, (8 pages), 2021. (DOI: 10.1186/s13636-020-00193-1)
International Conferences
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “Multi-speaker TTS system for low-resource language using cross-lingual transfer learning and data augmentation,” Proc. APSIPA ASC 2021, pp. 849-853, 2021.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “Advanced language model fusion method for encoder-decoder model in Japanese speech,” Proc. APSIPA ASC 2021, pp. 503-510, 2021.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “End-to-end spontaneous speech recognition using hesitation labeling,” Proc. APSIPA ASC 2021, pp. 1077-1081, 2021.
- Akio Kobayashi, Keiichi Yasu, Hiromitsu Nishizaki, Norihide Kitaoka, “Corpus Design and Automatic Speech Recognition for Deaf and Hard-Of-Hearing People,” GCCE2021, pp. 17-18, Oct., 2021.
Explanation
- 大須賀 晋, 田中 五大, 鍋倉 彩那, 藤井 宏行, 中野 涼太, 渡邊 凌太, TELYUKA, 太田 健吾, 西村 良太, 北岡 教英, “次世代の移動を支えるマルチモーダルエージェント“Saya”,” 自動車技術, Vol. 75, No. 9, pp. 109-109, Sep. 2021.
Susumu Ohsuga, Godai Tanaka, Ayana Nabekura, Hiroyuki Fujii, Ryota Nakano, Ryota Watanabe, TELYUKA, Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “Multimodal Agent ‘Saya’ for Next Generation Mobility,” Automotive Technology, Vol. 75, No. 9, pp. 109-109, Sep. 2021.
Domestic Conferences
- 堀井 こはる,福田 芽衣子,太田 健吾,西村 良太,北岡 教英, “言い淀みを考慮したEnd-to-End音声認識,” 第19回情報学ワークショップ(WiNF2021), S-5-2, Nov. 2021.
Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “End-to-End Speech Recognition Considering Hesitations,” 19th Workshop on Informatics (WiNF2021), S-5-2, Nov. 2021.
- 木内 貴浩, 北岡 教英, “発話整形した対話履歴を用いた雑談応答生成システム,” 第19回情報学ワークショップ(WiNF2021), S-5-3, Nov. 2021.
Takahiro Kinouchi and Norihide Kitaoka, “A chat response generation system using speech-formatted dialogue history,” 19th Workshop on Informatics (WiNF2021), S-5-3, Nov. 2021.
- 森 大輝,太田 健吾,西村 良太,北岡 教英, “Encoder-Decoder音声認識モデルにおける暗黙的言語情報の置換法,” 第19回情報学ワークショップ(WiNF2021), S-5-5, Nov. 2021.
Daiki Mori, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “A replacement method for implicit linguistic information in the Encoder-Decoder speech recognition model,” 19th Workshop on Informatics (WiNF2021), S-5-5, Nov. 2021.
- 森 大輝,太田 健吾,西村 良太,小川 厚徳,北岡 教英, “Encoder-Decoder音声認識モデルにおける暗黙的言語情報の置換法,” 日本音響学会講演論文集, 1-3-1, Sep., 2021.
Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “A method for replacing implicit linguistic information in the Encoder-Decoder speech recognition model,” Proceedings of the Acoustical Society of Japan, 1-3-1, Sep. 2021.
- 堀井 こはる,福田 芽衣子,太田 健吾,西村 良太,北岡 教英, “言い淀みを考慮した自由発話のEnd-to-End音声認識,” 日本音響学会講演論文集, 1-3-3, Sep., 2021.
Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “End-to-End Speech Recognition of Free Speech Considering Hesitations,” Proceedings of the Acoustical Society of Japan, 1-3-3, Sep. 2021.
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “Cross-lingual, multi-speaker text-to-speech synthesis for low resource languages,” 日本音響学会講演論文集, 1-3-7, Sep., 2021.
Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “Cross-lingual, multi-speaker text-to-speech synthesis for low resource languages,” Proceedings of the Acoustical Society of Japan, 1-3-7, Sep. 2021.
- Narangerel Purevdorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “How language similarity affects the Mongolian ASR using cross-lingual transfer learning,” 日本音響学会講演論文集, 2-3-7, Sep., 2021.
Narangerel Purevdorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “How language similarity affects the Mongolian ASR using cross-lingual transfer learning,” Proceedings of the Acoustical Society of Japan, 2-3-7, Sep. 2021.
- 小林 彰夫,大西 淳児,西崎 博光,北岡 教英, “読み上げ文を対象としたEnd-to-End音声点訳,” 日本音響学会講演論文集, 2-3P-3, Sep., 2021.
Akio Kobayashi, Junji Onishi, Hiromitsu Nishizaki, and Norihide Kitaoka, “End-to-End Speech-to-Braille Translation for Read Speech,” Proceedings of the Acoustical Society of Japan, 2-3P-3, Sep. 2021.
- 福田 芽衣子,西村 良太,西崎 博光,入部 百合絵,山本 一公,北岡 教英, “超高齢者音声コーパスEARSにおける超高齢者の音響的特徴,” 日本音響学会講演論文集, 2-3P-11, Sep., 2021.
Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Yurie Iribe, Kazumasa Yamamoto and Norihide Kitaoka, “Acoustic features of the very elderly in the EARS speech corpus of the very elderly,” Proceedings of the Acoustical Society of Japan, 2-3P-11, Sep. 2021.
- 西村 良太,森 貴大,北岡 教英, “ROSを利用したリアルタイム制御が可能な音声対話システムの構築,” 日本音響学会講演論文集, 2-3Q-4, Sep., 2021.
Ryota Nishimura, Takahiro Mori and Norihide Kitaoka, “Construction of a spoken dialogue system with real-time control using ROS,” Proceedings of the Acoustical Society of Japan, 2-3Q-4, Sep. 2021.
- 北岡 教英,西村 良太,太田 健吾,石川 晃之,石川 友香,中野 涼太,田中 五大,鍋倉彩那,佐藤 辰耶,渡邊 凌太,大須賀 晋, “3D CGエージェントSayaとの対話における応答制御,” 日本音響学会講演論文集, 3-3-14, Sep., 2021.
Norihide Kitaoka, Ryota Nishimura, Kengo Ohta, Teruyuki Ishikawa, Yuka Ishikawa, Ryota Nakano, Godai Tanaka, Ayana Nabekura, Tatsuya Sato, Ryota Watanabe and Susumu Ohsuga, “Response control in dialogue with the 3D CG agent Saya,” Proceedings of the Acoustical Society of Japan, 3-3-14, Sep. 2021.
- 横井 克典, 坪井 崇, 服部 誠, 佐竹 勇紀, 平賀 経太, 田中 康博, 佐藤 茉紀, 堀 明洋, 入部 百合絵, 北岡 教英, 勝野 雅央, “パーキンソン病患者の音読と会話の自然言語処理,” パーキンソン病・運動障害疾患コングレスプログラム・抄録集 15回 p. 81, Jul., 2021.
Katsunori Yokoi, Takashi Tsuboi, Makoto Hattori, Yuki Satake, Keita Hiraga, Yasuhiro Tanaka, Maki Sato, Akihiro Hori, Yurie Iribe, Norihide Kitaoka, Masahisa Katsuno, “Natural language processing of reading aloud and conversation in patients with Parkinson’s disease,” Program and Abstracts of the 15th Parkinson’s Disease and Movement Disorders Congress, p. 81, Jul. 2021.
- 森大輝,太田健吾,西村良太,小川厚徳,北岡教英, “End-to-end音声認識モデルにおける暗黙的言語情報の置換法,” 音学シンポジウム, Jun., 2021.
Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “A replacement method for implicit linguistic information in end-to-end speech recognition models,” Acoustics Symposium, Jun. 2021.
- 堀井こはる,福田芽衣子,太田健吾,西村良太,北岡教英, “言い淀みラベル付けによる非流暢発話のEnd-to-End音声認識,” 音学シンポジウム, Jun., 2021.
Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “End-to-End Speech Recognition of Non-Fluent Speech by Hesitation Labeling,” Acoustics Symposium, Jun. 2021.
- 北岡 教英,西村 良太,太田 健吾,石川 晃之,石川 友香(TELYUKA),中野 涼太,田中 五大,鍋倉彩那,佐藤 辰耶,渡邊 凌太,大須賀 晋, “フォトリアルCGエージェントとのマルチモーダル対話システムの構築,” 日本音響学会講演論文集, 1-2-6, Mar., 2021.
Norihide Kitaoka, Ryota Nishimura, Kengo Ohta, Teruyuki Ishikawa, Yuka Ishikawa (TELYUKA), Ryota Nakano, Godai Tanaka, Ayana Nabekura, Tatsuya Sato, Ryota Watanabe, Susumu Ohsuga, “Construction of a multimodal dialogue system with photorealistic CG agents,” Proceedings of the Acoustical Society of Japan, 1-2-6, Mar. 2021.
- 太田 健吾,西村 良太,北岡 教英, “アクセント句を考慮した日本語End-to-End音声合成サーバの構築,” 日本音響学会講演論文集, 1-2-7, Mar., 2021.
Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “Construction of a Japanese End-to-End Speech Synthesis Server Considering Accent Phrases,” Proceedings of the Acoustical Society of Japan, 1-2-7, Mar. 2021.
- 小林 彰夫,安 啓一,西崎 博光,北岡 教英, “聴覚障害者の音声データの収集と音素認識による評価,” 日本音響学会講演論文集, 2-2-4, Mar., 2021.
Akio Kobayashi, Keiichi Yasu, Hiromitsu Nishizaki, Norihide Kitaoka, “Collection of speech data of hearing-impaired people and evaluation by phoneme recognition,” Proceedings of the Acoustical Society of Japan, 2-2-4, Mar. 2021.
- 下笠 元暉,西崎 博光,北岡 教英, “超高齢者音声認識のためのCycleGANを用いたデータ拡張,” 日本音響学会講演論文集, 2-2P-6, Mar., 2021.
Motoki Shimogasa, Hiromitsu Nishizaki and Norihide Kitaoka, “Data augmentation using CycleGAN for speech recognition of very elderly people,” Proceedings of the Acoustical Society of Japan, 2-2P-6, Mar., 2021.
- Narangerel Purevdorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “Building low resource speech recognizer: Transfer learning and data augmentation,” 日本音響学会講演論文集, 3-2-9, Mar., 2021.
Narangerel Purevdorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “Building low resource speech recognizer: Transfer learning and data augmentation,” Proceedings of the Acoustical Society of Japan, 3-2-9, Mar. 2021.
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “Text to speech system for low resource languages by cross-lingual transfer learning and data augmentation,” 日本音響学会講演論文集, 3-2-10, Mar., 2021.
Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “Text to speech system for low resource languages by cross-lingual transfer learning and data augmentation,” Proceedings of the Acoustical Society of Japan, 3-2-10, Mar., 2021.
2020
Journal Papers
- Jiahao Chen, Ryota Nishimura, Norihide Kitaoka, “End-to-End Recognition of Streaming Japanese Speech Using CTC and Local Attention,” APSIPA Transactions on Signal and Information Processing, vol. 9, e25, pp. 1-7, 2020.
- Norihide Kitaoka, Eichi Seto, Ryota Nishimura, “Example phrase adaptation method for customized, example-based dialog system using user data and distributed word representations,” IEICE Trans. Inf. & Syst., Vol. E103-D, No. 11, pp. 2332-2339, Nov., 2020.
International Conferences
- Chee Siang Leow, Tomoaki Hayakawa, Hiromitsu Nishizaki, Norihide Kitaoka, “Development of a Low-Latency and Real-Time Automatic Speech Recognition System,” GCCE2020, pp. 464-467, Oct., 2020.
- Meiko Fukuda, Hiromitsu Nishizaki, Yurie Iribe, Ryota Nishimura, Norihide Kitaoka, “Improving speech recognition for the elderly: A new corpus of elderly Japanese speech and investigation of acoustic modeling for speech recognition,” Proc. LREC2020, 9 pages, Jun, 2020.
- Jiahao Chen, Ryota Nishimura, Norihide Kitaoka, “E2E Streaming Speech Recognition Using CTC and Local Attention,” Proc. NCSP’20, 4 pages, Mar. 2020.
Domestic Conferences
- 梅澤舞菜, 入部百合絵, 北岡教英, “音声言語情報に基づいた認知症高齢者の判別,” 信学技報(SP2020-12, WIT2020-12), 6 pages, Oct. 2020.
Maina Umezawa, Yurie Iribe, and Norihide Kitaoka, “Discrimination of elderly people with dementia based on spoken language information,” IEICE Technical Report (SP2020-12, WIT2020-12), 6 pages, Oct. 2020.
- 福田芽衣子, 入部百合絵, 西崎博光, 山本一公, 西村良太, 北岡教英, “超高齢者音声コーパスEARSの構築と音声認識へ利用の予備的検討,” 情処研報, Vol.2020-SLP-133 No.6, pp. 1-6, Oct. 2020.
Meiko Fukuda, Yurie Iribe, Hiromitsu Nishizaki, Kazumasa Yamamoto, Ryota Nishimura, and Norihide Kitaoka, “Construction of EARS, a speech corpus for very elderly people, and preliminary study of its use for speech recognition,” IPSJ SIG Technical Report, Vol. 2020-SLP-133 No. 6, pp. 1-6, Oct. 2020.
- レオ チーシャン,西崎 博光,北岡 教英, “Kaldiベースの低遅延リアルタイム音声認識システムの開発と評価,” 日本音響学会講演論文集, 2-P1-3, pp. ???-???, Sep., 2020.
Chee Siang Leow, Hiromitsu Nishizaki and Norihide Kitaoka, “Development and evaluation of a Kaldi-based low-latency real-time speech recognition system,” Proceedings of the Acoustical Society of Japan, 2-P1-3, pp. ???-???, Sep. 2020.
- 鈴木 海斗, 入部 百合絵,北岡 教英, “顔表情と音響情報を用いた対話破綻検出,” 日本音響学会講演論文集, 2-P1-4, pp. ???-???, Sep., 2020.
Kaito Suzuki, Yurie Iribe, and Norihide Kitaoka, “Dialogue Breakdown Detection Using Facial Expression and Acoustic Information,” Proceedings of the Acoustical Society of Japan, 2-P1-4, pp. ???-???, Sep. 2020.
- 山崎 大河,西村 良太,北岡 教英, “感情表現が可能なEnd-to-End日本語音声合成システムの構築,” 日本音響学会講演論文集, 2-P1-2, pp. ???-???, Sep., 2020.
Taiga Yamazaki, Ryota Nishimura and Norihide Kitaoka, “Construction of an End-to-End Japanese Speech Synthesis System Capable of Expressing Emotions,” Proceedings of the Acoustical Society of Japan, 2-P1-2, pp. ???-???, Sep. 2020.
- 石原 颯人,入部 百合絵,北岡 教英, “文章の複雑さを考慮した雑談対話音声からの認知症傾向検出,” 日本音響学会講演論文集, 2-P1-2, pp. ???-???, Sep., 2020.
Hayato Ishihara, Yurie Iribe, and Norihide Kitaoka, “Detection of Dementia Tendency from Chat Dialogue Speech Considering Sentence Complexity,” Proceedings of the Acoustical Society of Japan, 2-P1-2, pp. ???-???, Sep. 2020.
- Byambadorj Zolzaya, Ryota Nishimura, Ayush Altangerel, Norihide Kitaoka, “Normalization of transliterated words using seq2seq model with spell checker,” 言語処理学会第26回年次大会, E5-3, pp.1133-1136, Mar. 2020.
Byambadorj Zolzaya, Ryota Nishimura, Ayush Altangerel, Norihide Kitaoka, “Normalization of transliterated words using seq2seq model with spell checker,” 26th Annual Meeting of the Association for Natural Language Processing, E5-3, pp. 1133-1136, Mar. 2020.
- 陳 家浩,西村 良太,北岡 教英, “Uni-directional LSTM と Local Attentionを用いたストリーミング音声認識,” 日本音響学会講演論文集, 2-Q-12, pp. 943-946, Mar., 2020.
Jiahao Chen, Ryota Nishimura and Norihide Kitaoka, “Streaming Speech Recognition Using Uni-directional LSTM and Local Attention,” Proceedings of the Acoustical Society of Japan, 2-Q-12, pp. 943-946, Mar. 2020.
- 福田 芽衣子,西崎 博光,入部 百合絵,西村 良太,北岡 教英, “高齢者音声コーパス構築と音声認識への年齢・方言の影響の分析,” 日本音響学会講演論文集, 2-Q-13, pp. 947-950, Mar., 2020.
Meiko Fukuda, Hiromitsu Nishizaki, Yurie Iribe, Ryota Nishimura and Norihide Kitaoka, “Construction of a speech corpus for the elderly and analysis of age and dialect effects on speech recognition,” Proceedings of the Acoustical Society of Japan, 2-Q-13, pp. 947-950, Mar. 2020.
- 小橋 優矢,西村 良太,北岡 教英, “書き言葉から話し言葉へのテキスト変換を用いた話し言葉音声認識用言語モデルの評価,” 日本音響学会講演論文集, 2-Q-13, pp. 951-954, Mar., 2020.
Yuya Kobashi, Ryota Nishimura and Norihide Kitaoka, “Evaluation of a Language Model for Spoken Language Recognition Using Text Conversion from Written to Spoken Language,” Proceedings of the Acoustical Society of Japan, 2-Q-13, pp. 951-954, Mar. 2020.
- 森 雷太,西村 良太,北岡 教英, “自由発話に対応した照応解析を備えた音声対話システム,” 日本音響学会講演論文集, 3-P-13, pp. 1023-1026, Mar., 2020.
Raita Mori, Ryota Nishimura, and Norihide Kitaoka, “Spoken dialogue system with anaphora resolution for spontaneous speech,” Proceedings of the Acoustical Society of Japan, 3-P-13, pp. 1023-1026, Mar. 2020.
- 清原侃太,西村良太,北岡教英, “音声と指差しの統合理解によるマルチモーダル幾何問題解答システムの構築,” 情報処理学会第82回全国大会, 5F-03, pp. 4-5 – 4-6, Mar., 2020.
Kanta Kiyohara, Ryota Nishimura, and Norihide Kitaoka, “Construction of a Multimodal Geometric Problem Solving System by Integrated Understanding of Speech and Pointing,” IPSJ 82nd National Convention, 5F-03, pp. 4-5 – 4-6, Mar. 2020.
- 石原颯人, 入部百合絵, 北岡教英, “語彙と係り受け構造に着目した雑談対話からの認知症傾向検出,” 情報処理学会第82回全国大会, 5ZE-03, pp. 4-459 – 4-460, Mar., 2020.
Hayato Ishihara, Yurie Iribe, and Norihide Kitaoka, “Dementia tendency detection from chat dialogues focusing on vocabulary and dependency structure,” IPSJ 82nd National Convention, 5ZE-03, pp. 4-459 – 4-460, Mar. 2020.
Book Chapters
- Norihide Kitaoka, Takuma Nakagawa, Ryota Nishimura, Yoshio Ishiguro, Shin’ichi Kojima and Shin Ohsuga, “A multimodal control system for autonomous vehicles using speech, gesture and gaze recognition,” pp. 101-111, in Vehicles, Drivers, and Safety, De Gruyter, 2020.
2019
Invited talk
- Norihide Kitaoka, “Spoken and multimodal interfaces: Interaction systems with machines,” ICAICTA2019 (Keynote speech), Sep 2019.
International Conferences
- Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Yurie Iribe, Norihide Kitaoka, “A new corpus of elderly Japanese speech for acoustic modeling, and a preliminary investigation of dialect-dependent speech recognition,” Proc. Oriental-COCOSDA2019, 6 pages, Oct., 2019. (Best paper award)
- Akihisa Komatsu, Ryota Nishimura, Norihide Kitaoka, “Environmental sounds recognition with convolutional-LSTM,” GCCE2019, pp. 717-719, 2019.
- Yuya Kobashi, Ryota Nishimura, Norihide Kitaoka, “Automatic conversion of written language into spoken language using a sequence-to-sequence model trained with a parallel corpus,” Proc. Oriental-COCOSDA2019, 5 pages, Oct., 2019.
- Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, Norihide Kitaoka, “Small-footprint magic word detection method using convolutional LSTM neural network,” Proc. INTERSPEECH2019, pp. 2035-2039, Sep. 2019.
Domestic Conferences
- 小橋 優矢,西村 良太,北岡 教英, “Sequence-to-Sequence model を用いた話し言葉音声認識用言語モデルのための書き言葉から話し言葉へのテキスト変換,” 日本音響学会講演論文集, 1-3-8, pp. 807-810, Sep., 2019.
Yuya Kobashi, Ryota Nishimura, and Norihide Kitaoka, “Text conversion from written to spoken language for a language model for spoken-language speech recognition using a sequence-to-sequence model,” Proceedings of the Acoustical Society of Japan, 1-3-8, pp. 807-810, Sep. 2019.
- 陳 家浩,西村 良太,北岡 教英, “CTCとAttentionを用いたEnd-to-endストリーミング音声認識,” 日本音響学会講演論文集, 1-P-16, pp. 871-874, Sep., 2019.
Jiahao Chen, Ryota Nishimura, and Norihide Kitaoka, “End-to-end streaming speech recognition using CTC and Attention,” Proceedings of the Acoustical Society of Japan, 1-P-16, pp. 871-874, Sep. 2019.
- 福田 芽衣子,西村 良太,西崎 博光,入部 百合絵,北岡 教英, “高齢者音声認識のための音声コーパス構築と方言への適応の効果,” 日本音響学会講演論文集, 1-P-17, pp. 875-878, Sep., 2019.
Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Yurie Iribe, and Norihide Kitaoka, “Speech corpus construction for elderly speech recognition and the effect of adaptation to dialects,” Proceedings of the Acoustical Society of Japan, 1-P-17, pp. 875-878, Sep. 2019.
- 山本 泰暉,西村 良太,三崎 正之,北岡 教英, “Convolutional LSTMを用いた省メモリMagic Word検出,” 日本音響学会講演論文集, 2-3-4, pp. 819-822, Sep., 2019.
Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, and Norihide Kitaoka, “Memory-saving Magic Word detection using Convolutional LSTM,” Proceedings of the Acoustical Society of Japan, 2-3-4, pp. 819-822, Sep. 2019.
- 小松 明久,西村 良太,北岡 教英, “CNNとCLSTMを用いた環境音認識,” 日本音響学会講演論文集, 2-Q-17, pp. 925-928, Sep., 2019.
Akihisa Komatsu, Ryota Nishimura, and Norihide Kitaoka, “Environmental sound recognition using CNN and CLSTM,” Proceedings of the Acoustical Society of Japan, 2-Q-17, pp. 925-928, Sep. 2019.
- 秋水紫苑, 入部百合絵, 北岡教英, “非言語情報を用いた対話システムにおける対話破綻の検出,” 情報処理学会第81回全国大会, 2T-08, pp. 2-365-2-366, Mar., 2019.
Shion Akimizu, Yurie Iribe, and Norihide Kitaoka, “Detecting dialogue breakdowns in dialogue systems using non-verbal information,” IPSJ 81st National Convention, 2T-08, pp. 2-365-2-366, Mar. 2019.
- 梅澤舞菜, 入部百合絵, 北岡教英, “方言を考慮した音声言語情報に基づく高齢者認知症傾向の検出,” 情報処理学会第81回全国大会, 4ZE-07, pp. 4-463-4-464, Mar., 2019.
Maina Umezawa, Yurie Iribe, and Norihide Kitaoka, “Detection of Dementia Tendency in the Elderly Based on Spoken Language Information Considering Dialect,” IPSJ 81st National Convention, 4ZE-07, pp. 4-463-4-464, Mar. 2019.
- 梅原 靖之,西村 良太,北岡 教英, “様々な対話戦略を統合した音声対話システムの構築法,” 日本音響学会講演論文集, 2-P-1, pp. 945-948, Mar., 2019.
Yasuyuki Umehara, Ryota Nishimura, and Norihide Kitaoka, “A method for constructing a spoken dialogue system integrating various dialogue strategies,” Proceedings of the Acoustical Society of Japan, 2-P-1, pp. 945-948, Mar. 2019.
- 梶並 和明,西村 良太,入部 百合絵,北岡 教英, “音声対話破綻検出手法の開発に向けた音声対話データ収録システム,” 日本音響学会講演論文集, 2-P-2, pp. 949-952, Mar., 2019.
Kazuaki Kajinami, Ryota Nishimura, Yurie Iribe, and Norihide Kitaoka, “A Spoken Dialogue Data Recording System for the Development of a Spoken Dialogue Breakdown Detection Method,” Proceedings of the Acoustical Society of Japan, 2-P-2, pp. 949-952, Mar. 2019.
- 太田 健吾,西村 良太,北岡 教英, “LSTM によるマルチタスク学習を用いた雑談音声対話システムの応答種別選択,” 日本音響学会講演論文集, 2-P-3, pp. 953-956, Mar., 2019.
Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Response type selection for a chatting spoken dialogue system using multi-task learning with LSTM,” Proceedings of the Acoustical Society of Japan, 2-P-3, pp. 953-956, Mar. 2019.
- 清原 侃太,西村 良太,北岡 教英, “幾何問題における音声と指差しを用いた学習支援システムの構築とその評価,” 日本音響学会講演論文集, 2-P-17, pp. 989-992, Mar., 2019.
Kanta Kiyohara, Ryota Nishimura, and Norihide Kitaoka, “Construction and evaluation of a learning support system using voice and pointing in geometry problems,” Proceedings of the Acoustical Society of Japan, 2-P-17, pp. 989-992, Mar. 2019.
2018
Journal Papers
- Ryota Nishimura, Daisuke Yamamoto, Takahiro Uchiya, Ichi Takumi, “Web-based environment for user generation of spoken dialog for virtual assistants,” EURASIP Journal on Audio, Speech, and Music Processing, pp. 1-13, 2018.
- Ryota Nishimura, Daisuke Yamamoto, Takahiro Uchiya, Ichi Takumi, “MMDAE: Dialog scenario editor for MMDAgent on the web browser,” ICT Express, pp. 1-5, 2018. (In Press)
- Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Tomoki Toda, Kazuya Takeda, “Daily Activity Recognition with Large-scaled Real-life Recording Datasets Based on Deep Neural Network using Multi-modal Signals,” IEICE Trans. Fundamentals, Vol. E101-A, No. 1, pp. 199-210, Jan. 2018.
Letters
- 西村良太, 長尾拓海, 一万田郁仁, 北岡教英, “高齢者の音声知覚特性に基づいた音声の明瞭化加工法の研究,” 日本知能情報ファジィ学会誌, Vol. 30, No. 6, pp. 840-845, Dec., 2018.
Ryota Nishimura, Takumi Nagao, Ikujin Ichimanda, and Norihide Kitaoka, “A study of a speech clarification method based on the speech perception characteristics of elderly people,” Journal of Japan Society for Fuzzy Theory and Intelligent Informatics, Vol. 30, No. 6, pp. 840-845, Dec. 2018.
- 西村良太, 檜垣美帆, 北岡教英, “RNN-LSTMによる音響ベクトル空間と文書ベクトル空間とのマッピング,” 日本知能情報ファジィ学会誌, Vol. 30, No. 4, pp. 628-633, Aug., 2018.
Ryota Nishimura, Miho Higaki, and Norihide Kitaoka, “Mapping between Acoustic Vector Space and Document Vector Space by RNN-LSTM,” Journal of Japan Society for Fuzzy Theory and Intelligent Informatics, Vol. 30, No. 4, pp. 628-633, Aug. 2018.
- Norihide Kitaoka, Shuhei Segawa, Ryota Nishimura, Kazuya Takeda, “Recognizing emotions from speech using a physical model,” Acoustical Science and Technology, Vol. 39, Issue 2, pp. 167-170, Feb., 2018. (doi: 10.1250/ast.39.167)
Invited talk
- Norihide Kitaoka, Yurie Iribe, Hiromitsu Nishizaki, “Construction of a corpus of elderly Japanese speech for analysis and recognition,” LREC2018, May 2018.
International Conferences
- Eichi Seto, Ryota Nishimura, Norihide Kitaoka, “Customization of an example-based dialog system with user data and distributed word representations,” Proc. APSIPA2018, 7 pages, Nov. 2018.
- Ryota Nishimura, Miho Higaki, Norihide Kitaoka, “Mapping acoustic vector space and document vector space by RNN-LSTM,” 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018, pp.296-297, 2018.
- Meiko Fukuda, Ryota Nishimura, Norihide Kitaoka, Hiromitsu Nishizaki, Yurie Iribe, “Construction of a corpus for elderly Japanese speech recognition,” 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018, pp.652-653, 2018.
- Kanta Kiyohara, Ryota Nishimura, Norihide Kitaoka, “Multi-modal geometry tutoring system using speech and touchscreen figure tracing,” 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018, pp.225-226, 2018.
- Norihide Kitaoka, Takuma Nakagawa, Ryota Nishimura, Yoshio Ishiguro, Shin’ichi Kojima, Shin Ohsuga, “A multimodal control system for autonomous vehicles using speech, gesture, and gaze recognition,” DSP in Vehicles 2018, (no paper), 2018.
- Kazuaki Kajinami, Ryota Nishimura, Norihide Kitaoka, “Construction of dialog database for development of spoken dialog breakdown detection methods,” in ICAICTA-2018, pp.1-5, 2018.
Domestic Conferences
- 太田健吾, 西村良太, 北岡教英, “発話の分散表現に基づく雑談音声対話システムの応答種別選択,” 音声言語シンポジウム, 信学技報, SP2017-55, pp. 1-5, Dec. 2018.
Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Response type selection for a chatting spoken dialogue system based on distributed representation of utterances,” Spoken Language Symposium, IEICE Technical Report, SP2017-55, pp. 1-5, Dec. 2018.
- 下笠 元暉,西崎 博光,福田 芽衣子,西村 良太,北岡 教英, “超高齢者の自然発話音声のための音声認識モデルの検討,” 日本音響学会講演論文集, 1-R-10, pp. 977-978, Mar., 2018.
Motohki Shimogasa, Hiromitsu Nishizaki, Meiko Fukuda, Ryota Nishimura, and Norihide Kitaoka, “A study of speech recognition models for spontaneous speech of very elderly people,” Proceedings of the Acoustical Society of Japan, 1-R-10, pp. 977-978, Mar. 2018.
- 山本 泰暉,西村 良太,三崎 正之,北岡 教英, “LSTM Neural Network を用いた連続発話中のMagic Word検出手法,” 日本音響学会講演論文集, 1-R-21, pp. 1009-1012, Mar., 2018.
Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, and Norihide Kitaoka, “Magic Word detection method in continuous speech using LSTM Neural Network,” Proceedings of the Acoustical Society of Japan, 1-R-21, pp. 1009-1012, Mar. 2018.
- 小橋 優矢,西村 良太,北岡 教英, “Twitter中の使用単語の変化に基づく未知語の発見とそれに基づく音声認識用言語モデルの適応,” 日本音響学会講演論文集, 1-R-24, pp. 1017-1020, Mar., 2018.
Yuya Kobashi, Ryota Nishimura and Norihide Kitaoka, “Discovery of unknown words based on changes in words used in Twitter and adaptation of language models for speech recognition based on them,” Proceedings of the Acoustical Society of Japan, 1-R-24, pp. 1017-1020, Mar. 2018.
- 西村良太, 檜垣美帆, 北岡教英, “RNNに基づく音響ベクトル時系列の文書ベクトルへのマッピング,” 信学技報 (PRMU2018-32, SP2018-12), 2018.
Ryota Nishimura, Miho Higaki, and Norihide Kitaoka, “Mapping of Acoustic Vector Time Series to Document Vectors Based on RNNs,” IEICE Technical Report (PRMU2018-32, SP2018-12), 2018.
- 田端芳樹, 河内亮周, “最悪時入力標本器に対する確率的回路の識別不可能性難読化器,” 暗号と情報セキュリティシンポジウム (SCIS), 1B1-5, Jan., 2018.
Yoshiki Tabata and Akinori Kawachi, “A probabilistic circuit indistinguishability obfuscator for worst-case input samplers,” Symposium on Cryptography and Information Security (SCIS), 1B1-5, Jan. 2018.
- 清原侃太,西村良太,北岡教英, “指差しと口述説明を理解する幾何学問題学習支援システム,” FIT-2018, J-011, (2 pages), Mar., 2018.
Kanta Kiyohara, Ryota Nishimura, and Norihide Kitaoka, “A Geometry Problem Learning Support System That Understands Pointing and Oral Explanation,” FIT-2018, J-011, (2 pages), Mar. 2018.
- 西村良太, 陳 伯翰, 北岡教英, “音声認識における言語モデルへの未知語登録法の検討,” 日本音響学会講演論文集, 1-Q-23, pp. 127-130, Mar., 2018.
Ryota Nishimura, Bohan Chen, and Norihide Kitaoka, “A method for registering unknown words to language models in speech recognition,” Proceedings of the Acoustical Society of Japan, 1-Q-23, pp. 127-130, Mar. 2018.
- 太田健吾,西村良太,北岡教英, “単語順を考慮したLSTM-RNN に基づく雑談音声対話システムの応答種別選択,” 日本音響学会講演論文集, 2-8-7, pp. 45-48, Mar., 2018.
Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Response Type Selection for a Chatting Spoken Dialogue System Based on LSTM-RNN Considering Word Order,” Proceedings of the Acoustical Society of Japan, 2-8-7, pp. 45-48, Mar. 2018.
- 瀬戸栄地,西村良太,北岡教英, “単語の分散表現に基づく事例ベース雑談音声対話システムのユーザ適応,” 日本音響学会講演論文集, 2-8-8, pp. 49-52, Mar., 2018.
Eiji Seto, Ryota Nishimura, and Norihide Kitaoka, “User adaptation of an example-based chat spoken dialogue system based on distributed representation of words,” Proceedings of the Acoustical Society of Japan, 2-8-8, pp. 49-52, Mar. 2018.
- 中川拓磨,西村良太,入部百合絵,石黒祥生,大須賀晋,北岡教英, “自動運転車の操作におけるマルチモーダルインタラクション,” 日本音響学会講演論文集, 2-8-10, pp. 57-60, Mar., 2018.
Takuma Nakagawa, Ryota Nishimura, Yurie Iribe, Yoshio Ishiguro, Shin Osuga, and Norihide Kitaoka, “Multimodal interaction in the operation of automated vehicles,” Proceedings of the Acoustical Society of Japan, 2-8-10, pp. 57-60, Mar. 2018.
- 梶並和明,西村良太,北岡教英, “音声対話破綻検出手法の開発に向けた対話データベースの構築,” 日本音響学会講演論文集, 2-Q-14, pp. 177-180, Mar., 2018.
Kazuaki Kajinami, Ryota Nishimura, and Norihide Kitaoka, “Construction of a dialogue database for the development of a spoken dialogue breakdown detection method,” Proceedings of the Acoustical Society of Japan, 2-Q-14, pp. 177-180, Mar. 2018.
- 川島愛美,入部百合絵,北岡教英, “高齢者の対話音声から抽出した言語的・音響的特徴に基づく認知症傾向の判別,” 日本音響学会講演論文集, 2-Q-36, pp. 369-370, Mar., 2018.
Manami Kawashima, Yurie Iribe, and Norihide Kitaoka, “Discrimination of dementia tendency based on linguistic and acoustic features extracted from elderly people’s dialogue speech,” Proceedings of the Acoustical Society of Japan, 2-Q-36, pp. 369-370, Mar. 2018.
2017
Tutorial Papers
- 入部百合絵, 北岡教英 “音声認識にむけた超高齢者音声のコーパス構築,” 小特集—高齢者や視覚障害者に配慮した音環境—, 日本音響学会誌 Vol. 73, No. 5, pp. 303-310, May, 2017.
Yurie Iribe and Norihide Kitaoka, “Corpus construction of very elderly people’s speech for speech recognition,” Minor Special Issue: Sound environment for elderly and visually impaired people, Journal of the Acoustical Society of Japan, Vol. 73, No. 5, pp. 303-310, May 2017.
International Conferences
- Kengo Ohta, Rikito Marumoto, Ryota Nishimura, Norihide Kitaoka, “Selecting type of response for chat-like spoken dialogue systems based on acoustic features of user utterances,” Proc. APSIPA2017, 5 pages, Dec. 2017.
- Takahiro Uchiya, Ryota Nishimura, Takahiro Hirano, Masaru Sakurai, “Design of Reminiscence Therapy System for Elderly People with Dementia,” Proc. BWCCA2017-Workshop-RI3C-2017, pp. 844-853, Nov., 2017.
- Takuma Nakagawa, Ryota Nishimura, Yurie Iribe, Yoshio Ishiguro, Shin Osuga, Norihide Kitaoka, “A human machine interface framework for autonomous vehicle control,” Proc. GCCE 2017, pp. 411-413, Oct., 2017.
- Takahiro Uchiya, Satoshi Otake, Ryota Nishimura, Daisuke Yamamoto, Ichi Takumi, “Extraction of User Preferences based on Voice Interaction,” Proc. GCCE 2017, pp. 416-417, Oct., 2017.
- Ryota Nishimura, Takahiro Uchiya, Takahiro Hirano, Masaru Sakurai, “Proposal of Reminiscence Therapy System using Spoken Dialog to Suppress Dementia,” Proc. GCCE 2017, pp. 418-419, Oct., 2017.
- Eichi Seto, Norihide Kitaoka, “User adaptation of input-response pairs in an example-based dialog system using distributed representations of words,” Proc. ICAICTA2017, Aug., 2017.
- Akinori Kawachi, Kenichi Kawano, François Le Gall, and Suguru Tamaki, “Quantum query complexity of unitary operator discrimination,” Proc. COCOON’17, SESSION 2, Aug., 2017.
- Akinori Kawachi and Yoshiki Tabata, “On indistinguishability obfuscation of probabilistic circuits for worst-case-input subexponentially indistinguishable samplers,” The 12th International Workshop on Security (IWSEC’17), Poster Session, Aug., 2017.
Invited talk
- 北岡教英, 入部百合絵, “高齢者音声の収録・分析・認識,” 音声資源活用シンポジウム(招待講演), Sep. 2017.
Norihide Kitaoka, Yurie Iribe, “Recording, analysis and recognition of elderly speech,” Symposium on Speech Resource Utilisation (invited talk), Sep. 2017.
Domestic Conferences and Research Meetings
- 中川拓磨, 大須賀晋, 北岡教英, “音声・指差し・視線認識を用いた自動運転車とのマルチモーダルインタラクション,” 電子情報通信学会総合大会, D-14-4 (1 page), Mar., 2017.
Takuma Nakagawa, Shin Ohsuga, and Norihide Kitaoka, “Multimodal interaction with self-driving cars using voice, pointing and gaze recognition,” IEICE General Conference, D-14-4 (1 page), Mar. 2017.
- 黒川有紀, 入部百合絵, 北岡教英, “音響的特徴を利用した高齢者の認知症傾向の分析,” 日本音響学会講演論文集, 1-Q-36, pp. 313-314, Mar., 2017.
Yuki Kurokawa, Yurie Iribe, and Norihide Kitaoka, “Analysis of dementia tendency in the elderly using acoustic features,” Proceedings of the Acoustical Society of Japan, 1-Q-36, pp. 313-314, Mar. 2017.
- 澤田優希, 入部百合絵, 北岡教英, “マルチモーダル情報を用いた運転中におけるシステム向け発話の推定,” 日本音響学会講演論文集, 2-P-6, pp. 149-150, Mar., 2017.
Yuki Sawada, Yurie Iribe, and Norihide Kitaoka, “Estimation of system-directed utterances during driving using multimodal information,” Proceedings of the Acoustical Society of Japan, 2-P-6, pp. 149-150, Mar. 2017.
- 冬野雄三, 北岡教英, 彭志遠, “対話システムとのタスク指向型・非タスク指向型対話における特徴と嗜好の関係分析,” 日本音響学会講演論文集, 2-P-13, pp. 171-172, Mar., 2017.
Yuzo Fuyuno, Norihide Kitaoka, and Peng Zhiyuan, “An analysis of feature-preference relationships in task-oriented and non-task-oriented dialogues with dialogue systems,” Proceedings of the Acoustical Society of Japan, 2-P-13, pp. 171-172, Mar. 2017.
- 太田健吾, 丸本理貴人, 北岡教英, “ユーザ発話の音響情報に基づく雑談対話システムの応答種別選択,” 日本音響学会講演論文集, 3-5-4, pp. 71-74, Mar., 2017.
Kengo Ohta, Rikito Marumoto, and Norihide Kitaoka, “Response type selection for a chat dialogue system based on acoustic information of user utterances,” Proceedings of the Acoustical Society of Japan, 3-5-4, pp. 71-74, Mar. 2017.
- 河内亮周, 川野賢一, ルガル フランソワ, 玉置卓, “ユニタリ演算識別問題の質問計算量,” RIMS研究集会「理論計算機科学の最先端」(冬のLAシンポジウム), S3, Feb., 2017.
Akinori Kawachi, Kenichi Kawano, François Le Gall, and Suguru Tamaki, “Query complexity of the unitary operation discrimination problem,” RIMS Workshop “Frontiers of Theoretical Computer Science” (Winter LA Symposium), S3, Feb. 2017.
Book
- 間瀬健二・北岡教英,人工知能学大辞典(9章総論),人工知能学会編, pp. 696-705, ISBN978-4320124202, Jul. 2017.
Kenji Mase and Norihide Kitaoka, Encyclopedia of Artificial Intelligence (Chapter 9, General introduction), ed. by the Japanese Society for Artificial Intelligence, pp. 696-705, ISBN 978-4320124202, Jul. 2017.
2016
Journal Papers
- Bohan Chen, Norihide Kitaoka, Kazuya Takeda, “Impact of acoustic similarity on efficiency of verbal information transmission via subtle prosodic cues,” EURASIP Journal on Audio, Speech, and Music Processing, 2016:19, 2016. (DOI: 10.1186/s13636-016-0097-6)
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, “Investigation of DNN-based audio-visual speech recognition,” IEICE Trans. Inf. & Syst., pp. 2444-2451, Oct., 2016.
Foreword
- Norihide Kitaoka, “FOREWORD: Special section on recent advances in machine learning in spoken language processing,” IEICE Trans. Inf. & Syst., Vol. E-99-D, No. 10, p. 2422, Oct., 2016.
International Conferences
- Takuma Nakagawa, Norihide Kitaoka, “Multimodal control system for autonomous vehicles using speech and gesture recognition,” 5th ASA/ASJ Joint Meeting, Nov., 2016.
- Eichi Seto, Norihide Kitaoka, “Example-based spoken chat system which can be customized for each user,” 5th ASA/ASJ Joint Meeting, Nov., 2016.
- Norihide Kitaoka, Shuhei Segawa, Kazuya Takeda, “Emotion recognition from speech using a physical model,” Proc. ICA2016, ICA2016-714 (8 pages), Sep., 2016.
- Yurie Iribe, Norihide Kitaoka and Shuhei Segawa, “Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese,” Proc. LREC 2016, pp. 4674-4677, May, 2016.
Domestic Conferences and Research Meetings
- 瀬戸栄地, 北岡教英, “単語の分散表現を用いた雑談対話システムの事例適応,” 日本音響学会講演論文集, 2-Q-8, (4 pages), Sep., 2016.
Eiji Seto and Norihide Kitaoka, “Case Adaptation of a Chat Dialogue System Using Distributed Representation of Words,” Proceedings of the Acoustical Society of Japan, 2-Q-8, (4 pages), Sep. 2016.
- 中川拓磨, 北岡教英, “音声と指差しを用いた自動運転車とのマルチモーダルインタラクション,” 日本音響学会講演論文集, 3-Q-11, (4 pages), Sep., 2016.
Takuma Nakagawa and Norihide Kitaoka, “Multimodal Interaction with Automatic Driving Vehicles Using Speech and Pointing,” Proceedings of the Acoustical Society of Japan, 3-Q-11, (4 pages), Sep. 2016.
- 林知樹,北岡教英,戸田智基,武田一哉, “Deep Neural Networkに基づく日常生活行動認識における適応手法,” 電子情報通信学会 技術報告,SP2016-27, pp. 1-6, Aug., 2016.
Tomoki Hayashi, Norihide Kitaoka, Tomoki Toda, Kazuya Takeda, “An Adaptation Method in Recognition of Everyday Activities Based on Deep Neural Network,” IEICE Technical Report, SP2016-27, pp. 1-6, Aug. 2016.
- 陳 伯翰,北岡教英, 武田一哉, “Difference of prosodic information transmission efficiency caused by verbally meaningless acoustic difference: An experimental study,” 日本音響学会講論集, 2-Q-32, (2 pages), Mar., 2016.
Bohan Chen, Norihide Kitaoka and Kazuya Takeda, “Difference of prosodic information transmission efficiency caused by verbally meaningless acoustic difference: An experimental study,” Proceedings of the Acoustical Society of Japan, 2-Q-32, (2 pages), Mar. 2016.
Book
- 北岡教英(編集委員(分野幹事・音声)), 音響キーワードブック, ISBN978-4-339-00880-7, Mar., 2016.
Norihide Kitaoka (Editorial Board Member (Field Secretary, Speech)), Acoustic Keyword Book, ISBN 978-4-339-00880-7, Mar. 2016.
2015
Journal Papers
- 市川 賢, 北岡教英, 柘植 覚, 武田一哉, 北 研二, “種々のテキスト検索モデルの頑健性向上による音声ドキュメント検索の高精度化,” 情報処理学会論文誌, Vol. 56, No. 3, pp. 1003-1012, Mar., 2015.
Ken Ichikawa, Norihide Kitaoka, Satoru Tsuge, Kazuya Takeda, Kenji Kita, “Improving the Robustness of Various Text Retrieval Models for Speech Document Retrieval,” Transactions of Information Processing Society of Japan, Vol. 56, No. 3, pp. 1003-1012, Mar. 2015.
- Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “An evaluation method of aggressiveness of driving behavior using drive recorders,” IEEJ Journal of Industry Applications, Vol. 4, No. 1, pp. 59-66, 2015.
Letters
- Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “Modelling of Physical Characteristics of Speech under Stress,” IEEE Signal Processing Letters, (accepted), 2015.
International Conferences
- Shuhei Segawa, Norihide Kitaoka, Kazuya Takeda, “Elderly person’s emotional state estimation in conversation based on speech features for spoken dialogue systems,” 12th Western Pacific Acoustics Conference 2015 (WESPAC2015), pp. 299-301, Dec., 2015.
- Bohan Chen, Norihide Kitaoka, Kazuya Takeda, “Relationship between Speaker/Listener Similarity and Information Transmission Quality in Speech Communication,” APSIPA ASC 2015, pp. 1190-1193, Dec., 2015.
- Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda, “Daily activity recognition based on acoustic signals and acceleration signals estimated with Gaussian process,” APSIPA ASC 2015, pp. 279-282, Dec., 2015.
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Audio-visual speech recognition using deep bottleneck features and high-performance lipreading,” APSIPA ASC 2015, pp. 575-582, Dec., 2015.
- Yurie Iribe, Norihide Kitaoka, Shuhei Segawa, “Development of new speech corpus for elderly Japanese speech recognition,” Oriental-COCOSDA/CASLRE, pp. 27-31, Oct., 2015.
- Bohan Chen, Norihide Kitaoka, Kazuya Takeda, “Effect of speaking rate and speech complexity on transmission quality during driving navigation task,” 7th Biennial Workshop on DSP for In-Vehicle Systems and Safety, 4 pages, Oct., 2015.
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Audio-visual processing toward robust speech recognition in cars,” 7th Biennial Workshop on DSP for In-Vehicle Systems and Safety, 4 pages, Oct., 2015.
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Investigation of DNN-based modeling for audio-visual speech recognition,” 2015 First International Workshop on Spoken Language Processing (MLSLP2015), (6 pages), Oct., 2015.
- Hiroshi Ninomiya, Norihide Kitaoka, Satoshi Tamura, Yurie Iribe, Kazuya Takeda, “Integration of Deep Bottleneck Features for Audio-Visual Speech Recognition,” Proc. INTERSPEECH, pp. 563-566, Sep., 2015.
- Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda, “Daily activity recognition based on DNN using environmental sound and acceleration signals,” Proc. EUSIPCO 2015, pp. 2351-2355, Sep. 2015.
- Yuto Dekiura, Tetsuya Matsumoto, Yoshinori Takeuchi, Hiroaki Kudo, Noboru Onishi, Norihide Kitaoka, Kazuya Takeda, “Fast Separation and Accurate Recognition of Overlapped Speech — Separation by Spectral Subtraction and Acoustic Model Training using Separated Speeches—,” 2015 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP’15), pp. 1-4, Mar., 2015.
- Katsuya Sakoyama, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “Tracking Roadside Signage Observed by Drivers,” 2015 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP’15), pp. 429-432, Mar., 2015.
Domestic Conferences and Research Meetings
- 田村哲嗣, 二宮宏史, 北岡教英, 大須賀晋, 入部百合絵, 武田一哉, 速水 悟, “深層学習によるマルチモーダル音声認識 – 深層学習の活用法の調査,” 第2回サイレント音声認識ワークショップ, ID 16, p. 8, Oct., 2015.
Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Multimodal speech recognition using deep learning – A survey of deep learning applications,” 2nd Silent Speech Recognition Workshop, ID 16, p. 8, Oct. 2015.
- 田村哲嗣, 二宮宏史, 北岡教英, 大須賀晋, 入部百合絵, 武田一哉, 速水 悟, “深層学習によるマルチモーダル音声認識 – 画像特徴量の改善,” 第2回サイレント音声認識ワークショップ, ID 15, p. 8, Oct., 2015.
Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Multimodal speech recognition using deep learning – improvement of image features,” 2nd Silent Speech Recognition Workshop, ID 15, p. 8, Oct. 2015.
- 田村哲嗣, 二宮宏史, 北岡教英, 大須賀晋, 入部百合絵, 武田一哉, 速水 悟, “深層学習によるボトルネック特徴量を用いたマルチモーダル音声認識,” 電子情報通信学会 技術研究報告, SP2015-69, vol.115, no.253, pp.57-62, Oct., 2015.
Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, and Satoru Hayamizu, “Multimodal speech recognition using bottleneck features by deep learning,” IEICE Technical Report, SP2015-69, vol. 115, no. 253, pp. 57-62, Oct. 2015.
- 田村哲嗣,二宮宏史,北岡教英,大須賀晋,入部百合絵,武田一哉,速水悟, “深層学習による音響・画像特徴量を用いたマルチモーダル音声認識,” 日本音響学会講論集, 3-2-5, (2 pages), Sep., 2015.
Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda and Satoru Hayamizu, “Multimodal speech recognition using acoustic and image features by deep learning,” Proceedings of the Acoustical Society of Japan, 3-2-5, (2 pages), Sep. 2015.
- 瀬川周平,北岡教英,武田一哉, “音声対話システムの対話戦略への応用を目的とした音声からの高齢者の感情認識,” 日本音響学会講論集, 3-Q-19, (2 pages), Sep., 2015.
Shuhei Segawa, Norihide Kitaoka and Kazuya Takeda, “Emotion Recognition of Elderly People from Speech for Application to Dialogue Strategies in Spoken Dialogue Systems,” Proceedings of the Acoustical Society of Japan, 3-Q-19, (2 pages), Sep. 2015.
- 陳 伯翰,北岡教英,大武美保子, 武田一哉, “話者交替の確率モデル化と情報量を用いた話者活性度の評価,” 日本音響学会講論集, 1-Q-35, (4 pages), Sep., 2015.
Bohan Chen, Norihide Kitaoka, Mihoko Otake and Kazuya Takeda, “Probabilistic modelling of turn-taking and evaluation of speaker activity using information content,” Proceedings of the Acoustical Society of Japan, 1-Q-35, (4 pages), Sep. 2015.
- Bohan Chen, Norihide Kitaoka, Mihoko Otake, Kazuya Takeda, “Evaluation of speaker engagement using turn-taking behavior entropy,” 電子情報通信学会技術報告SP, SP2015-52, pp. 13-17, Jun., 2015.
Bohan Chen, Norihide Kitaoka, Mihoko Otake, Kazuya Takeda, “Evaluation of speaker engagement using turn-taking behavior entropy,” IEICE Technical Report SP, SP2015-52, pp. 13-17, Jun. 2015.
- 川合窒登,北岡教英,武田一哉, “韻律補正した学習者の音声と日本語音節に基づく近似発音の提示による英語発音矯正手法,” 日本音響学会講論集, 1-2-10, (4 pages), Mar., 2015.
Nitoto Kawai, Norihide Kitaoka, Kazuya Takeda, “A method for correcting English pronunciation by presenting approximate pronunciation based on prosody-corrected learners’ speech and Japanese syllables,” Proceedings of the Acoustical Society of Japan, 1-2-10, (4 pages), Mar. 2015.
- 陳伯翰, 北岡教英, 武田一哉, “音声情報伝達における合理的な音声特徴制御とその伝達効率への影響,” 日本音響学会講論集, 1-R-20, (4 pages), Mar., 2015.
Bohan Chen, Norihide Kitaoka, and Kazuya Takeda, “Rational Speech Feature Control in Speech Information Transfer and Its Effect on Transfer Efficiency,” Proceedings of the Acoustical Society of Japan, 1-R-20, (4 pages), Mar. 2015.
- 林 知樹, 西田昌史,北岡教英, 武田一哉, “DNN による環境音と加速度信号を用いた日常生活行動認識,” 日本音響学会講論集, 2-1-16, (4 pages), Mar., 2015.
Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda, “Recognition of daily life behaviour using environmental sound and acceleration signals by DNN,” Proceedings of the Acoustical Society of Japan, 2-1-16, (4 pages), Mar. 2015.
Book
- 北岡教英, 進化するヒトと機械の音声コミュニケーション (第4篇第2章 タスク指向対話), ニッケイ印刷, ISBN978-4-86469-065-2, Sep., 2015.
Norihide Kitaoka, Evolving Speech Communication between Humans and Machines (Part 4, Chapter 2: Task-oriented Dialogue), Nikkei Printing, ISBN 978-4-86469-065-2, Sep. 2015.
2014 (Oct.~)
International Conferences
- Norihide Kitaoka, Tomoki Hayashi, Kazuya Takeda, “Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features,” APSIPA ASC 2014, (5 pages), Oct., 2014.
- Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda, “Development and preliminary analysis of sensor signal database of continuous daily living activity over the long term,” APSIPA ASC 2014, (6 pages), Oct., 2014.
- Panikos Heracleous, Pongtep Angkititrakul, Norihide Kitaoka, Kazuya Takeda, “Unsupervised energy disaggregation using conditional random fields,” IEEE ISGT Europe 2014, (6 pages), Oct., 2014.
- Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “Measuring Aggressive Driving Behavior Using,” IEEE ITSC14, pp. 1886-1887, Oct., 2014.
Domestic Conferences and Research Meetings
- 森田一輝, 宮島千代美, 北岡教英, 武田一哉, “楽曲構成の違いに着目したアレンジ曲検索性能の評価,” 電子情報通信学会総合大会, D-12-4, (1 page), Mar., 2015.
Kazuki Morita, Chiyomi Miyajima, Norihide Kitaoka and Kazuya Takeda, “Evaluation of Arranged Song Retrieval Performance Focusing on Differences in Song Structure,” IEICE General Conference, D-12-4, (1 page), Mar. 2015.
- 陳伯翰, 北岡教英, 武田一哉, “対話者間の音声特徴類似度と対話の情報伝達効果の関係,” 音声言語シンポジウム, SP2014-124, pp. 147-152, Dec., 2014.
Bohan Chen, Norihide Kitaoka, and Kazuya Takeda, “Relationship between speech feature similarity among interlocutors and the information transfer effect of dialogue,” Spoken Language Symposium, SP2014-124, pp. 147-152, Dec. 2014.