Click here for Kitaoka’s publication list.
2023 / 2022 / 2021 / 2020 /
2019 / 2018 / 2017 / 2016 / 2015 / 2014
2024
Journal Papers
- Shuming Luan, Yukoh Wakabayashi, Tomoki Toda, “Unequally Spaced Sound Field Interpolation for Rotation-Robust Beamforming,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 3185―3199, Jun., 2024. DOI: 10.1109/TASLP.2024.3410879
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “Recognition of target domain Japanese speech using language model replacement,” EURASIP Journal on Audio, Speech and Music Processing, Article number: 40 (2024), 14 pages, 2024. (DOI: 10.1186/s13636-024-00360-8)
- Ryota Nishimura, Takaaki Uno, Taiki Yamamoto, Kengo Ohta, Norihide Kitaoka, “Detection of Arbitrary Wake Words by Coupling a Phoneme Predictor and a Phoneme Sequence Detector,” APSIPA Transactions on Signal and Information Processing, (to appear), 2024.
International Conferences
- Tatsunari Takagi, Yukoh Wakabayashi, Atsunori Ogawa, Norihide Kitaoka, “Text-only Domain Adaptation for CTC-based Speech Recognition through Substitution of Implicit Linguistic Information in the Search Space,” Proc. INTERSPEECH, (to appear), Sep. 2024.
- Keigo Hojo, Yukoh Wakabayashi, Kengo Ohta, Atsunori Ogawa, Norihide Kitaoka, “CTC-based ASR using inter-layer attention-based CTC loss,” Proc. INTERSPEECH, (to appear), Sep. 2024.
- Kazuya Tsubokura, Takuya Takeda, Yurie Iribe, Norihide Kitaoka, “Dialog Breakdown Recovery Strategies Based on User Personality,” Proc. of The 14th International Workshop on Spoken Dialogue Systems Technology (IWSDS2024), Mar. 2024.
Domestic Conferences
- Toshimitsu Sakai, Yukoh Wakabayashi, Norihide Kitaoka, “Speech recognition without the need for speech segment detection with noise and silence labelling,” Acoustical Society of Japan Autumn Research Conference, (to appear), 2 pages, Sep. 2024.
- Takanori Kanai, Yukoh Wakabayashi, Ryota Nishimura, Norihide Kitaoka, “Estimating the end time of input utterances to a spoken dialogue system considering linguistic features using wav2vec 2.0,” Acoustical Society of Japan Autumn Research Conference, (to appear), 2 pages, Sep. 2024.
- Kaito Takahashi, Yukoh Wakabayashi, Kengo Ohta, Akio Kobayashi, Norihide Kitaoka, “Improving Speech Recognition Accuracy with Encoder Layer Substitution in Deaf Speech,” Acoustical Society of Japan Autumn Research Conference, (to appear), 2 pages, Sep. 2024.
- Keigo Hojo, Yukoh Wakabayashi, Kengo Ohta, Atsunori Ogawa, Norihide Kitaoka, “Improving the performance of CTC speech recognition models by weighting the encoder layer using attention mechanisms,” Symposium on Sound Science, Jun. 2024.
- YANG TINGCHENG, Yuya Hosoda, Yukoh Wakabayashi, Norihide Kitaoka, “Japanese Pronunciation Scoring of L2 Learners Based on LSTM,” IPSJ 86th National Conference, 4R-08, Mar. 2024.
- Kazuya Tsubokura, Mai Okada, Yurie Iribe, Norihide Kitaoka, “Collection and Analysis of Dialogue Break Repair Corpus – Towards Repair Sentence Generation Considering User’s Individual Characteristics and Relationship with the System,” 30th Annual Conference of the Association for Natural Language Processing, pp. 1436-1440 (P5-18), Mar. 2024.
- Tomoya Okada, Yurie Iribe, Katsunori Yokoi, Akinori Nakamura, Norihide Kitaoka, Masahisa Katsuno, “Analysis of the effects of dementia etiologic agents on conversational content and prediction of pre-onset Alzheimer’s disease,” 30th Annual Conference of the Association for Natural Language Processing, pp. 571-575 (P2-22), Mar. 2024.
- Yuki Nagae, Tomoya Okada, Yurie Iribe, Katsunori Yokoi, Akinori Nakamura, Norihide Kitaoka, Masahisa Katsuno, “Detecting mild cognitive impairment based on a topic model of free conversation,” 30th Annual Conference of the Association for Natural Language Processing, pp. 472-476 (P2-4), Mar. 2024.
- Yuka Maruyama, Yurie Iribe, Norihide Kitaoka, Katsunori Yokoi, Masahisa Katsuno, “Comparative analysis of phonemes and syllables in conversational speech of Parkinson’s disease patients,” Spring Meeting of the Acoustical Society of Japan, 2-P-18, Mar. 2024.
- Tatsunari Takagi, Yukoh Wakabayashi, Atsunori Ogawa, Norihide Kitaoka, “Domain adaptation by substituting linguistic information in streaming speech recognition using CTC,” Spring Meeting of the Acoustical Society of Japan, 1-Q-22, Mar. 2024.
- Takanori Kanai, Yukoh Wakabayashi, Ryota Nishimura, Norihide Kitaoka, “Advance Estimation of Speech Termination Time for Smooth Spoken Dialogue Systems,” Spring Meeting of the Acoustical Society of Japan, 2-P-7, Mar. 2024.
- Kaito Takahashi, Takahiro Kinouchi, Tatsunari Takagi, Yukoh Wakabayashi, Kengo Ohta, Akio Kobayashi, Norihide Kitaoka, “Evaluation of Speech Recognition Based on Self-Supervised Learning in Speech of the Deaf,” Spring Meeting of the Acoustical Society of Japan, 1-Q-23, Mar. 2024.
- Tamon Mikawa, Yasushi Fujii, Kengo Ohta, Yukoh Wakabayashi, Norihide Kitaoka, “Analysis of human head movements in response to a dialogue partner’s voice in a multimodal chat dialogue dataset,” Spring Meeting of the Acoustical Society of Japan, 2-P-8, Mar. 2024.
- Li Chengfeng, Tatsunari Takagi, Yukoh Wakabayashi, Norihide Kitaoka, “Building an Adaptive Speech Recognition Model for Electronic Medical Record Entry Based on Data Extension with ChatGPT,” Spring Meeting of the Acoustical Society of Japan, 1-Q-24, Mar. 2024.
- Takahiro Kinouchi, Atsunori Ogawa, Yukoh Wakabayashi, Kengo Ohta, Norihide Kitaoka, “Domain adaptation using only large-scale speech data for speech recognition based on multilingual SSL models,” Spring Meeting of the Acoustical Society of Japan, 1-2-2, Mar. 2024.
- Takumi Shine, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka, “Improving the accuracy of speech recognition for the elderly by combining an age estimation task,” Spring Meeting of the Acoustical Society of Japan, 1-2-5, Mar. 2024.
- Rintaro Imamoto, Ryota Nishimura, Kengo Ohta, Norihide Kitaoka, “Construction and evaluation of a real-time spoken dialogue system incorporating a model of aizuchi generation and speaker alternation,” Spring Meeting of the Acoustical Society of Japan, 2-P-6, Mar. 2024.
- Jotaro Emoto, Ryota Nishimura, Kengo Ohta, Norihide Kitaoka, “Development of a real-time VAD-less speech recognition model with noise and silence rejection,” Spring Meeting of the Acoustical Society of Japan, 1-Q-14, Mar. 2024.
- Yoshinori Fukunaga, Ryota Nishimura, Kengo Ohta, Norihide Kitaoka, “Construction of a phase selection model for natural spoken dialogue system using deep learning,” Spring Meeting of the Acoustical Society of Japan, 2-P-4, Mar. 2024.
- Meiko Fukuda, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka, “EARS: Construction of a corpus of Japanese very elderly people’s speech,” Spring Meeting of the Acoustical Society of Japan, 1-2-4, Mar. 2024.
- Tatsunari Takagi, Yukoh Wakabayashi, Atsunori Ogawa, Norihide Kitaoka, “Replacement of implicit linguistic information within beam-search decoding in CTC speech recognition models,” SPEASIP Workshop 2024, pp. 1-6, Mar. 2024.
- Sota Hosoi, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka, “Intermediate Speech Synthesis between Two Speakers Using x-vector Speaker Space,” SPEASIP Workshop 2024, pp. 1-6, Mar. 2024.
- Takahiro Kinouchi, Atsunori Ogawa, Yukoh Wakabayashi, Kengo Ohta, Norihide Kitaoka, “Domain adaptation of speech recognition models using only SSL-based speech data,” SPEASIP Workshop 2024, pp. 1-6, Mar. 2024.
- Kaito Takahashi, Takahiro Kinouchi, Yukoh Wakabayashi, Kengo Ohta, Akio Kobayashi, Norihide Kitaoka, “Evaluation of Speech Recognition for the Deaf and Hard of Hearing by Speaker Adaptation,” SPEASIP Workshop 2024, pp. 1-6, Mar. 2024.
- Keigo Hojo, Yukoh Wakabayashi, Kengo Ohta, Atsunori Ogawa, Norihide Kitaoka, “Integrating Multiple Speech Recognition Models for High Accuracy in Speech Recognition Systems,” SPEASIP Workshop 2024, pp. 1-6, Mar. 2024.
- Takumi Shine, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka, “Improving the accuracy of elderly speech recognition by multitask learning with age information,” SPEASIP Workshop 2024, pp. 1-6, Mar. 2024.
- Ryo Maejima, Norihide Kitaoka, “Construction and Evaluation of a Batch Speech Input Interface for Electronic Medical Records Using Large-scale Language Models,” SPEASIP Workshop 2024, pp. 1-6, Mar. 2024.
2023
Journal Papers
- Kazuya Tsubokura, Yurie Iribe, Norihide Kitaoka, “Analysis of the Relationship between User Response to Dialog Breakdown and Personality Traits,” Advanced Robotics, Vol. 37, Issue 21, pp. 1-10, Nov., 2023. (DOI: 10.1080/01691864.2023.2279610)
- Yukoh Wakabayashi, Kouei Yamaoka, and Nobutaka Ono, “Sound field interpolation for rotation-invariant multichannel array signal processing,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 2286-2298, Jun. 2023. DOI: 10.1109/TASLP.2023.3282098
- Katsunori Yokoi, Yurie Iribe, Norihide Kitaoka, Takashi Tsuboi, Keita Hiraga, Yuki Satake, Makoto Hattori, Yasuhiro Tanaka, Maki Sato, Akihiro Hori, Masahisa Katsuno, “Analysis of spontaneous speech in Parkinson’s disease by natural language processing,” Parkinsonism and Related Disorders, Vol. 112, pp. 1-6, April, 2023. (DOI: 10.1016/j.parkreldis.2023.105411)
- Binh Thien Nguyen, Yukoh Wakabayashi, Kenta Iwai, and Takanobu Nishiura, “Inter-frequency phase difference for phase reconstruction using deep neural networks and maximum likelihood,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 1667-1680, Apr. 2023. DOI: 10.1109/TASLP.2023.3268577
International Conferences
- Koharu Horii, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “Language Modeling for Spontaneous Speech Recognition Based on Disfluency Labeling and Generation of Disfluent Text,” APSIPA ASC 2023, pp. 1867-1872, Nov. 2023.
- Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Kengo Ohta, Atsunori Ogawa, Norihide Kitaoka, “Combining Multiple End-To-End Speech Recognition Models Based on Density Ratio Approach,” APSIPA ASC 2023, pp. 2250-2255, Nov. 2023.
- Nagito Shione, Norihide Kitaoka, “Construction of Automatic Speech Recognition Model That Recognizes Linguistic Information and Verbal/Non-Verbal Phenomena,” APSIPA ASC 2023, pp. 2282-2287, Nov. 2023.
- Tatsunari Takagi, Norihide Kitaoka, Atsunori Ogawa, Yukoh Wakabayashi, “Streaming End-To-End ASR Using CTC Decoder and DRA for Linguistic Information Substitution,” APSIPA ASC 2023, pp. 1768-1772, Nov. 2023.
- Ryo Maejima and Norihide Kitaoka, “Speech recognition interface for updating electronic medical records with automatic itemization,” ICAICTA2023, (5 pages) Oct., 2023.
- Takahiro Kinouchi, Atsunori Ogawa, Yukoh Wakabayashi and Norihide Kitaoka, “Domain adaptation with a non-parallel target domain corpus,” ICAICTA2023, (6 pages) Oct., 2023.
- Tatsunari Takagi, Yukoh Wakabayashi, Atsunori Ogawa and Norihide Kitaoka, “Domain Adaptation Using Density Ratio Approach and CTC Decoder for Streaming Speech Recognition,” ICAICTA2023, (5 pages) Oct., 2023.
- Nagito Shione, Yukoh Wakabayashi and Norihide Kitaoka, “Automatic Speech Recognition Using Linguistic and Verbal/Non-verbal Information,” ICAICTA2023, (6 pages) Oct., 2023.
- Aito Nakata, Ryota Nishimura, Kengo Ohta, Norihide Kitaoka, “Development of a Model for Predicting Timing of Back-Channel in a Real-Time Spoken Dialog System,” GCCE2023, (to appear), Oct., 2023.
- Kazuya Tsubokura, Yurie Iribe, Norihide Kitaoka, “Relationships Between Gender, Personality Traits and Features of Multi-Modal Data to Responses to Spoken Dialog Systems Breakdown,” INTERSPEECH2023, pp. 2713-2717, Oct., 2023. (DOI: 10.21437/Interspeech.2023-1267)
Domestic Conferences
- Nagito Shione, Yukoh Wakabayashi, and Norihide Kitaoka, “Proposal of a Speech Recognition System for Detecting Verbal and Nonverbal Phenomena,” Acoustical Society of Japan Autumn Meeting, 2-Q-3, Sep. 2023.
- Takahiro Kinouchi, Atsunori Ogawa, Yukoh Wakabayashi, Norihide Kitaoka, “Domain adaptation of speech recognition models using only SSL-based speech data,” Acoustical Society of Japan Autumn Meeting, 2-Q-9, Sep. 2023.
- Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Atsunori Ogawa, and Norihide Kitaoka, “Construction of Robust Speech Recognition System by Integration of Multiple Speech Recognition Models Based on Density Ratio Approach,” Acoustical Society of Japan Autumn Meeting, 2-Q-10, Sep. 2023.
- Kaito Kofuji, Ryota Nishimura, Kengo Ohta, and Norihide Kitaoka, “Construction and evaluation of a multilingual speech synthesis model using monolingual speakers,” Acoustical Society of Japan Autumn Meeting, 2-Q-37, Sep. 2023.
- Tamon Mikawa, Daishi Yamaoka, and Norihide Kitaoka, “Evaluation of Robust Speech Recognition Models for Overlap Using End-to-end Models,” Acoustical Society of Japan Autumn Meeting, 3-Q-1, Sep. 2023.
- Keigo Hojo, Sousuke Kawahigashi, and Norihide Kitaoka, “Data expansion for speech recognition by sentence generation and speech synthesis focusing on words that are difficult to recognise,” Acoustical Society of Japan Autumn Meeting, 3-Q-2, Sep. 2023.
- Tatsunari Takagi, Atsunori Ogawa, Norihide Kitaoka, Yukoh Wakabayashi, “Domain adaptation based on Density Ratio Approach for streamable speech recognition using CTC decoder,” Acoustical Society of Japan Autumn Meeting, 3-Q-6, Sep. 2023.
- Ryo Maejima, Daiki Mori, Yukoh Wakabayashi and Norihide Kitaoka, “Building a medical electronic health record item-specific automatic entry interface using speech recognition,” FIT2023, Sep. 2023.
- Ryo Maejima and Norihide Kitaoka, “Construction of an Automatic Input Interface for Medical Electronic Medical Record Items Using Continuous Speech Recognition and ChatGPT,” Tokai Section Joint Conference, Aug, 2023.
- Yuki Nagae, Tomoya Okada, Yurie Iribe, Norihide Kitaoka, Katsunori Yokoi and Masahisa Katsuno, “Analysis of linguistic features extracted from free conversation speech of dementia patients,” Tokai Section Joint Conference, Aug, 2023.
- Tatsunari Takagi, Atsunori Ogawa, Norihide Kitaoka, Yukoh Wakabayashi, “Streaming End-to-End Speech Recognition Using a CTC Decoder to Replace Implicit Linguistic Information,” Acoustics Symposium, Jun. 2023.
- Takahiro Kinouchi, Atsunori Ogawa, Yukoh Wakabayashi, and Norihide Kitaoka, “Domain adaptation of speech recognition models based on self-supervised learning using target domain speech,” Acoustics Symposium, Jun. 2023.
- Nagito Shione, Yukoh Wakabayashi, and Norihide Kitaoka, “Construction of a speech recognition model for simultaneous recognition of linguistic information and verbal and non-verbal phenomena,” Acoustics Symposium, Jun. 2023.
- Ryo Maejima, Daiki Mori, Yukoh Wakabayashi, and Norihide Kitaoka, “Construction of a Language Model for Speech Recognition Based on Sentence Generation for Small Training Data Domains,” SPEASIP Workshop, Mar. 2023.
- Nagito Shione, Yukoh Wakabayashi and Norihide Kitaoka, “A study of speech recognition models with linguistic and non-linguistic information tags,” SPEASIP Workshop, Mar. 2023.
- Tomohiro Takahashi, Yuma Kinoshita, Yukoh Wakabayashi, Nobutaka Ono, Jun Honda, Seiji Fukuma and Hiroshi Nakagawa, “Traffic Monitoring by Sound Based on Learning Data Obtained by Traffic Counter,” Proceedings of the Acoustical Society of Japan, 1-1-12, Mar., 2023.
- Kanato Uesaka, Shuto Kawauchi, Kouei Yamaoka, Yukoh Wakabayashi, Yuma Kinoshita, Nobutaka Ono, Jun Noguchi, Kei Watanabe, Noritaka Ichido, Seiko Benner and Hidenori Yamasue, “Vocal classification of marmosets using machine learning and analysis of developmental vocal change based on it,” Proceedings of the Acoustical Society of Japan, 3-4-5, Mar., 2023.
- Koharu Horii, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “Language modelling based on non-fluent sentence generation by BERT for spontaneous speech recognition,” Proceedings of the Acoustical Society of Japan, 1-3-2, Mar. 2023.
- Ryuto Date, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “Improvement of speech recognition accuracy in noise using lip information by deep learning,” Proceedings of the Acoustical Society of Japan, 1-3P-3, Mar., 2023.
- Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Atsunori Ogawa and Norihide Kitaoka, “Construction of a robust speech recognition system by integration of multiple Encoder-Decoder speech recognition models,” Proceedings of the Acoustical Society of Japan, 1-3Q-3, Mar., 2023.
- Masato Sugiyama, Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “A real-time speaker alternation system for interrupted speech,” Proc. of the Acoustical Society of Japan, 2-3P-1, Mar. 2023.
- Kazuya Tsubokura, Takuya Takeda, Yurie Iribe, and Norihide Kitaoka, “Relationship between user responses to dialogue breakdowns in spoken dialogue systems and individual characteristics,” 29th Annual Conference of the Association for Natural Language Processing, pp. 2002-2006, Mar. 2023.
- Makoto Hotta, Koharu Horii, Norihide Kitaoka, Hiromitsu Nishizaki, “Generation of easy-to-understand English subtitles based on shaping of Japanese speech recognition results,” IPSJ 85th National Convention, 1W-01, Mar. 2023.
2022
Journal Papers
- Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Koharu Horii, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka, “A new speech corpus of super-elderly Japanese for acoustic modeling,” Computer Speech & Language, Vol. 77, pp. 1-22, 2022. (DOI: 10.1016/j.csl.2022.101424)
- Ryota Nishimura, Raita Mori, Kengo Ohta, and Norihide Kitaoka, “A Topic Complementation Method to Input Speech by Matching Analysis Corresponding to Free Speech for Spoken Dialogue Systems,” Transactions of the Japanese Society for Artificial Intelligence, Vol. 37, No. 3, pp. 1-13, 2022.
Explanation
- Norihide Kitaoka, Ryota Nishimura, and Kengo Ohta, “Multimodal dialogue with photorealistic CG agents,” Journal of the Acoustical Society of Japan, Vol. 78, No. 5, pp. 257-264, May, 2022.
- Ikkoh Yamamoto, Hideki Sakano, and Norihide Kitaoka, “On the ‘uncanny valley’ in spoken dialogue systems,” Journal of the Acoustical Society of Japan, Vol. 78, No. 5, pp. 245-248, May, 2022.
International Conferences
- Binh Thien Nguyen, Yukoh Wakabayashi, Geng Yuting, Kenta Iwai, and Takanobu Nishiura, “Von Mises mixture model-based DNN for sign indetermination problem in phase reconstruction,” Proc. APSIPA ASC 2022, pp. 958―962, Chiang Mai, Nov., 2022.
- Yui Kuriki, Taishi Nakashima, Kouei Yamaoka, Natsuki Ueno, Yukoh Wakabayashi, Nobutaka Ono, and Ryo Sato, “Efficient low-latency convolution with uniform filter partition and its evaluation on real-time blind source separation,” Proc. APSIPA ASC 2022, pp. 766―770, Chiang Mai, Nov., 2022.
- Kenta Yamada, Yoshiki Masuyama, Yukoh Wakabayashi, and Nobutaka Ono, “Simultaneous frequency estimation for three or more sinusoids based on sinusoidal constraint differential equation,” Proc. APSIPA ASC 2022, pp. 976―979, Chiang Mai, Nov., 2022.
- Meiko Fukuda, Masakazu Sugiyama, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka, “A corpus-based analysis of age-related change in the acoustic features of elderly to super elderly speech,” Proc. Oriental-COCOSDA, (6 pages), Nov., 2022.
- Haruki Nammoku, Kouei Yamaoka, Taishi Nakashima, Yukoh Wakabayashi, and Nobutaka Ono, “Analysis and source separation of overlapping speech using corpus of everyday Japanese conversation,” Proc. ICA, Gyeongju, Oct., 2022.
- Kazuya Tsubokura, Yurie Iribe, Norihide Kitaoka, “Dialog Breakdown Detection Using Multimodal Features for Non-Task-Oriented Dialog Systems,” GCCE2022, pp. 359-363, Oct., 2022.
- Shuming Luan, Yukoh Wakabayashi, and Tomoki Toda, “Modified sound field interpolation method for rotation-robust beamforming with unequally spaced circular microphone array,” Proc. EUSIPCO 2022, pp. 344―348, Belgrade, Sep., 2022.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “Implicit language information replace method in Japanese encoder-decoder ASR model,” ICAICTA-2022, Sep., 2022.
- Takahiro Kinouchi, Norihide Kitaoka, “A response generation method of chat-bot system using input formatting and reference resolution,” ICAICTA-2022, Sep., 2022.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “End-to-End Spontaneous Speech Recognition Using Disfluency Labeling,” Proc. INTERSPEECH 2022, (5 pages), Sep., 2022.
- Meiko Fukuda, Maina Umezawa, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto, Norihide Kitaoka, “Elderly Conversational Speech Corpus with Cognitive Impairment Test and Pilot Dementia Detection Experiment Using Acoustic Characteristics of Speech in Japanese Dialects,” Proc. LREC2022, pp. 1016-1022, Jun, 2022.
- Akio Kobayashi, Junji Onishi, Hiromitsu Nishizaki, Norihide Kitaoka, “End-to-End Speech to Braille Translation in Japanese,” ICCE2021, 2 pages, Jan., 2022.
Domestic Conferences
- Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Atsunori Ogawa, Norihide Kitaoka, “Multiple Encoder-Decoder Speech Recognition Model Integration Method Based on Density Ratio Approach,” 24th Symposium on Spoken Language and 9th Natural Language Processing, Dec. 2022.
- Akihiro Torii, Ryota Nishimura and Norihide Kitaoka, “Construction of Dialogue Failure Detector in Spoken Dialogue System,” Proceedings of the 2022 Shikoku Branch Joint Conference of Institutes of Electrical, Electronics and Information Engineers, vol. 15-8, p. 145, 2022.
- Kohyo Fukumura, Ryota Nishimura, and Norihide Kitaoka, “Extension of Chat Dialogue Topics by BERT,” Proceedings of the 2022 Shikoku Branch Joint Conference of Institutes of Electrical, Electronics and Information Engineers, vol. 15-9, p. 146, 2022.
- Binh Thien Nguyen, Yukoh Wakabayashi, Yuting GENG, Kenta Iwai, and Takanobu Nishiura, “Two-stage phase reconstruction using inter-frequency phase difference,” Proceedings of the Acoustical Society of Japan, 1-Q-11, Sep. 2022.
- Guanzang Ren, Taishi Nakashima, Yukoh Wakabayashi, and Nobutaka Ono, “Self-rotation angle estimation of circular microphone array based on auxiliary function method,” Proceedings of the Acoustical Society of Japan, 1-R-29, Sep. 2022.
- Taishi Nakashima, Yukoh Wakabayashi, and Nobutaka Ono, “Rotationally robust blind source separation of circular microphone arrays using sound field interpolation,” Proceedings of the Acoustical Society of Japan, 1-Q-23, Sep. 2022.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, and Norihide Kitaoka, “Evaluation of an end-to-end non-fluent shaped speech recognition system by dialogue speech,” Proceedings of the Acoustical Society of Japan, 2-8-5, Sep. 2022.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Design of an Encoder-Decoder speech recognition model augmented with out-of-domain acoustic information,” Proceedings of the Acoustical Society of Japan, 2-Q-26, Sep. 2022.
- Kazuya Tsubokura, Yurie Iribe, and Norihide Kitaoka, “Individual user differences during dialogue breakdown in a multimodal dialogue system,” Proceedings of the Acoustical Society of Japan, 3-Q-13, Sep. 2022.
- Tomoya Okada, Yurie Iribe, and Norihide Kitaoka, “Detection of suspected dementia from chat dialog speech using BERT,” Proceedings of the Acoustical Society of Japan, 3-Q-29, Sep. 2022.
- Meiko Fukuda, Masakazu Sugiyama, Ryota Nishimura, Yurie Iribe, Kazumasa Yamamoto and Norihide Kitaoka, “Analysis of acoustic features of elderly speech using a corpus of very elderly people and S-JNAS,” Proceedings of the Acoustical Society of Japan, 3-Q-32, Sep. 2022.
- Yuka Maruyama, Yurie Iribe, Norihide Kitaoka, Katsunori Yokoi, and Masahisa Katsuno, “Analysis of acoustic features based on severity of Parkinson’s disease,” Proceedings of the Acoustical Society of Japan, 3-Q-43, Sep. 2022.
- Yuka Maruyama, Yurie Iribe, Norihide Kitaoka, Katsunori Yokoi, Masahisa Katsuno, “Parkinson’s disease detection from short speech utterances using acoustic information,” Proceedings of the Acoustical Society of Japan, 2-3P-10, Mar. 2022.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, and Norihide Kitaoka, “Design of an End-to-End Speech Recognition Model with Extra-Task Acoustic Information,” Proceedings of the Acoustical Society of Japan, 2-3Q-2, Mar. 2022.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “Disfluency-shaping end-to-end speech recognition using non-fluent labels,” Proceedings of the Acoustical Society of Japan, 1-3-5, Mar. 2022.
2021
Journal Papers
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation,” EURASIP Journal on Audio, Speech, and Music Processing, 2021:42, 20 pages, Dec., 2021. (DOI: 10.1186/s13636-021-00225-4)
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Norihide Kitaoka, “Normalization of Transliterated Mongolian Words Using Seq2Seq Model with Limited Data,” ACM Transactions on Asian and Low-Resource Language Information Processing, No. 103, pp. 1-19, Nov., 2021.
- Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “Response Type Selection for Chat-like Spoken Dialog Systems Based on LSTM and Multi-task Learning,” Speech Communication, vol. 133, pp. 23-30, Oct., 2021.
- Hayato Ishihara, Yurie Iribe and Norihide Kitaoka, “Detection of Suspected Dementia from Chat Dialogues Focusing on Engagement Distance,” IEICE Transactions D, Vol. J104-D, No. 04, pp. 357-367, Apr. 2021.
- Norihide Kitaoka, Bohan Chen, Yuya Obashi, “Dynamic out-of-vocabulary word registration to language model for speech recognition,” EURASIP Journal on Audio, Speech, and Music Processing, 2021:4, (8 pages), 2021. (DOI: 10.1186/s13636-020-00193-1)
International Conferences
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka, “Multi-speaker TTS system for low-resource language using cross-lingual transfer learning and data augmentation,” Proc. APSIPA ASC 2021, pp. 849-853, 2021.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “Advanced language model fusion method for encoder-decoder model in Japanese speech,” Proc. APSIPA ASC 2021, pp. 503-510, 2021.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka, “End-to-end spontaneous speech recognition using hesitation labeling,” Proc. APSIPA ASC 2021, pp. 1077-1081, 2021.
- Akio Kobayashi, Keiichi Yasu, Hiromitsu Nishizaki, Norihide Kitaoka, “Corpus Design and Automatic Speech Recognition for Deaf and Hard-Of-Hearing People,” GCCE2021, pp. 17-18, Oct., 2021.
Explanation
- Susumu Ohsuga, Godai Tanaka, Ayana Nabekura, Hiroyuki Fujii, Ryota Nakano, Ryota Watanabe, TELYUKA, Kengo Ohta, Ryota Nishimura, Norihide Kitaoka, “Multimodal Agent ‘Saya’ for Next Generation Mobility,” Automotive Technology, Vol. 75, No. 9, pp. 109-109, Sep. 2021.
Domestic Conferences
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “End-to-End Speech Recognition Considering Stammering,” 19th Workshop on Informatics (WiNF2021), S-5-2, Nov. 2021.
- Takahiro Kinouchi and Norihide Kitaoka, “A chat response generation system using speech-formatted dialogue history,” 19th Workshop on Informatics (WiNF2021), S-5-3, Nov. 2021.
- Daiki Mori, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “A replacement method for implicit linguistic information in the Encoder-Decoder speech recognition model,” 19th Workshop on Informatics (WiNF2021), S-5-5, Nov. 2021.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “A method for replacing implicit linguistic information in the Encoder-Decoder speech recognition model,” Proc.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “End-to-End Speech Recognition of Free Speech Considering Stuttering,” Proceedings of the Acoustical Society of Japan, 1-3-1, Sep. 2021.
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “Cross-lingual, multi-speaker text-to-speech synthesis for low resource languages,” Proceedings of the Acoustical Society of Japan, 1-3-7, Sep. 2021.
- Narangerel Purevdorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “How language similarity affects the Mongolian ASR using cross-lingual transfer learning,” Proceedings of the Acoustical Society of Japan, 2-3-7, Sep. 2021.
- Akio Kobayashi, Junji Ohnishi, Hiromitsu Nishizaki, and Norihide Kitaoka, “End-to-End Spoken Braille Translation of Readout Sentences,” Proceedings of the Acoustical Society of Japan, 2-3P-3, Sep. 2021.
- Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Yurie Iribe, Kazumasa Yamamoto and Norihide Kitaoka, “Acoustic features of the very elderly in the EARS speech corpus of the very elderly,” Proceedings of the Acoustical Society of Japan, 2-3P-11, Sep. 2021.
- Ryota Nishimura, Takahiro Mori and Norihide Kitaoka, “Construction of a spoken dialogue system with real-time control using ROS,” Proceedings of the Acoustical Society of Japan, 2-3Q-4, Sep. 2021.
- Norihide Kitaoka, Ryota Nishimura, Kengo Ohta, Teruyuki Ishikawa, Yuka Ishikawa, Ryota Nakano, Godai Tanaka, Ayana Nabekura, Tatsuya Sato, Ryota Watanabe and Susumu Ohsuga, “Response control in dialogue with the 3D CG agent Saya,” Proceedings of the Acoustical Society of Japan, 3-3-14, Sep. 2021.
- Katsunori Yokoi, Takashi Tsuboi, Makoto Hattori, Yuki Satake, Keita Hiraga, Yasuhiro Tanaka, Maki Sato, Akihiro Hori, Yurie Iribe, Norihide Kitaoka, Masahisa Katsuno, “Natural language processing of oral reading and conversation in Parkinson’s disease patients,” Programme and Abstracts of the 15th Parkinson’s and Movement Disorders Congress, p. 81, Jul. 2021.
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa and Norihide Kitaoka, “A replacement method for implicit linguistic information in end-to-end speech recognition models,” Acoustics Symposium, Jun. 2021.
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “End-to-End Speech Recognition of Non-Fluent Speech by Hesitation Labelling,” Acoustics Symposium, Jun. 2021.
- Norihide Kitaoka, Ryota Nishimura, Kengo Ohta, Teruyuki Ishikawa, Yuka Ishikawa (TELYUKA), Ryota Nakano, Godai Tanaka, Ayana Nabekura, Tatsuya Sato, Ryota Watanabe, Susumu Ohsuga, “Construction of a multimodal dialogue system with photorealistic CG agents,” Proceedings of the Acoustical Society of Japan, 1-2-6, Mar. 2021.
- Kengo Ohta, Ryota Nishimura and Norihide Kitaoka, “Construction of a Japanese End-to-End Speech Synthesis Server Considering Accented Phrases,” Proceedings of the Acoustical Society of Japan, 1-2-7, Mar. 2021.
- Akio Kobayashi, Keiichi Yasu, Hiromitsu Nishizaki, Norihide Kitaoka, “Collection of speech data of hearing-impaired people and evaluation by phoneme recognition,” Proceedings of the Acoustical Society of Japan, 2-2-4, Mar. 2021.
- Motoki Shimogasa, Hiromitsu Nishizaki and Norihide Kitaoka, “Data expansion using CycleGAN for speech recognition of very elderly people,” Proceedings of the Acoustical Society of Japan, 2-2P-6, Mar., 2021.
- Narangerel Purevdorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “Building a low resource speech recogniser: Transfer learning and data augmentation,” Proceedings of the Acoustical Society of Japan, 3-2-9, Mar. 2021.
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta and Norihide Kitaoka, “Text to speech system for low resource languages by cross-lingual transfer learning and data augmentation,” Proceedings of the Acoustical Society of Japan, 3-2-10, Mar., 2021.
2020
Journal Papers
- Jiahao Chen, Ryota Nishimura, Norihide Kitaoka, “End-to-End Recognition of Streaming Japanese Speech Using CTC and Local Attention,” APSIPA Transactions on Signal and Information Processing, vol. 9, e25, pp. 1-7, 2020.
- Norihide Kitaoka, Eichi Seto, Ryota Nishimura, “Example phrase adaptation method for customized, example-based dialog system using user data and distributed word representations,” IEICE Trans. Inf. & Syst., Vol. E103-D, No. 11, pp. 2332-2339, Nov., 2020.
International Conferences
- Chee Siang Leow, Tomoaki Hayakawa, Hiromitsu Nishizaki, Norihide Kitaoka, “Development of a Low-Latency and Real-Time Automatic Speech Recognition System,” GCCE2020, pp. 464-467, Oct., 2020.
- Meiko Fukuda, Hiromitsu Nishizaki, Yurie Iribe, Ryota Nishimura, Norihide Kitaoka, “Improving speech recognition for the elderly: A new corpus of elderly Japanese speech and investigation of acoustic modeling for speech recognition,” Proc. LREC2020, 9 pages, Jun, 2020.
- Jiahao Chen, Ryota Nishimura, Norihide Kitaoka, “E2E Streaming Speech Recognition Using CTC and Local Attention,” Proc. NCSP’20, 4 pages, Mar. 2020.
Domestic Conferences
- Maina Umezawa, Yurie Iribe, and Norihide Kitaoka, “Discrimination of elderly people with dementia based on spoken language information,” IEICE Technical Report (SP2020-12, WIT2020-12), 6 pages, Oct. 2020.
- Meiko Fukuda, Yurie Iribe, Hiromitsu Nishizaki, Kazuhiro Yamamoto, Ryota Nishimura, and Norihide Kitaoka, “Construction of EARS, a speech corpus for very elderly people, and preliminary study of its use for speech recognition,” IPSJ SIG Technical Report, Vol. 2020-SLP-133, No. 6, pp. 1-6, Oct. 2020.
- Leo Qishan, Hiromitsu Nishizaki and Norihide Kitaoka, “Development and evaluation of a Kaldi-based low-latency real-time speech recognition system,” Proceedings of the Acoustical Society of Japan, Sep. 2020.
- Kaito Suzuki, Yurie Iribe, and Norihide Kitaoka, “Dialogue Breakdown Detection Using Facial Expression and Acoustic Information,” Proceedings of the Acoustical Society of Japan, 2-P1-4, Sep. 2020.
- Taiga Yamazaki, Ryota Nishimura and Norihide Kitaoka, “Construction of an End-to-End Japanese Speech Synthesis System Capable of Expressing Emotions,” Proceedings of the Acoustical Society of Japan, 2-P1-2, Sep. 2020.
- Hayato Ishihara, Yurie Iribe, and Norihide Kitaoka, “Detection of Dementia Tendency from Chat Dialogue Speech Considering Sentence Complexity,” Proceedings of the Acoustical Society of Japan, 2-P1-2, Sep. 2020.
- Byambadorj Zolzaya, Ryota Nishimura, Ayush Altangerel, Norihide Kitaoka, “Normalisation of transliterated words using seq2seq model with spell checker,” 26th Annual Conference of the Association for Language Processing, E5-3, pp. 1133-1136, Mar. 2020.
- Jiahao Chen, Ryota Nishimura and Norihide Kitaoka, “Streaming Speech Recognition Using Uni-directional LSTM and Local Attention,” Proceedings of the Acoustical Society of Japan, 2-Q-12, pp. 943-946, Mar. 2020.
- Meiko Fukuda, Hiromitsu Nishizaki, Yurie Iribe, Ryota Nishimura and Norihide Kitaoka, “Construction of a speech corpus for the elderly and analysis of age and dialect effects on speech recognition,” Proceedings of the Acoustical Society of Japan, 2-Q-13, pp. 947-950, Mar. 2020.
- Yuya Kobashi, Ryota Nishimura and Norihide Kitaoka, “Evaluation of a Language Model for Spoken Language Recognition Using Text Conversion from Written to Spoken Language,” Proc. of the Acoustical Society of Japan, 2-Q-13, pp. 951-954, Mar. 2020.
- Raita Mori, Ryota Nishimura, and Norihide Kitaoka, “Spoken dialogue system with collocational analysis for free speech,” Proceedings of the Acoustical Society of Japan, 3-P-13, pp. 1023-1026, Mar. 2020.
- Kanta Kiyohara, Ryota Nishimura, and Norihide Kitaoka, “Construction of a Multimodal Geometric Problem Solving System by Integrated Understanding of Speech and Pointing,” IPSJ 82nd National Convention, 5F-03, pp. 4-5 – 4-6, Mar. 2020.
- Hayato Ishihara, Yurie Iribe, and Norihide Kitaoka, “Dementia tendency detection from chat dialogues focusing on lexical and engaged structures,” The 82nd National Convention of Information Processing Society of Japan, 5ZE-03, pp. 4-459 – 4-460, Mar. 2020.
Book Chapters
- Norihide Kitaoka, Takuma Nakagawa, Ryota Nishimura, Yoshio Ishiguro, Shin’ichi Kojima and Shin Ohsuga, “A multimodal control system for autonomous vehicles using speech, gesture and gaze recognition,” pp. 101-111, in Vehicles, Drivers, and Safety, De Gruyter, 2020.
2019
Invited talk
- Norihide Kitaoka, “Spoken and multimodal interfaces: Interaction systems with machines,” ICAICTA2019 (Keynote speech), Sep 2019.
International Conferences
- Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Yurie Iribe, Norihide Kitaoka, “A new corpus of elderly Japanese speech for acoustic modeling, and a preliminary investigation of dialect-dependent speech recognition,” Proc. Oriental-COCOSDA2019, 6 pages, Oct., 2019. (Best paper award)
- Akihira Komatsu, Ryota Nishimura, Norihide Kitaoka, “Environmental sounds recognition with convolutional-LSTM,” GCCE2019, pp. 717-719, 2019.
- Yuya Kobashi, Ryota Nishimura, Norihide Kitaoka, “Automatic conversion of written language into spoken language using a sequence-to-sequence model trained with a parallel corpus,” Proc. Oriental-COCOSDA2019, 5 pages, Oct., 2019.
- Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, Norihide Kitaoka, “Small-footprint magic word detection method using convolutional LSTM neural network,” Proc. INTERSPEECH2019, pp. 2035-2039, Sep. 2019.
Domestic Conferences
- Yuya Kobashi, Ryota Nishimura, and Norihide Kitaoka, “Text conversion from written to spoken language for a language model for spoken speech recognition using a sequence-to-sequence model,” Proceedings of the Acoustical Society of Japan, pp. 807-810, Sep. 2019.
- Jiahao Chen, Ryota Nishimura, and Norihide Kitaoka, “End-to-end streaming speech recognition using CTC and Attention,” Proceedings of the Acoustical Society of Japan, 1-P-16, pp. 871-874, Sep. 2019.
- Meiko Fukuda, Ryota Nishimura, Hiromitsu Nishizaki, Yurie Iribe, and Norihide Kitaoka, “Speech corpus construction for elderly speech recognition and the effect of adaptation to dialects,” in Proceedings of the Acoustical Society of Japan, 1-P-17, pp. 875-878, Sep. 2019.
- Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, and Norihide Kitaoka, “Memory-saving Magic Word detection using Convolutional LSTM,” Proceedings of the Acoustical Society of Japan, 2-3-4, pp. 819-822, Sep. 2019.
- Akihisa Komatsu, Ryota Nishimura, and Norihide Kitaoka, “Environmental sound recognition using CNN and CLSTM,” in Proceedings of the Acoustical Society of Japan, 2-Q-17 pp. 925-928, Sep. 2019.
- Shion Akimizu, Yurie Iribe, and Norihide Kitaoka, “Detecting dialogue breakdowns in dialogue systems using non-verbal information,” IPSJ 81st National Conference, 2T-08, pp. 2-365-2-366, Mar. 2019.
- Maina Umezawa, Yurie Iribe, and Norihide Kitaoka, “Detection of Dementia Tendency in the Elderly Based on Spoken Language Information Considering Dialect,” 81st National Conference of Information Processing Society, 4ZE-07, pp. 4-463-4-464, Mar. 2019.
- Yasuyuki Umehara, Ryota Nishimura, and Norihide Kitaoka, “A method for constructing a spoken dialogue system integrating various dialogue strategies,” Proceedings of the Acoustical Society of Japan, 2-P-1, pp. 945-948, Mar. 2019.
- Kazuaki Kajinami, Ryota Nishimura, Yurie Iribe, and Norihide Kitaoka, “A Spoken Dialogue Data Recording System for the Development of a Spoken Dialogue Failure Detection Method,” Proc. of the Acoustical Society of Japan, 2-P-2, pp. 949-952, Mar. 2019.
- Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Response type selection for a chatting spoken dialogue system using multi-task learning with LSTM,” Proceedings of the Acoustical Society of Japan, 2-P-3, pp. 953-956, Mar. 2019.
- Kanta Kiyohara, Ryota Nishimura, and Norihide Kitaoka, “Construction and evaluation of a learning support system using voice and pointing in geometry problems,” Proceedings of the Acoustical Society of Japan, 2-P-17, pp. 989-992, Mar. 2019.
2018
Journal Papers
- Ryota Nishimura, Daisuke Yamamoto, Takahiro Uchiya, Ichi Takumi, “Web-based environment for user generation of spoken dialog for virtual assistants,” EURASIP Journal on Audio, Speech, and Music Processing, pp. 1-13, 2018.
- Ryota Nishimura, Daisuke Yamamoto, Takahiro Uchiya, Ichi Takumi, “MMDAE: Dialog scenario editor for MMDAgent on the web browser,” ICT Express, pp. 1-5, 2018. (In Press)
- Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Tomoki Toda, Kazuya Takeda, “Daily Activity Recognition with Large-scaled Real-life Recording Datasets Based on Deep Neural Network using Multi-modal Signals,” IEICE Trans. Fundamentals, Vol. E101-A, No. 1, pp. 199-210, Jan. 2018.
Letters
- Ryota Nishimura, Takumi Nagao, Ikujin Ichimanda, and Norihide Kitaoka, “A study of speech intelligibility processing method based on speech perception characteristics of elderly people,” Journal of the Japanese Society for Fuzzy Intelligent Information, Vol. 30, No. 6, pp. 840-845, Dec. 2018.
- Ryota Nishimura, Miho Higaki, and Norihide Kitaoka, “Mapping between Acoustic Vector Space and Document Vector Space by RNN-LSTM,” Journal of Japanese Society for Fuzzy Intelligent Information, Vol. 30, No. 4, pp. 628-633, Aug. 2018.
- Norihide Kitaoka, Shuhei Segawa, Ryota Nishimura, Kazuya Takeda, “Recognizing emotions from speech using a physical model,” Acoustical Science and Technology, Vol. 39, Issue 2, pp. 167-170, Feb., 2018. (doi: 10.1250/ast.39.167)
Invited talk
- Norihide Kitaoka, Yurie Iribe, Hiromitsu Nishizaki, “Construction of a corpus of elderly Japanese speech for analysis and recognition,” LREC2018, May 2018.
International Conferences
- Eichi Seto, Ryota Nishimura, Norihide Kitaoka, “Customization of an example-based dialog system with user data and distributed word representations,” Proc. APSIPA2018, 7 pages, Nov. 2018.
- Ryota Nishimura, Miho Higaki, Norihide Kitaoka, “Mapping acoustic vector space and document vector space by RNN-LSTM,” 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018, pp. 296-297, 2018.
- Meiko Fukuda, Ryota Nishimura, Norihide Kitaoka, Hiromitsu Nishizaki, Yurie Iribe, “Construction of a corpus for elderly Japanese speech recognition,” 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018, pp. 652-653, 2018.
- Kanta Kiyohara, Ryota Nishimura, Norihide Kitaoka, “Multi-modal geometry tutoring system using speech and touchscreen figure tracing,” 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018, pp. 225-226, 2018.
- Norihide Kitaoka, Takuma Nakagawa, Ryota Nishimura, Yoshio Ishiguro, Shin’ichi Kojima, Shin Ohsuga, “A multimodal control system for autonomous vehicles using speech, gesture, and gaze recognition,” DSP in Vehicles 2018, (no paper), 2018.
- Kazuaki Kajinami, Ryota Nishimura, Norihide Kitaoka, “Construction of dialog database for development of spoken dialog breakdown detection methods,” ICAICTA-2018, pp. 1-5, 2018.
Domestic Conferences
- Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Response type selection for a chatting spoken dialogue system based on distributed representation of utterances,” Spoken Language Symposium, Technical Journal of Spoken Language, SP2017-55, pp. 1-5, Dec. 2018.
- Motoki Shimogasa, Hiromitsu Nishizaki, Meiko Fukuda, Ryota Nishimura, and Norihide Kitaoka, “A study of speech recognition models for spontaneous speech of very elderly people,” Proceedings of the Acoustical Society of Japan, 1-R-10, pp. 977-978, Mar. 2018.
- Taiki Yamamoto, Ryota Nishimura, Masayuki Misaki, and Norihide Kitaoka, “Magic Word detection method in continuous speech using LSTM Neural Network,” Proceedings of the Acoustical Society of Japan, 1-R-21, pp. 1009-1012, Mar. 2018.
- Yuya Kobashi, Ryota Nishimura and Norihide Kitaoka, “Discovery of unknown words based on changes in words used in Twitter and adaptation of language models for speech recognition based on them,” in Proceedings of the Acoustical Society of Japan, 1-R-24, pp. 1017-1020, Mar. 2018.
- Ryota Nishimura, Miho Higaki, and Norihide Kitaoka, “Mapping of Acoustic Vector Time Series to Document Vectors Based on RNNs,” IEICE Technical Report (PRMU2018-32, SP2018-12), 2018.
- Yoshiki Tabata and Akinori Kawachi, “A probabilistic circuit indistinguishability obfuscator for worst-case input samplers,” Symposium on Cryptography and Information Security (SCIS), 1B1-5, Jan. 2018.
- Kanta Kiyohara, Ryota Nishimura, and Norihide Kitaoka, “A Geometry Problem Learning Support System for Understanding Pointing and Oral Explanation,” FIT-2018, J-011, (2 pages), Mar. 2018.
- Ryota Nishimura, Chen Bohan, and Norihide Kitaoka, “A method for registering unknown words to language models in speech recognition,” in Proceedings of the Acoustical Society of Japan, 1-Q-23, pp. 127-130, Mar. 2018.
- Kengo Ohta, Ryota Nishimura, and Norihide Kitaoka, “Response Type Selection for a Chatting Spoken Dialogue System Based on LSTM-RNN Considering Word Order,” Proceedings of the Acoustical Society of Japan, 2-8-7, pp. 45-48, Mar. 2018.
- Eiji Seto, Ryota Nishimura, and Norihide Kitaoka, “User adaptation of an example-based chatter spoken dialogue system based on distributed representation of words,” Proceedings of the Acoustical Society of Japan, 2-8-8, pp. 49-52, Mar. 2018.
- Takuma Nakagawa, Ryota Nishimura, Yurie Iribe, Sachio Ishiguro, Susumu Osuga, and Norihide Kitaoka, “Multimodal interaction in the operation of automated vehicles,” in Proceedings of the Acoustical Society of Japan, 2-8-10, pp. 57-60, Mar. 2018.
- Kazuaki Kajinami, Ryota Nishimura, and Norihide Kitaoka, “Construction of a dialogue database for the development of a spoken dialogue failure detection method,” Proceedings of the Acoustical Society of Japan, 2-Q-14, pp. 177-180, Mar. 2018.
- Manami Kawashima, Yurie Iribe, and Norihide Kitaoka, “Discrimination of dementia tendency based on linguistic and acoustic features extracted from elderly people’s dialogue speech,” in Proceedings of the Acoustical Society of Japan, 2-Q-36, pp. 369-370, Mar. 2018.
2017
Tutorial Papers
- Yurie Iribe and Norihide Kitaoka, “Corpus construction of very elderly people’s speech for speech recognition,” Minor Special Issue – Sound environment for elderly and visually impaired people -, J. Acoust. -310, May, 2017.
International Conferences
- Kengo Ohta, Rikito Marumoto, Ryota Nishimura, Norihide Kitaoka, “Selecting type of response for chat-like spoken dialogue systems based on acoustic features of user utterances,” Proc. APSIPA2017, 5 pages, Dec. 2017.
- Takahiro Uchiya, Ryota Nishimura, Takahiro Hirano, Masaru Sakurai, “Design of Reminiscence Therapy System for Elderly People with Dementia,” Proc. BWCCA2017-Workshop-RI3C-2017, pp. 844-853, Nov., 2017.
- Takuma Nakagawa, Ryota Nishimura, Yurie Iribe, Yoshio Ishiguro, Shin Osuga, Norihide Kitaoka, “A human machine interface framework for autonomous vehicle control,” Proc. GCCE 2017, pp. 411-413, Oct., 2017.
- Takahiro Uchiya, Satoshi Otake, Ryota Nishimura, Daisuke Yamamoto, Ichi Takumi, “Extraction of User Preferences based on Voice Interaction,” Proc. GCCE 2017, pp. 416-417, Oct., 2017.
- Ryota Nishimura, Takahiro Uchiya, Takahiro Hirano, Masaru Sakurai, “Proposal of Reminiscence Therapy System using Spoken Dialog to Suppress Dementia,” Proc. GCCE 2017, pp. 418-419, Oct., 2017.
- Eichi Seto, Norihide Kitaoka, “User adaptation of input-response pairs in an example-based dialog system using distributed representations of words,” Proc. ICAICTA2017, Aug., 2017.
- Akinori Kawachi, Kenichi Kawano, François Le Gall, and Suguru Tamaki, “Quantum query complexity of unitary operator discrimination,” Proc. COCOON’17, SESSION 2, Aug., 2017.
- Akinori Kawachi and Yoshiki Tabata, “On indistinguishability obfuscation of probabilistic circuits for worst-case-input subexponentially indistinguishable samplers,” The 12th International Workshop on Security (IWSEC’17), Poster Session, Aug., 2017.
Invited talk
- Norihide Kitaoka, Yurie Iribe, “Recording, analysis and recognition of elderly speech,” Symposium on Speech Resource Utilisation (invited talk), Sep. 2017.
Domestic Conferences and Research Meetings
- Takuma Nakagawa, Susumu Ohsuga, and Norihide Kitaoka, “Multimodal interaction with self-driving cars using voice, pointing and gaze recognition,” IEICE General Conference, D-14-4 (1 page), Mar. 2017.
- Yuki Kurokawa, Yurie Iribe, and Norihide Kitaoka, “Analysis of dementia tendency in the elderly using acoustic features,” Proceedings of the Acoustical Society of Japan, 1-Q-36, pp. 313-314, Mar. 2017.
- Yuki Sawada, Yurie Iribe, and Norihide Kitaoka, “Estimation of speech for systems during driving using multimodal information,” in Proceedings of the Acoustical Society of Japan, 2-P-6, pp. 149-150, Mar. 2017.
- Yuzo Fuyuno, Norihide Kitaoka, and Peng Zhiyuan, “An analysis of feature-preference relationships in task-oriented and non-task-oriented dialogues with dialogue systems,” in Proceedings of the Acoustical Society of Japan, 2-P-13, pp. 171-172, Mar. 2017.
- Kengo Ohta, Rikito Marumoto, and Norihide Kitaoka, “Response type selection for a chat dialogue system based on acoustic information of user utterances,” Proceedings of the Acoustical Society of Japan, 3-5-4, pp. 71-74, Mar. 2017.
- Akinori Kawachi, Kenichi Kawano, Rugal Francois, and Taku Tamaki, “Question Computational Complexity for Unitary Arithmetic Identification Problems,” RIMS Research Conference on Advanced Theoretical Computer Science (Winter LA Symposium), S3, Feb. 2017.
Book
- Kenji Mase and Norihide Kitaoka, Encyclopaedia of Artificial Intelligence (Chapter 9, General introduction), ed. by the Japanese Association for Artificial Intelligence, pp. 696-705, ISBN 978-4320124202, Jul. 2017.
2016
Journal Papers
- Bohan Chen, Norihide Kitaoka, Kazuya Takeda, “Impact of acoustic similarity on efficiency of verbal information transmission via subtle prosodic cues,” EURASIP Journal on Audio, Speech, and Music Processing, 2016:19, 2016. (DOI: 10.1186/s13636-016-0097-6)
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, “Investigation of DNN-based audio-visual speech recognition,” IEICE Trans. Inf. & Syst., pp. 2444-2451, Oct., 2016.
Foreword
- Norihide Kitaoka, “FOREWORD: Special section on recent advances in machine learning in spoken language processing,” IEICE Trans. Inf. & Syst., Vol. E99-D, No. 10, p. 2422, Oct., 2016.
International Conferences
- Takuma Nakagawa, Norihide Kitaoka, “Multimodal control system for autonomous vehicles using speech and gesture recognition,” 5th ASA/ASJ Joint Meeting, Nov., 2016.
- Eichi Seto, Norihide Kitaoka, “Example-based spoken chat system which can be customized for each user,” 5th ASA/ASJ Joint Meeting, Nov., 2016.
- Norihide Kitaoka, Shuhei Segawa, Kazuya Takeda, “Emotion recognition from speech using a physical model,” Proc. ICA2016, ICA2016-714 (8 pages), Sep., 2016.
- Yurie Iribe, Norihide Kitaoka and Shuhei Segawa, “Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese,” Proc. LREC 2016, pp. 4674-4677, May, 2016.
Domestic Conferences and Research Meetings
- Eiji Seto and Norihide Kitaoka, “Case Adaptation of a Chat Dialogue System Using Distributed Representation of Words,” Proceedings of the Acoustical Society of Japan, 2-Q-8, (4 pages), Sep. 2016.
- Takuma Nakagawa and Norihide Kitaoka, “Multimodal Interaction with Automatic Driving Vehicles Using Speech and Pointing,” Proceedings of the Acoustical Society of Japan, 3-Q-11, (4 pages), Sep. 2016.
- Tomoki Hayashi, Norihide Kitaoka, Tomoki Toda, Kazuya Takeda, “An Adaptation Method in Recognition of Everyday Activities Based on Deep Neural Network,” IEICE Technical Report, SP2016-27, pp. 1-6, Aug. 2016.
- Bohan Chen, Norihide Kitaoka and Kazuya Takeda, “Difference of prosodic information transmission efficiency caused by verbally meaningless acoustic difference: An experimental study,” Proceedings of the Acoustical Society of Japan, 2-Q-32, (2 pages), Mar. 2016.
Book
- Norihide Kitaoka (Editorial Board Member (Field Secretary, Speech)), Acoustic Keyword Book, ISBN 978-4-339-00880-7, Mar. 2016.
2015
Journal Papers
- Ken Ichikawa, Norihide Kitaoka, Satoru Tsuge, Kazuya Takeda, Kenji Kita, “Improving the Robustness of Various Text Retrieval Models for Speech Document Retrieval,” Transactions of Information Processing Society of Japan, Vol. 56, No. 3, pp. 1003-1012, Mar. 2015.
- Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “An evaluation method of aggressiveness of driving behaviour using drive recorders,” IEEJ Journal of Industry Applications, Vol. 4, No. 1, pp. 59-66, 2015.
Letters
- Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “Modelling of Physical Characteristics of Speech under Stress,” IEEE Signal Processing Letters, (accepted), 2015.
International Conferences
- Shuhei Segawa, Norihide Kitaoka, Kazuya Takeda, “Elderly person’s emotional state estimation in conversation based on speech features for spoken dialogue systems,” 12th Western Pacific Acoustics Conference 2015 (WESPAC2015), pp. 299-301, Dec., 2015.
- Bohan Chen, Norihide Kitaoka, Kazuya Takeda, “Relationship between Speaker/Listener Similarity and Information Transmission Quality in Speech Communication,” APSIPA ASC 2015, pp. 1190-1193, Dec., 2015.
- Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda, “Daily activity recognition based on acoustic signals and acceleration signals estimated with Gaussian process,” APSIPA ASC 2015, pp. 279-282, Dec., 2015.
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Audio-visual speech recognition using deep bottleneck features and high-performance lipreading,” APSIPA ASC 2015, pp. 575-582, Dec., 2015.
- Yurie Iribe, Norihide Kitaoka, Shuhei Segawa, “Development of new speech corpus for elderly Japanese speech recognition,” Oriental-COCOSDA/CASLRE, pp. 27-31, Oct., 2015.
- Bohan Chen, Norihide Kitaoka, Kazuya Takeda, “Effect of speaking rate and speech complexity on transmission quality during driving navigation task,” 7th Biennial Workshop on DSP for In-Vehicle Systems and Safety, 4 pages, Oct., 2015.
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Audio-visual processing toward robust speech recognition in cars,” 7th Biennial Workshop on DSP for In-Vehicle Systems and Safety, 4 pages, Oct., 2015.
- Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu, “Investigation of DNN-based modeling for audio-visual speech recognition,” 2015 First International Workshop on Machine Learning in Spoken Language Processing (MLSLP2015), (6 pages), Oct., 2015.
- Hiroshi Ninomiya, Norihide Kitaoka, Satoshi Tamura, Yurie Iribe, Kazuya Takeda, “Integration of Deep Bottleneck Features for Audio-Visual Speech Recognition,” Proc. INTERSPEECH, pp. 563-566, Sep., 2015.
- Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda, “Daily activity recognition based on DNN using environmental sound and acceleration signals,” Proc. EUSIPCO 2015, pp. 2351-2355, Sep. 2015.
- Yuto Dekiura, Tetsuya Matsumoto, Yoshinori Takeuchi, Hiroaki Kudo, Noboru Onishi, Norihide Kitaoka, Kazuya Takeda, “Fast Separation and Accurate Recognition of Overlapped Speech — Separation by Spectral Subtraction and Acoustic Model Training using Separated Speeches—,” 2015 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP’15), pp. 1-4, Mar., 2015.
- Katsuya Sakoyama, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “Tracking Roadside Signage Observed by Drivers,” 2015 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP’15), pp. 429-432, Mar., 2015.
Domestic Conferences and Research Meetings
- Tetsushi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Susumu Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayami, “Multimodal speech recognition using deep learning – A survey of deep learning applications,” 2nd Silent Speech Recognition Workshop, ID 16, p. 8, Oct. 2015.
- Tetsushi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Susumu Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayami, “Multimodal speech recognition using deep learning – improvement of image features,” 2nd Silent Speech Recognition Workshop, ID 15, p. 8, Oct. 2015.
- Tetsushi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Susumu Osuga, Yurie Iribe, Kazuya Takeda, and Satoru Hayami, “Multimodal speech recognition using bottleneck features by deep learning,” IEICE Technical Report, SP2015-69, vol. 115, no. 253, pp. 57-62, Oct. 2015.
- Tetsushi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Susumu Osuga, Yurie Iribe, Kazuya Takeda and Satoru Hayami, “Multimodal speech recognition using acoustic and image features by deep learning,” Proceedings of the Acoustical Society of Japan, 3-2-5, (2 pages), Sep. 2015.
- Shuhei Segawa, Norihide Kitaoka and Kazuya Takeda, “Emotion Recognition of Elderly People from Speech for Application to Dialogue Strategies in Spoken Dialogue Systems,” Proceedings of the Acoustical Society of Japan, 3-Q-19, (2 pages), Sep., 2015.
- Bohan Chen, Norihide Kitaoka, Mihoko Otake and Kazuya Takeda, “Probabilistic modelling of speaker alternation and evaluation of speaker activity using information content,” Proceedings of the Acoustical Society of Japan, 1-Q-35, (4 pages), Sep. 2015.
- Bohan Chen, Norihide Kitaoka, Mihoko Otake, Kazuya Takeda, “Evaluation of speaker engagement using turn-taking behaviour entropy,” IEICE Technical Report SP, SP2015-52, pp. 13-17, Jun. 2015.
- Nitoto Kawai, Norihide Kitaoka, Kazuya Takeda, “A method for correcting English pronunciation by presenting approximate pronunciation based on prosody-corrected learners’ speech and Japanese syllables,” Proceedings of the Acoustical Society of Japan, 1-2-10, (4 pages), Mar. 2015.
- Chen, H., N. Kitaoka, and K. Takeda, “Rational Speech Feature Control in Speech Information Transfer and Its Effect on Transfer Efficiency,” Proceedings of the Acoustical Society of Japan, 1-R-20, (4 pages), Mar. 2015.
- Tomoki Hayashi, Masashi Nishida, Norihide Kitaoka, Kazuya Takeda, “Recognition of daily life behaviour using environmental sound and acceleration signals by DNN,” Proceedings of the Acoustical Society of Japan, 2-1-16, (4 pages), Mar. 2015.
Book
- Norihide Kitaoka, Evolving Speech Communication between Humans and Machines (Part 4, Chapter 2: Task-oriented Dialogue), Nikkei Printing, ISBN 978-4-86469-065-2, Sep. 2015.
2014 (Oct.~)
International Conferences
- Norihide Kitaoka, Tomoki Hayashi, Kazuya Takeda, “Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features,” APSIPA ASC 2014, (5 pages), Oct., 2014.
- Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda, “Development and preliminary analysis of sensor signal database of continuous daily living activity over the long term,” APSIPA ASC 2014, (6 pages), Oct., 2014.
- Panikos Heracleous, Pongtep Angkititrakul, Norihide Kitaoka, Kazuya Takeda, “Unsupervised energy disaggregation using conditional random fields,” IEEE ISGT Europe 2014, (6 pages), Oct., 2014.
- Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda, “Measuring Aggressive Driving Behavior Using,” IEEE ITSC14, pp. 1886-1887, Oct., 2014.
Domestic Conferences and Research Meetings
- Kazuki Morita, Chiyomi Miyajima, Norihide Kitaoka and Kazuya Takeda, “Evaluation of Arranged Song Retrieval Performance Focusing on Differences in Song Structure,” IEICE General Conference, D-12-4, (1 page), Mar. 2015.
- Chen, H., Kitaoka, N., and Takeda, K., “Relationship between speech feature similarity among interlocutors and the information transfer effect of dialogue,” Spoken Language Symposium, SP2014-124, pp. 147-152, Dec. 2014.