机构:[1]Capital Med Univ, Beijing TongRen Hosp, Dept Otolaryngol Head & Neck Surg, Beijing 100730, Peoples R China首都医科大学附属北京同仁医院临床科室耳鼻咽喉-头颈外科[2]House Ear Res Inst, Los Angeles, CA 90057 USA[3]Univ Calif Los Angeles, David Geffen Sch Med, Dept Head & Neck Surg, Los Angeles, CA 90095 USA
Mandarin is a tonal language, and it is important to preserve lexical tone information in synthesized speech. With natural speech, Chinese cochlear implant (CI) users have difficulty perceiving voice pitch cues important for lexical tone perception; it is unclear whether this difficulty persists in Mandarin synthesized speech. In this study, intelligibility of naturally produced and synthesized Mandarin speech was measured in Chinese CI listeners; intelligibility was also measured in a control group of normal-hearing (NH) listeners. Five synthesized voices were selected to represent different talker genders (male, female, child), speaking rates (normal, slow), and speaking styles (emotional, accent). The data showed that while modern Mandarin text-to-speech (TTS) systems can provide perfect speech intelligibility for NH listeners, overall intelligibility was much poorer for CI than for NH listeners. CI performance was significantly poorer with synthesized speech than with natural speech (p< 0.001). CI listeners were highly sensitive to the "extra-atypical" synthesized emotional and accented speech. Performance with each of the synthesized speech types was significantly correlated with performance with natural speech in CI users (p< 0.01 in all cases). While modern TTS systems offer educational and communication benefits to CI users and hearing-impaired individuals, the selection of synthesized voices should be carefully considered in education applications of TTS for hearing-impaired individuals, especially CI children, since poor intelligibility performance may affect language learning. (C) 2018 Acoustical Society of America.
基金:
National Institutes of HealthUnited States Department of Health & Human ServicesNational Institutes of Health (NIH) - USA [R01-004792]
第一作者机构:[1]Capital Med Univ, Beijing TongRen Hosp, Dept Otolaryngol Head & Neck Surg, Beijing 100730, Peoples R China
通讯作者:
推荐引用方式(GB/T 7714):
Shi Ying,Chen Jingyuan,Gong Yue,et al.Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners[J].JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA.2018,143(5):2886-2891.doi:10.1121/1.5037590.
APA:
Shi, Ying,Chen, Jingyuan,Gong, Yue,Chen, Biao,Li, Yongxin...&Fu, Qian-Jie.(2018).Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners.JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA,143,(5)
MLA:
Shi, Ying,et al."Intelligibility of naturally produced and synthesized Mandarin speech by cochlear implant listeners".JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 143..5(2018):2886-2891