高级检索
当前位置: 首页 > 详情页

Exploring Novel Objective Voice Assessment Parameters: A Pilot Study

文献详情

资源类型:
Pubmed体系:
机构: [1]Department of Otorhinolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China [2]Department of Otorhinolaryngology, Head and Neck Surgery, Huadong Hospital, Fudan University, Shanghai, 200040, China [3]Department of Otorhinolaryngology, Head and Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
出处:
ISSN:

关键词: Mel-frequency cepstral coefficients Voice disorder Machine learning Voice quality Hoarseness

摘要:
Objective voice analysis is vital for diagnosing voice diseases. Traditional vocal assessment parameters, such as jitter, have remained relatively unchanged over the past few decades. Cepstrum technology offers new ways to explore novel vocal assessment parameters. We aimed to explore new parameters applicable to objective voice analysis.A total of 209 patients were analyzed, including 140 patients with hoarseness and 69 healthy controls. Traditional voice characteristics were extracted, and mel-frequency cepstral coefficient (MFCC) analysis was performed. Ten MFCC quantitative parameters were created by statistical analysis. The differences in these MFCC parameters between the two groups were compared. Machine learning algorithms were used to build models, and the importance of each feature was further analyzed on the basis of Shapley Additive exPlanations (SHAPs).There were significant differences between the groups in MFCC-Mean, MFCC-variance (MFCC-Var), MFCC-standard deviation (MFCC-Std), MFCC-75th percentile range, MFCC-maximum value, MFCC-Median, MFCC-skewness, and MFCC-kurtosis values. The values of these parameters in the hoarseness group were significantly greater than those in the normal group. The accuracy, sensitivity, specificity, and the values of the area under the receiver operating characteristic (ROC) curve (AUC) of the hoarseness prediction model established using only the 8 MFCC parameters mentioned above were 95.2%, 94.7%, 96.3%, and 95.5%, respectively, which were only slightly lower than those of the model established using traditional voice assessment indicators, with accuracy, sensitivity, specificity, and AUC values of 96.4%, 96.4%, 96.3%, and 96.4%, respectively. This finding indicates that voice assessment can be performed successfully using only these new MFCC indicators. SHAP analysis revealed that MFCC-Mean, MFCC-Var, and MFCC-Std were the most important predictors of voice disorders.This study is the first to explore novel parameters applicable to objective vocal analysis and shows that parameters such as MFCC-Mean, MFCC-Var, and MFCC-Std are expected to become new voice evaluation indicators.Copyright © 2025. Published by Elsevier Inc.

语种:
PubmedID:
中科院(CAS)分区:
出版当年[2025]版:
大类 | 4 区 医学
小类 | 4 区 听力学与言语病理学 4 区 耳鼻喉科学
最新[2025]版:
大类 | 4 区 医学
小类 | 4 区 听力学与言语病理学 4 区 耳鼻喉科学
第一作者:
第一作者机构: [1]Department of Otorhinolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
共同第一作者:
通讯作者:
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:28495 今日访问量:0 总访问量:1584 更新日期:2025-09-01 建议使用谷歌、火狐浏览器 常见问题

版权所有©2020 首都医科大学附属北京同仁医院 技术支持:重庆聚合科技有限公司 地址:北京市东城区东交民巷1号(100730)