Computer audition (CA) has experienced a fast development in the past decades by leveraging advanced signal processing and machine learning techniques. In particular, for its noninvasive and ubiquitous character by nature, CA-based applications in healthcare have increasingly attracted attention in recent years. During the tough time of the global crisis caused by the coronavirus disease 2019 (COVID-19), scientists and engineers in data science have collaborated to think of novel ways in prevention, diagnosis, treatment, tracking, and management of this global pandemic. On the one hand, we have witnessed the power of 5G, Internet of Things, big data, computer vision, and artificial intelligence in applications of epidemiology modeling, drug and/or vaccine finding and designing, fast CT screening, and quarantine management. On the other hand, relevant studies in exploring the capacity of CA are extremely lacking and underestimated. To this end, we propose a novel multitask speech corpus for COVID-19 research usage. We collected 51 confirmed COVID-19 patients' in-the-wild speech data in Wuhan city, China. We define three main tasks in this corpus, i.e., three-category classification tasks for evaluating the physical and/or mental status of patients, i.e., sleep quality, fatigue, and anxiety. The benchmarks are given by using both classic machine learning methods and state-of-the-art deep learning techniques. We believe this study and corpus cannot only facilitate the ongoing research on using data science to fight against COVID-19, but also the monitoring of contagious diseases for general purpose.
基金:
Zhejiang Lab's International Talent Fund for Young Professionals under (Project HANAMI), China; JSPS Postdoctoral Fellowship for Research in Japan from the Japan Society for the Promotion of Science (JSPS), Japan [P19081]; Ministry of Education, Culture, Sports, Science and Technology (MEXT), JapanMinistry of Education, Culture, Sports, Science and Technology, Japan (MEXT) [19F19081, 20H00569]; European UnionEuropean Commission [826506]
语种:
外文
被引次数:
WOS:
PubmedID:
中科院(CAS)分区:
出版当年[2020]版:
大类|1 区工程技术
小类|1 区计算机:信息系统1 区工程:电子与电气1 区电信学
最新[2025]版:
大类|2 区计算机科学
小类|1 区计算机:信息系统1 区电信学2 区工程:电子与电气
JCR分区:
出版当年[2019]版:
Q1COMPUTER SCIENCE, INFORMATION SYSTEMSQ1TELECOMMUNICATIONSQ1ENGINEERING, ELECTRICAL & ELECTRONIC
最新[2023]版:
Q1COMPUTER SCIENCE, INFORMATION SYSTEMSQ1ENGINEERING, ELECTRICAL & ELECTRONICQ1TELECOMMUNICATIONS
第一作者机构:[1]Beijing Inst Technol, Inst Engn Med, Grp Audit Intelligent Med, Beijing 100081, Peoples R China
共同第一作者:
通讯作者:
通讯机构:[7]Wuhan Univ, Wuhan Hosp 3, Dept Plast Surg, Wuhan 430072, Peoples R China[8]Wuhan Univ, Tongren Hosp, Wuhan 430072, Peoples R China
推荐引用方式(GB/T 7714):
Qian Kun,Schmitt Maximilian,Zheng Huaiyuan,et al.Computer Audition for Fighting the SARS-CoV-2 Corona Crisis-Introducing the Multitask Speech Corpus for COVID-19[J].IEEE INTERNET OF THINGS JOURNAL.2021,8(21):16035-16046.doi:10.1109/JIOT.2021.3067605.
APA:
Qian, Kun,Schmitt, Maximilian,Zheng, Huaiyuan,Koike, Tomoya,Han, Jing...&Schuller, Bjoern W..(2021).Computer Audition for Fighting the SARS-CoV-2 Corona Crisis-Introducing the Multitask Speech Corpus for COVID-19.IEEE INTERNET OF THINGS JOURNAL,8,(21)
MLA:
Qian, Kun,et al."Computer Audition for Fighting the SARS-CoV-2 Corona Crisis-Introducing the Multitask Speech Corpus for COVID-19".IEEE INTERNET OF THINGS JOURNAL 8..21(2021):16035-16046