Accurate recognition of auditory brainstem response (ABR) wave latencies is essential for clinical practice but remains a subjective and time-consuming process. Existing AI approaches face challenges in generalization, complexity, and semantic sparsity due to single sampling-point analysis. This study introduces the Derivative-Guided Patch Dual-Attention Transformer (Patch-DAT), a novel, lightweight, and generalizable deep learning (DL) model for the automated recognition of the latencies of waves I, III, and V. Patch-DAT divides the ABR time series into overlapping patches to aggregate semantic information, better capturing local temporal patterns. Meanwhile, leveraging the fact that ABR waves occur at zero crossings of the first derivative, Patch-DAT incorporates a first derivative-guided dual-attention mechanism to model global dependencies. Trained and validated on large-scale, diverse datasets from two hospitals, Patch-DAT (with a size of 0.36 MB) achieves accuracies of 92.29% and 98.07% at the 0.1 ms and 0.2 ms error scales, respectively, on a held-out test set. It also performs well on an independent dataset, with accuracies of 88.50% and 95.14%, demonstrating strong generalization across clinical settings. Ablation studies highlight the contributions of the patching strategy and the dual-attention mechanism. Compared to previous state-of-the-art DL models, Patch-DAT shows superior accuracy and reduced complexity, making it a promising solution for objective recognition of ABR latencies. Additionally, we systematically investigate how sample size and data heterogeneity affect model generalization, underscoring the importance of large, diverse datasets for training robust DL models. Future work will focus on expanding dataset diversity and improving model interpretability to further enhance clinical relevance.
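The abstract names two core ideas: splitting the ABR trace into overlapping patches and using zero crossings of the first derivative to guide attention toward candidate peak positions. The minimal Python sketch below (not taken from the paper) illustrates only these two preprocessing steps; the function names, patch length, stride, and sampling step are illustrative assumptions rather than the authors' actual configuration.

import numpy as np

def extract_overlapping_patches(signal, patch_len=16, stride=8):
    """Split a 1D waveform into overlapping patches of length patch_len."""
    starts = range(0, len(signal) - patch_len + 1, stride)
    return np.stack([signal[s:s + patch_len] for s in starts])

def first_derivative_zero_crossings(signal, dt=0.05):
    """Return sample indices where the first derivative changes sign,
    i.e. candidate locations of ABR peaks such as waves I, III, and V."""
    d = np.gradient(signal, dt)
    return np.where(np.diff(np.sign(d)) != 0)[0]

if __name__ == "__main__":
    t = np.arange(0.0, 10.0, 0.05)                                   # 10 ms trace, 0.05 ms step (assumed)
    x = np.exp(-(t - 5.5) ** 2) + 0.3 * np.sin(2 * np.pi * 0.8 * t)  # toy ABR-like trace, not real data
    patches = extract_overlapping_patches(x)                         # shape (num_patches, patch_len)
    peaks = first_derivative_zero_crossings(x)
    print(patches.shape, peaks[:5])

In the actual model, such patches would be embedded as transformer tokens and the derivative information would bias the attention maps; those steps are beyond the scope of this sketch.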
Funding:
National Key Research and Development Program of China [2022YFA1004100]; National Natural Science Foundation of China [62301096, 62371084]; Natural Science Foundation of Chongqing, China [CSTB2023NSCQ-MSX0659]; Science and Technology Research Program of Chongqing Municipal Education Commission [KJQN202400632]
First author affiliation: [1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China; [2] Chongqing Univ Posts & Telecommun, Inst Adv Sci, Chongqing 400065, Peoples R China
Corresponding author:
Recommended citation (GB/T 7714):
Liu Yin, Sun Huanghong, Li Qiang, et al. Derivative-Guided Dual-Attention Mechanisms in Patch Transformer for Efficient Automated Recognition of Auditory Brainstem Response Latency[J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2025, 33: 1865-1877. DOI: 10.1109/TNSRE.2025.3558730.
APA:
Liu, Yin, Sun, Huanghong, Li, Qiang, Li, Kangkang, Fu, Xinxing, ... & Gao, Chenqiang. (2025). Derivative-Guided Dual-Attention Mechanisms in Patch Transformer for Efficient Automated Recognition of Auditory Brainstem Response Latency. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 33, 1865-1877. https://doi.org/10.1109/TNSRE.2025.3558730
MLA:
Liu, Yin, et al. "Derivative-Guided Dual-Attention Mechanisms in Patch Transformer for Efficient Automated Recognition of Auditory Brainstem Response Latency." IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING 33 (2025): 1865-1877.