电子耳蜗前端麦克风阵列语音增强技术的研究与进展_《生物医学工程学杂志》

作者：

 陈又圣 , 陈伟芳 , 张璞 , 陈培培

深圳信息职业技术学院（广东深圳 518000）;

关键词：

电子耳蜗麦克风阵列语音增强波束形成

DOI：

10.7507/1001-5515.201805050

视频：

导出 下载 收藏 扫码 引用

摘要 全文 图表 视频 参考文献 施引文献 补充材料

麦克风阵列的方法在近年来被逐渐应用在电子耳蜗前端语音增强和提高言语识别率的研究里。该方法通过在空间不同的位置上放置若干麦克风，可以采集包含大量空间位置和方位信息的多通道信号，并形成增强目标信号和抑制干扰信号的特定波束指向模式。该方法更加适合用于电子耳蜗增强面对面交流的应用场景，其应用价值受到越来越多研究人员的关注。本文对麦克风阵列波束形成的原理进行阐述，并对目前文献中基于麦克风阵列的语音增强技术进行分析，归纳和总结了其中的技术难点和发展趋势。

引用本文： 陈又圣, 陈伟芳, 张璞, 陈培培. 电子耳蜗前端麦克风阵列语音增强技术的研究与进展. 生物医学工程学杂志, 2019, 36(4): 696-704. doi: 10.7507/1001-5515.201805050 复制

图1 麦克风阵列信号采集原理图

Figure1. Schematic diagram of signal acquisition principle in microphone array

图选项

下载全尺寸图像

下载幻灯片

图2 特定参数条件下的不同方位的系统幅频响应曲线

Figure2. System amplitude based on specific parameters

图选项

下载全尺寸图像

下载幻灯片

图3 不同延迟参数值的极性图和系统零点

Figure3. Beam patterns and system nulls for different delay parameters

图选项

下载全尺寸图像

下载幻灯片

图4 双耳佩戴电子耳蜗和助听器的示意图

Figure4. A schematic diagram of binaural cochlear and hearing aids

图选项

下载全尺寸图像

下载幻灯片

图5 单通道语音增强技术和麦克风阵列结合的去噪方法

Figure5. Noise suppression method based on the combination of single channel speech enhancement technology and microphone array technology

图选项

下载全尺寸图像

下载幻灯片

图6 不同频率条件下的双麦克风极性图

Figure6. Beam patterns of dual-microphone system based on different frequencies

图选项

下载全尺寸图像

下载幻灯片

图7 语音信号和环境噪声的频谱对比

Figure7. Spectrum comparison of speech signal and environ mental noise

图选项

下载全尺寸图像

下载幻灯片

图8 角度偏移 1～8° 的双指向性麦克风极性图对比

Figure8. Comparison of beam patterns for 1–8° angle offset in dual-microphone system

图选项

下载全尺寸图像

下载幻灯片

图9 双耳佩戴麦克风的双麦克风极性图的波束变化

Figure9. Changing of beams in beam patterns of dual-microphone system for situation of biauricular distance

图选项

下载全尺寸图像

下载幻灯片

1.	World Hearth Organization (WHO). Deafness and hearing loss[EB/OL]. (2018-03-15)[2019-02-20]. http://www.who.int/en/news-room/fact-sheets/detail/deafness-and-hearing-loss.
2.	银力, 屠文河, 高姗仙, 等. 耳聋与助听设备的选择. 中国医疗器械信息, 2016(5): 23-29, 63.
3.	向琳. 儿童人工耳蜗植入后康复效果及影响因素研究. 长春: 吉林大学, 2017.
4.	National Institute on Deafness and Other Communication Disorders (NIDCD). Cochlear implants[EB/OL]. (2017-03-06)[2019-02-20]. https://www.nidcd.nih.gov/health/cochlear-implants.
5.	Lu C K, Wang S W. Peak-triggered sampling circuitry for a fine-structure-aware cochlear implant//IEEE Region 10 Conference (TENCON). Penang, Malaysia: IEEE, 2017: 31-34.
6.	Langner F, Saoji A A, Büchner A, et al. Adding simultaneous stimulating channels to reduce power consumption in cochlear implants. Hear Res, 2017, 345: 96-107.
7.	Padilla M, Stupak N, Landsberger D M. Pitch ranking with different virtual channel configurations in electrical hearing. Hear Res, 2017, 348: 54-62.
8.	Guan T, Yang M, Wei Z, et al. Simulation of the optical stimulation mechanism of cochlear nerves. Journal of Tsinghua University, 2017, 57(10): 1102-1105.
9.	Jiang Bin, Xia Nan, Wang Xing, et al. Auditory responses to short-wavelength infrared neural stimulation of the rat cochlear nucleus//39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’17). Seogwipo, South Korea: IEEE, 2017: 1942-1945.
10.	Wang Jingxuan, Lu Jianren, Tian Lan. Effect of fiberoptic collimation technique on 808 nm wavelength laser stimulation of cochlear neurons. Photomed Laser Surg, 2016, 34(6): 252-257.
11.	Anderson S R, Kan A, Thakkar T, et al. Pitch magnitude estimation can predict across-ear pitch comparisons in cochlear-implant users. J Acoust Soc Amer, 2017, 141(5): 3815.
12.	Van Eyndhoven S, Francart T, Bertrand A. EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses. IEEE Trans Biomed Eng, 2017, 64(5): 1045-1056.
13.	Ma X J, Sudanthi W, Zhou Y, et al. Simulation for training cochlear implant electrode insertion//30th IEEE International Symposium on Computer-Based Medical Systems (IEEE CBMS 2017). Thessaloniki, Greece: IEEE, 2017: 1–6.
14.	Chen Yousheng, Chen Weifang. Research on fractional delay filter and mismatch feature based on least mean square rule for CI device//9th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC). Hangzhou, China: IEEE, 2017: 308-311.
15.	Arora S V, Vig R. Comparison of speech intelligibility parameter in cochlear implants by spatial filtering and coherence function methods. International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE). Ghaziabad, India: IEEE, 2016: 573-577.
16.	Wimmer W, Kompis M, Stieger C, et al. Directional microphone contralateral routing of signals in cochlear implant users: a within-subjects comparison. Ear Hear, 2017, 38(3): 368-373.
17.	Mosnier I, Mathias N, Flament J, et al. Benefit of the UltraZoom beamforming technology in noise in cochlear implant users. Eur Arch Otorhinolaryngol, 2017, 274(9): 3335-3342.
18.	Gong Qin, Chen Yousheng. Parameter selection methods of delay and beamforming for cochlear implant speech enhancement. Acoust Phys, 2011, 57(4): 542-550.
19.	Li Xingxing, Wang Dangwei, Ma Xiaoyan, et al. Robust adaptive beamforming using iterative variable loaded sample matrix inverse. Electron Lett, 2018, 54(9): 546-548.
20.	Zohourian M, Enzner G, Martin R. Binaural speaker localization integrated into an adaptive beamformer for hearing aids. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018, 26(3): 515-528.
21.	Xiao Jinjun, Luo Zhiquan, Merks I, et al. A robust adaptive binaural beamformer for hearing devices//2017 51st Asilomar Conference on Signals, Systems, and Computers. Pacific Grove, USA: IEEE, 2017: 1885-1889.
22.	Zeng Fangang. Challenges in improving cochlear implant performance and accessibility. IEEE Trans Biomed Eng, 2017, 64(8): 1662-1664.
23.	Lockwood M E, Jones D L, Bilger R C, et al. Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms. J Acoust Soc Am, 2004, 115(1): 379-391.
24.	Ehlers E, Goupell M J, Zheng Yi, et al. Binaural sensitivity in children who use bilateral cochlear implants. J Acoust Soc Am, 2017, 141(6): 4264-4277.
25.	Lopez-Poveda E A, Eustaquio-Martín A, Stohl J S, et al. Intelligibility in speech maskers with a binaural cochlear implant sound coding strategy inspired by the contralateral medial olivocochlear reflex. Hear Res, 2017, 348: 134-137.
26.	Sheffield B M, Schuchman G, Bernstein J G. Pre- and postoperative binaural unmasking for bimodal cochlear implant listeners. Ear Hear, 2017, 38(5): 554-567.
27.	Goupell M J, Stakhovskaya O A, Bernstein J G. Contralateral interference caused by binaurally presented competing speech in adult bilateral cochlear-implant users. Ear Hear, 2018, 39(1): 110-123.
28.	罗鑫, 傅前杰, 王仁华. 联合使用助听器和增强电子耳蜗的使用者的中文语音识别. 北京生物医学工程, 2005, 24(4): 250-253, 267.
29.	Kates J M, Weiss M R. A comparison of hearing-aid array-processing techniques. J Acoust Soc Am, 1996, 99(5): 3138-3148.
30.	Chen Yousheng, Gong Qin. Real-time spectrum estimation-based dual-channel speech-enhancement algorithm for cochlear implant. Biomed Eng Online, 2012, 11(74). DOI: 10.1186/1475-925X-11-74.
31.	Kim J, Lee H, Hoonteak L, et al. A micromachined microphone based on the electret membrane and field-effect-transistor mechano-electrical transduction. J Acoust Soc Am, 2017, 142(4): 2567.
32.	朱振岭, 陈日林, 杨亦春. 差分传声器阵列低频特性优化研究. 应用声学, 2016, 35(6): 505-510.
33.	Duan X, Giddings R P, Mansoor S, et al. Performance tolerance of IMDD DFMA PONs to channel frequency response roll-off. IEEE Photonics Technology Letters, 2017, 29(19): 1655-1658.
34.	Chen Yousheng, Gong Qin. Broadband beamforming compensation algorithm in CI front-end acquisition. Biomed Eng Online, 2013, 12(18). DOI: 10.1186/1475-925X-12-18.

1. World Hearth Organization (WHO). Deafness and hearing loss[EB/OL]. (2018-03-15)[2019-02-20]. http://www.who.int/en/news-room/fact-sheets/detail/deafness-and-hearing-loss.
2. 银力, 屠文河, 高姗仙, 等. 耳聋与助听设备的选择. 中国医疗器械信息, 2016(5): 23-29, 63.
3. 向琳. 儿童人工耳蜗植入后康复效果及影响因素研究. 长春: 吉林大学, 2017.
4. National Institute on Deafness and Other Communication Disorders (NIDCD). Cochlear implants[EB/OL]. (2017-03-06)[2019-02-20]. https://www.nidcd.nih.gov/health/cochlear-implants.
5. Lu C K, Wang S W. Peak-triggered sampling circuitry for a fine-structure-aware cochlear implant//IEEE Region 10 Conference (TENCON). Penang, Malaysia: IEEE, 2017: 31-34.
6. Langner F, Saoji A A, Büchner A, et al. Adding simultaneous stimulating channels to reduce power consumption in cochlear implants. Hear Res, 2017, 345: 96-107.
7. Padilla M, Stupak N, Landsberger D M. Pitch ranking with different virtual channel configurations in electrical hearing. Hear Res, 2017, 348: 54-62.
8. Guan T, Yang M, Wei Z, et al. Simulation of the optical stimulation mechanism of cochlear nerves. Journal of Tsinghua University, 2017, 57(10): 1102-1105.
9. Jiang Bin, Xia Nan, Wang Xing, et al. Auditory responses to short-wavelength infrared neural stimulation of the rat cochlear nucleus//39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’17). Seogwipo, South Korea: IEEE, 2017: 1942-1945.
10. Wang Jingxuan, Lu Jianren, Tian Lan. Effect of fiberoptic collimation technique on 808 nm wavelength laser stimulation of cochlear neurons. Photomed Laser Surg, 2016, 34(6): 252-257.
11. Anderson S R, Kan A, Thakkar T, et al. Pitch magnitude estimation can predict across-ear pitch comparisons in cochlear-implant users. J Acoust Soc Amer, 2017, 141(5): 3815.
12. Van Eyndhoven S, Francart T, Bertrand A. EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses. IEEE Trans Biomed Eng, 2017, 64(5): 1045-1056.
13. Ma X J, Sudanthi W, Zhou Y, et al. Simulation for training cochlear implant electrode insertion//30th IEEE International Symposium on Computer-Based Medical Systems (IEEE CBMS 2017). Thessaloniki, Greece: IEEE, 2017: 1–6.
14. Chen Yousheng, Chen Weifang. Research on fractional delay filter and mismatch feature based on least mean square rule for CI device//9th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC). Hangzhou, China: IEEE, 2017: 308-311.
15. Arora S V, Vig R. Comparison of speech intelligibility parameter in cochlear implants by spatial filtering and coherence function methods. International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE). Ghaziabad, India: IEEE, 2016: 573-577.
16. Wimmer W, Kompis M, Stieger C, et al. Directional microphone contralateral routing of signals in cochlear implant users: a within-subjects comparison. Ear Hear, 2017, 38(3): 368-373.
17. Mosnier I, Mathias N, Flament J, et al. Benefit of the UltraZoom beamforming technology in noise in cochlear implant users. Eur Arch Otorhinolaryngol, 2017, 274(9): 3335-3342.
18. Gong Qin, Chen Yousheng. Parameter selection methods of delay and beamforming for cochlear implant speech enhancement. Acoust Phys, 2011, 57(4): 542-550.
19. Li Xingxing, Wang Dangwei, Ma Xiaoyan, et al. Robust adaptive beamforming using iterative variable loaded sample matrix inverse. Electron Lett, 2018, 54(9): 546-548.
20. Zohourian M, Enzner G, Martin R. Binaural speaker localization integrated into an adaptive beamformer for hearing aids. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018, 26(3): 515-528.
21. Xiao Jinjun, Luo Zhiquan, Merks I, et al. A robust adaptive binaural beamformer for hearing devices//2017 51st Asilomar Conference on Signals, Systems, and Computers. Pacific Grove, USA: IEEE, 2017: 1885-1889.
22. Zeng Fangang. Challenges in improving cochlear implant performance and accessibility. IEEE Trans Biomed Eng, 2017, 64(8): 1662-1664.
23. Lockwood M E, Jones D L, Bilger R C, et al. Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms. J Acoust Soc Am, 2004, 115(1): 379-391.
24. Ehlers E, Goupell M J, Zheng Yi, et al. Binaural sensitivity in children who use bilateral cochlear implants. J Acoust Soc Am, 2017, 141(6): 4264-4277.
25. Lopez-Poveda E A, Eustaquio-Martín A, Stohl J S, et al. Intelligibility in speech maskers with a binaural cochlear implant sound coding strategy inspired by the contralateral medial olivocochlear reflex. Hear Res, 2017, 348: 134-137.
26. Sheffield B M, Schuchman G, Bernstein J G. Pre- and postoperative binaural unmasking for bimodal cochlear implant listeners. Ear Hear, 2017, 38(5): 554-567.
27. Goupell M J, Stakhovskaya O A, Bernstein J G. Contralateral interference caused by binaurally presented competing speech in adult bilateral cochlear-implant users. Ear Hear, 2018, 39(1): 110-123.
28. 罗鑫, 傅前杰, 王仁华. 联合使用助听器和增强电子耳蜗的使用者的中文语音识别. 北京生物医学工程, 2005, 24(4): 250-253, 267.
29. Kates J M, Weiss M R. A comparison of hearing-aid array-processing techniques. J Acoust Soc Am, 1996, 99(5): 3138-3148.
30. Chen Yousheng, Gong Qin. Real-time spectrum estimation-based dual-channel speech-enhancement algorithm for cochlear implant. Biomed Eng Online, 2012, 11(74). DOI: 10.1186/1475-925X-11-74.
31. Kim J, Lee H, Hoonteak L, et al. A micromachined microphone based on the electret membrane and field-effect-transistor mechano-electrical transduction. J Acoust Soc Am, 2017, 142(4): 2567.
32. 朱振岭, 陈日林, 杨亦春. 差分传声器阵列低频特性优化研究. 应用声学, 2016, 35(6): 505-510.
33. Duan X, Giddings R P, Mansoor S, et al. Performance tolerance of IMDD DFMA PONs to channel frequency response roll-off. IEEE Photonics Technology Letters, 2017, 29(19): 1655-1658.
34. Chen Yousheng, Gong Qin. Broadband beamforming compensation algorithm in CI front-end acquisition. Biomed Eng Online, 2013, 12(18). DOI: 10.1186/1475-925X-12-18.

《生物医学工程学杂志》

电子耳蜗前端麦克风阵列语音增强技术的研究与进展

摘要 全文 图表 视频 参考文献 施引文献 补充材料

引言

1 麦克风阵列信号采集和波束形成原理

2 电子耳蜗麦克风阵列语音增强的方法

2.1 固定波束形成方法

2.2 自适应波束形成方法

2.3 双耳电子耳蜗的方法

2.4 单通道语音增强技术和麦克风阵列结合方法

2.5 麦克风阵列语音增强方法的总结和言语识别率的关联分析

3 麦克风阵列语音增强技术在电子耳蜗应用中存在的问题

3.1 低频滚降失真

3.2 信号补偿中的噪声过度放大

3.3 电极数量限制及信号分辨率问题

3.4 麦克风间的增益失配和运动偏移失配问题

3.5 双耳信号采集及波束变化问题

4 总结与展望

引言

1 麦克风阵列信号采集和波束形成原理

2 电子耳蜗麦克风阵列语音增强的方法

2.1 固定波束形成方法

2.2 自适应波束形成方法

2.3 双耳电子耳蜗的方法

2.4 单通道语音增强技术和麦克风阵列结合方法

2.5 麦克风阵列语音增强方法的总结和言语识别率的关联分析

3 麦克风阵列语音增强技术在电子耳蜗应用中存在的问题

3.1 低频滚降失真

3.2 信号补偿中的噪声过度放大

3.3 电极数量限制及信号分辨率问题

3.4 麦克风间的增益失配和运动偏移失配问题

3.5 双耳信号采集及波束变化问题

4 总结与展望

上一篇

Format

Content

《生物医学工程学杂志》

电子耳蜗前端麦克风阵列语音增强技术的研究与进展

摘要 全文 图表 视频 参考文献 施引文献 补充材料

引言

1 麦克风阵列信号采集和波束形成原理

2 电子耳蜗麦克风阵列语音增强的方法

2.1 固定波束形成方法

2.2 自适应波束形成方法

2.3 双耳电子耳蜗的方法

2.4 单通道语音增强技术和麦克风阵列结合方法

2.5 麦克风阵列语音增强方法的总结和言语识别率的关联分析

3 麦克风阵列语音增强技术在电子耳蜗应用中存在的问题

3.1 低频滚降失真

3.2 信号补偿中的噪声过度放大

3.3 电极数量限制及信号分辨率问题

3.4 麦克风间的增益失配和运动偏移失配问题

3.5 双耳信号采集及波束变化问题

4 总结与展望

引言

1 麦克风阵列信号采集和波束形成原理

2 电子耳蜗麦克风阵列语音增强的方法

2.1 固定波束形成方法

2.2 自适应波束形成方法

2.3 双耳电子耳蜗的方法

2.4 单通道语音增强技术和麦克风阵列结合方法

2.5 麦克风阵列语音增强方法的总结和言语识别率的关联分析

3 麦克风阵列语音增强技术在电子耳蜗应用中存在的问题

3.1 低频滚降失真

3.2 信号补偿中的噪声过度放大

3.3 电极数量限制及信号分辨率问题

3.4 麦克风间的增益失配和运动偏移失配问题

3.5 双耳信号采集及波束变化问题

4 总结与展望

上一篇

Format

Content

摘要全文图表视频参考文献施引文献补充材料