Multicomponent Signal for Comparing Direct and Indirect Methods of Speech Transmission Index Measurement

Authors

  • Arkadiy Prodeus National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" https://orcid.org/0000-0001-7640-0850
  • Oleksandr Dvornyk National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”
  • Anton Naida National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”
  • Maryna Didkovska National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”
  • Oleksandr Grebin National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

DOI:

https://doi.org/10.18372/1990-5548.75.17546

Keywords:

test signal, speech intelligibility, direct method, indirect method, noise disturbance, reverberation

Abstract

When evaluating the intelligibility of speech distorted by noise and reverberation, direct or indirect methods of measuring the speech transmission index are used. However, it remains insufficiently studied how significantly differ the results of measurements obtained by direct and indirect methods. To find an answer to this question, the use of a multicomponent test signal consisting of four "elementary" signals separated by pauses is proposed in this paper. As "elementary" signals, it is proposed to use a maximum-length sequence, a speech shaped maximum-length sequence, a speech shaped stationary noise, and a speech shaped amplitude-modulated noise. Use of amplitude-modulated noise allows estimating speech transmission index by a direct method. Other "elementary" signals make it possible to estimate speech transmission index by two variants of indirect method. The proposed algorithms and corresponding computer programs were tested on trial signal models, while the consistency of the obtained results with the results of previous studies was revealed. The results of the signal models studies show that both considered variants of the indirect speech transmission index measurement method lead to underestimated results compared to the direct method. For one of the variants of the indirect method, the value of the estimate bias is 0.03–0.04, regardless of the interfering conditions. For another variant of the indirect method, the estimate bias varies from 0.01 to 0.18, depending on the interference conditions.

Author Biographies

Arkadiy Prodeus , National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Doctor of Engineering Sciences. Professor

Department of Acoustic and Multimedia Electronic Systems

Oleksandr Dvornyk, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Postgraduate student

Acoustic and Multimedia Electronic Systems Department

Anton Naida , National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Postgraduate student

Acoustic and Multimedia Electronic Systems Department

Maryna Didkovska , National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Candidate of Science (Engineering). Associate Professor

Department of Mathematical Methods of System Analysis

Oleksandr Grebin , National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Candidate of Science (Engineering). Associate Professor

Acoustic and Multimedia Electronic Systems Department

References

British Standard BS EN 60268-16. Sound system equipment. Part 16. Objective rating of speech intelligibility by speech transmission index. 2011.

Application Note, Measuring Speech Intelligibility Using DIRAC — Type 7841. Brüel & Kjær. 2013. Available at: https://www.bksv.com/media/doc/bo0521.pdf

Acoustics Engineering, Technical Note TN008 "DIRAC Stimuli", January 2008. Available at: https://www.acoustics-engineering.com/files/TN008.pdf

Farina, A. User Manual of Aurora 4.3, Parma (Italy): University of Parma A/S; (2012). Available at: http://pcfarina.eng.unipr.it/aurora/download/Manual-HelpFile/Aurora43Manual.pdf

P. Zhu, F. Mo,and J. Kang, “Experimental comparison between direct and indirect measurement methods for the objective rating of speech intelligibility,” Proc. 21st International Congress on Sound and Vibration (ICSV 21), 13-17 July, 2014, Beijing/China. Available at: https://www.researchgate.net/publication/288781895_Experimental_comparison_between_direct_and_indirect_measurement_methods_for_the_objective_rating_of_speech_intelligibility

Ponteggia, AN-013, Application note. Speech intelligibility assessment using CLIO 11. Available at: https://www.audiomatica.com/wp/wp-content/uploads/APPNOTE_013.pdf

NTi Audio, Application note. Speech Intelligibility. Measurement with the XL2 analyzer. Dec. 2020. 28 p. Available at: https://www.nti-audio.com/en/

D’Orazio, E. Rossi, and M. Garai, “Comparison of different in situ measurements techniques of intelligibility in an open-plan office,” Building Acoustics. 2018, 25(2), pp. 111–122. https://doi.org/10.1177/1351010X18776431

J. Kotus, B. Kostek, A. Kurowski and P. Szczuko, “A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems,” Proc. 2018 Joint Conference–Acoustics, Ustka, Poland, 2018, pp. 1–6. https://doi.org/10.1109/ACOUSTICS.2018.8502277

P. Zhu, W. Tao, F. Mo, F. Guo, X. Lu, and X. Liu, “Experimental comparison of speech transmission index measurement in natural sound rooms and auditoria,” Applied Acoustics, 165 (2020), pp. 1–21. Available at: https://doi.org/10.1016/j.apacoust.2020.107326

H. Möller, “A review of STI measurements,” Forum Acusticum, Dec 2020, Lyon, France. pp. 173–176.

A. Prodeus, “Rapid version of a formant-modulation method of speech intelligibility estimation,” Perspective Technologies and Methods in MEMS Design, Polyana, Ukraine, 2011, pp. 61–63. Available at: https://ieeexplore.ieee.org/document/5960269

A. Prodeus, “Formant-Modulation Method of Speech Intelligibility Evaluation: Measuring and Exactness,” Proc. of the VII International Conference MEMSTECH 2011. Lviv, Polyana, 2011, pp. 54–60. Available at: https://www.academia.edu/48296401/Formant_modulation_method_of_speech_intelligibility_evaluation_Measuring_and_exactness

M. Jeub, M. Schafer, and P. Vary, “A binaural room impulse response database for the evaluation of dereverberation algorithms,” Proc. Int. Conference on Digital Signal Processing (DSP), Santorini, Greece, 2009. https://doi.org/10.1109/ICDSP.2009.5201259

Birne, “An international comparison of long-term average speech spectra,” J. Acoust.Soc.Am., 96 (4), October 1994, pp. 2108–2120. https://doi.org/10.1121/1.410152

L. Morales, G. Leembruggen, S. Dance, and B. Shield, “A revised speech spectrum for STI calculations,” Applied Acoustics, vol. 132, March 2018, pp. 33–42. https://doi.org/10.1016/j.apacoust.2017.11.008

Downloads

Published

2023-03-26

Issue

Section

COMPUTER SCIENCES AND INFORMATION TECHNOLOGIES