Multicomponent Signal for Comparing Direct and Indirect Methods of Speech Transmission Index Measurement
DOI:
https://doi.org/10.18372/1990-5548.75.17546Keywords:
test signal, speech intelligibility, direct method, indirect method, noise disturbance, reverberationAbstract
When evaluating the intelligibility of speech distorted by noise and reverberation, direct or indirect methods of measuring the speech transmission index are used. However, it remains insufficiently studied how significantly differ the results of measurements obtained by direct and indirect methods. To find an answer to this question, the use of a multicomponent test signal consisting of four "elementary" signals separated by pauses is proposed in this paper. As "elementary" signals, it is proposed to use a maximum-length sequence, a speech shaped maximum-length sequence, a speech shaped stationary noise, and a speech shaped amplitude-modulated noise. Use of amplitude-modulated noise allows estimating speech transmission index by a direct method. Other "elementary" signals make it possible to estimate speech transmission index by two variants of indirect method. The proposed algorithms and corresponding computer programs were tested on trial signal models, while the consistency of the obtained results with the results of previous studies was revealed. The results of the signal models studies show that both considered variants of the indirect speech transmission index measurement method lead to underestimated results compared to the direct method. For one of the variants of the indirect method, the value of the estimate bias is 0.03–0.04, regardless of the interfering conditions. For another variant of the indirect method, the estimate bias varies from 0.01 to 0.18, depending on the interference conditions.
References
British Standard BS EN 60268-16. Sound system equipment. Part 16. Objective rating of speech intelligibility by speech transmission index. 2011.
Application Note, Measuring Speech Intelligibility Using DIRAC — Type 7841. Brüel & Kjær. 2013. Available at: https://www.bksv.com/media/doc/bo0521.pdf
Acoustics Engineering, Technical Note TN008 "DIRAC Stimuli", January 2008. Available at: https://www.acoustics-engineering.com/files/TN008.pdf
Farina, A. User Manual of Aurora 4.3, Parma (Italy): University of Parma A/S; (2012). Available at: http://pcfarina.eng.unipr.it/aurora/download/Manual-HelpFile/Aurora43Manual.pdf
P. Zhu, F. Mo,and J. Kang, “Experimental comparison between direct and indirect measurement methods for the objective rating of speech intelligibility,” Proc. 21st International Congress on Sound and Vibration (ICSV 21), 13-17 July, 2014, Beijing/China. Available at: https://www.researchgate.net/publication/288781895_Experimental_comparison_between_direct_and_indirect_measurement_methods_for_the_objective_rating_of_speech_intelligibility
Ponteggia, AN-013, Application note. Speech intelligibility assessment using CLIO 11. Available at: https://www.audiomatica.com/wp/wp-content/uploads/APPNOTE_013.pdf
NTi Audio, Application note. Speech Intelligibility. Measurement with the XL2 analyzer. Dec. 2020. 28 p. Available at: https://www.nti-audio.com/en/
D’Orazio, E. Rossi, and M. Garai, “Comparison of different in situ measurements techniques of intelligibility in an open-plan office,” Building Acoustics. 2018, 25(2), pp. 111–122. https://doi.org/10.1177/1351010X18776431
J. Kotus, B. Kostek, A. Kurowski and P. Szczuko, “A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems,” Proc. 2018 Joint Conference–Acoustics, Ustka, Poland, 2018, pp. 1–6. https://doi.org/10.1109/ACOUSTICS.2018.8502277
P. Zhu, W. Tao, F. Mo, F. Guo, X. Lu, and X. Liu, “Experimental comparison of speech transmission index measurement in natural sound rooms and auditoria,” Applied Acoustics, 165 (2020), pp. 1–21. Available at: https://doi.org/10.1016/j.apacoust.2020.107326
H. Möller, “A review of STI measurements,” Forum Acusticum, Dec 2020, Lyon, France. pp. 173–176.
A. Prodeus, “Rapid version of a formant-modulation method of speech intelligibility estimation,” Perspective Technologies and Methods in MEMS Design, Polyana, Ukraine, 2011, pp. 61–63. Available at: https://ieeexplore.ieee.org/document/5960269
A. Prodeus, “Formant-Modulation Method of Speech Intelligibility Evaluation: Measuring and Exactness,” Proc. of the VII International Conference MEMSTECH 2011. Lviv, Polyana, 2011, pp. 54–60. Available at: https://www.academia.edu/48296401/Formant_modulation_method_of_speech_intelligibility_evaluation_Measuring_and_exactness
M. Jeub, M. Schafer, and P. Vary, “A binaural room impulse response database for the evaluation of dereverberation algorithms,” Proc. Int. Conference on Digital Signal Processing (DSP), Santorini, Greece, 2009. https://doi.org/10.1109/ICDSP.2009.5201259
Birne, “An international comparison of long-term average speech spectra,” J. Acoust.Soc.Am., 96 (4), October 1994, pp. 2108–2120. https://doi.org/10.1121/1.410152
L. Morales, G. Leembruggen, S. Dance, and B. Shield, “A revised speech spectrum for STI calculations,” Applied Acoustics, vol. 132, March 2018, pp. 33–42. https://doi.org/10.1016/j.apacoust.2017.11.008
Downloads
Published
Issue
Section
License
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).