Comparison of Two Methods for Measuring Speech Intelligibility

Authors

  • Arkadiy Prodeus, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, https://orcid.org/0000-0001-7640-0850
  • Oleksandr Dvornyk, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”
  • Anton Naida, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”
  • Oleksandr Grebin, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

DOI:

https://doi.org/10.18372/1990-5548.78.18249

Keywords:

speech transmission index, measurement method, formant-modulation method, bias, standard deviation

Abstract

The full speech transmission index (STI) method and the full formant-modulation method of speech intelligibility measurement are compared in terms of measurement accuracy under noise interference. The dependence of the STI estimation error on the signal-to-noise ratio and on the duration of the test signals was obtained by computer simulation. The accuracy of the STI estimates is shown to be practically the same for both methods. In particular, a test signal duration of 4 s is the minimum acceptable and gives an estimation error of 0.03. Using 8 s and 16 s test signals reduces the STI estimation error to 0.02 and 0.01, respectively, over a wide signal-to-noise ratio range from -28 dB to +28 dB. These results are close to those obtained for the joint action of noise and reverberation.
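To put the reported errors of 0.01 to 0.03 in context, the following minimal sketch maps a signal-to-noise ratio to an STI value using the standard noise-only relations: modulation transfer value m = 1 / (1 + 10^(-SNR/10)), effective SNR clipped to ±15 dB, and transmission index (SNR_eff + 15) / 30. It assumes the same SNR in every octave band and at every modulation frequency, and it omits band weighting, auditory masking and threshold corrections. The function names are hypothetical, and this is not the simulation procedure used by the authors.

```python
import math


def mtf_noise_only(snr_db: float) -> float:
    """Modulation transfer value for stationary noise only (no reverberation)."""
    return 1.0 / (1.0 + 10.0 ** (-snr_db / 10.0))


def transmission_index(m: float) -> float:
    """Map a modulation transfer value to a transmission index in [0, 1]."""
    # Guard against log of zero or division by zero at the extremes.
    m = min(max(m, 1e-12), 1.0 - 1e-12)
    snr_eff = 10.0 * math.log10(m / (1.0 - m))
    # The effective SNR is limited to the range -15 dB .. +15 dB.
    snr_eff = min(max(snr_eff, -15.0), 15.0)
    return (snr_eff + 15.0) / 30.0


def sti_flat_snr(snr_db: float) -> float:
    """Simplified STI, assuming one identical SNR for all bands and modulation rates."""
    return transmission_index(mtf_noise_only(snr_db))


if __name__ == "__main__":
    for snr in (-28, -15, 0, 3, 15, 28):
        print(f"SNR = {snr:+d} dB -> STI = {sti_flat_snr(snr):.3f}")
```

Under these assumptions each 1 dB of SNR inside the ±15 dB range corresponds to an STI change of 1/30, or about 0.033, so an estimation error of 0.03 is roughly equivalent to a 1 dB uncertainty in SNR.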

Author Biographies

Arkadiy Prodeus, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Doctor of Engineering Science

Professor

Acoustic and Multimedia Electronic Systems Department

Oleksandr Dvornyk, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Post-graduate Student

Acoustic and Multimedia Electronic Systems Department

Anton Naida, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Post-graduate Student

Acoustic and Multimedia Electronic Systems Department

Oleksandr Grebin, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”

Candidate of Science (Engineering)

Associate Professor

Acoustic and Multimedia Electronic Systems Department

References

J. Collard, “Theoretical Study of the Articulation and Intelligibility of a Telephone Circuit,” Electrical Communication, vol. 7, 1929, p. 168. Available at: https://www.worldradiohistory.com/Archive-ITT/20s/ITT-Vol-07-1929-03.pdf

K. Kryter, The Effects of Noise on Man, Academic Press, New York and London, 1970, 612 p. Available at: https://www.perlego.com/book/1897278/the-effects-of-noise-on-man-pdf

H. Steeneken and T. Houtgast, “A physical method for measuring speech-transmission quality,” J. Acoust. Soc. Am., 67, 1980, pp. 318–326, https://doi.org/10.1121/1.384464

K. Rhebergen, “A Speech Intelligibility Index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners,” J. Acoust. Soc. Am., 117 (4), Pt. 1, April 2005, pp. 2181–2192, https://doi.org/10.1121/1.1861713

A. Prodeus, “Formant-Modulation Method of Speech Intelligibility Evaluation: Measuring and Exactness,” Proc. VII Int. Conf. MEMSTECH 2011, Lviv, Polyana, Ukraine, 2011, pp. 54–60. Available at: https://ieeexplore.ieee.org/document/5960267

British Standard BS EN 60268-16. Sound system equipment. Part 16. Objective rating of speech intelligibility by speech transmission index. 2011.

A. Prodeus, “Rapid version of a formant-modulation method of speech intelligibility estimation,” Proc. VII Int. Conf. MEMSTECH 2011, Lviv, Polyana, Ukraine, 2011, pp. 61–63. Available at: https://ieeexplore.ieee.org/document/5960269

NTi Audio, Application note. Speech Intelligibility. Measurement with the XL2 analyzer. Dec. 2020, 28 p. Available at: https://www.nti-audio.com/en/

J. Bradley, R. Reich, and R. Norcross, “A just noticeable difference in C50 for speech,” Applied Acoustics, (58), 1999, pp. 99–108, https://doi.org/10.1016/S0003-682X(98)00075-9

A. Prodeus, O. Dvornyk, A. Naida, and M. Didkovska, “The Accuracy of Speech Transmission Index Estimation under Conditions of Joint Action of Noise and Reverberation,” 2023 IEEE 13th International Conference on Electronics and Information Technologies (ELIT), Lviv, Ukraine, 2023, pp. 257–260, https://doi.org/10.1109/ELIT61488.2023.10310682

M. Jeub, M. Schafer, and P. Vary, “A binaural room impulse response database for the evaluation of dereverberation algorithms,” Proc. Int. Conference on Digital Signal Processing (DSP), Santorini, Greece, 2009, https://doi.org/10.1109/ICDSP.2009.5201259

D. Byrne, H. Dillon, and K. Tran, “An international comparison of long-term average speech spectra,” J. Acoust. Soc. Am., 96 (4), 1994, pp. 2108–2120, http://dx.doi.org/10.1121/1.410152

L. Morales, G. Leembruggen, S. Dance, and B. Shield, “A Revised Speech Spectrum for STI Calculations,” Applied Acoustics, 132, 2018, pp. 33–42, https://doi.org/10.1016/j.apacoust.2017.11.008

Published

2023-12-27

Section

COMPUTER SCIENCES AND INFORMATION TECHNOLOGIES