Artificial intelligence

Scientific journal

ISSN 2710-1673

ONLINE: ISSN 2710-1681

Real-Time Spontaneous Speech Recognition Based on Word Acoustic Composite Models

Robeiko V.¹, Sazhok M.²
¹ Speech Science and Technology Department, International Research and Training Center of Information Technologies and Systems «CyberMova»
² International Research and Training Centre for Information Technologies and Systems

UDC: 004.934
Publication Language: Ukrainian
Stuc. intelekt. 2012; 17(4):253-263

Abstract: This paper describes the implementation of methods and algorithms for automatic speech recognition in which word models are composed from acoustic phoneme models. This speech-to-text decoder design is conventional and the most productive one for Western languages. The aim is to explore how the approach applies to Ukrainian, a highly inflected language with relatively free word order. Data-driven methods are used to estimate the parameters of both the acoustic and the linguistic components of the mathematical model. The grapheme-to-phoneme conversion procedure takes into account word stress and the features of spontaneous continuous speech. The basic speech-to-text system can handle a 100k-word vocabulary in real time. Prospects for dictionary and domain extension, improved parameter estimation, and ergonomic issues are discussed.

Keywords: Speech recognition, spontaneous continuous speech, generative model, real-time
