商品詳細 | Knowledge Worker

丸善のおすすめ度

Novel Techniques for Dialectal Arabic Speech Recognition 2012nd ed. H 142 p. 12

Elmahdy, Mohamed, Gruhn, Rainer, Minker, Wolfgang 　著

在庫状況海外在庫有り	お届け予定日 1ヶ月	数量冊
価格 \22,159（税込）

この商品について問合せる

発行年月	2012年02月
出版社／提供元	Springer-Verlag New York
出版国	アメリカ合衆国
言語	英語
媒体	冊子
装丁	hardcover
ページ数／巻数	XXII, 110 p.
ジャンル	洋書／理工学／情報科学／知的情報処理
ISBN	9781461419051
商品コード	1004469454
新刊案内掲載月	2012年01月
商品URL	https://kw.maruzen.co.jp/ims/itemDetail.html?itmCd=1004469454

内容

"Novel Techniques for Dialectal Arabic Speech Recognition" describes novel approaches to improve automatic speech recognition for dialectal Arabic. Since speech resources for dialectal Arabic speech recognition are very sparse, the authors describe how existing Modern Standard Arabic (MSA) speech data can be applied to dialectal Arabic speech recognition, while assuming that MSA is always a second language for all Arabic speakers, and in most cases the original dialect of a speaker can be identified even though he is speaking MSA. Hence, an acoustic model trained with sufficient number of MSA speakers from different origins will implicitly model the acoustic features for the different Arabic dialects. In this case, it can be called dialect-independent acoustic modeling. In this book, Egyptian Colloquial Arabic (ECA) has been chosen as a typical Arabic dialect. ECA is the first ranked Arabic dialect in terms of number of speakers. A high quality ECA speech corpus with accurate phonetic transcription has been collected. MSA acoustic models were trained using news broadcast speech. Usually, MSA and dialectal Arabic do not share the same phoneme set. Therefore, in order to crosslingually use MSA in dialectal Arabic speech recognition, the authors have normalized the phoneme sets for MSA and ECA. After this normalization, they have applied state-of-the-art acoustic model adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A-Posteriori (MAP) to adapt existing phonemic MSA acoustic models with a small amount of dialectal ECA speech data. Speech recognition results indicate a significant increase in recognition accuracy compared to a baseline model trained with only ECA data.

Fundamentals.- Speech Corpora.- Phonemic Acoustic Modeling.- Graphemic Acoustic Modeling.- Phonetic Transcription Using the Arabic Chat Alphabet.

カート

カートに商品は入っていません。

前のページに戻る