In the course of developing a security system, I came across a text-to-speech IC, the XFS5051CE. I think it might interest folks in this forum, so I’ll post it. The best part of this IC is that you can burn as many wav or pcm files as you want into an SPI flash, then play those files just like synthesizing speech from text. The following is the text I sent through UART. Bolded words are the names of wav files burned into the flash.
The synthesized audio can be downloaded from my Dropbox:
The ATT and Festival files are recordings of the same kind of text read by ATT Natural Voices Lab and The Festival Speech Synthesis System, respectively. Honestly, chip/hardware-based TTS is still not as good as a software-based solution, but we embedded people often simply don’t have access to the internet, so I have no complaints.
This IC handles volume, pitch, and speed, but I haven’t had the chance to play with those features yet.
My plan was to use the S1V30120 from Epson, but the minimum order of 600 scared me off, and Joe’s EMIC2 seems quite expensive for what it can do. So I looked a little harder and found this IC.
The 1K unit price of this IC is $15, which is kind of expensive too, but given its ability to handle audio playback (either from SPI flash or from the MCU through UART), I think it is still a good choice, particularly for the .NET Micro Framework. NETMF so far doesn’t support I2S, so audio playback needs to be handled by an external IC or native code. Combining TTS and audio playback into one IC greatly simplifies system design when both features are needed.
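For anyone who wants to try sending text to the chip over UART, here is a minimal sketch of building a synthesis frame on the host side. The framing is an assumption, not something stated in this post: a header byte, a two-byte big-endian length, a command byte, an encoding byte, then the text. The constant values and the `build_tts_frame` name are hypothetical, so check everything against the XFS5051CE datasheet before relying on it.

```python
# ASSUMED framing, not confirmed by this post: verify against the datasheet.
FRAME_HEADER = 0xFD      # assumed start-of-frame byte
CMD_SYNTHESIZE = 0x01    # assumed "start synthesis" command byte
ENCODING_GB2312 = 0x00   # assumed encoding selector; ASCII is a subset of GB2312


def build_tts_frame(text: str) -> bytes:
    """Wrap text in a hypothetical synthesis frame for the UART link."""
    payload = text.encode("gb2312")
    # The length field counts the command byte, the encoding byte, and the text.
    length = len(payload) + 2
    if length > 0xFFFF:
        raise ValueError("text too long for a 16-bit length field")
    header = bytes([FRAME_HEADER, length >> 8, length & 0xFF,
                    CMD_SYNTHESIZE, ENCODING_GB2312])
    return header + payload


frame = build_tts_frame("hi")
print(frame.hex())
```

From a PC, the resulting bytes could be written to the serial port with something like pyserial's `serial.Serial(port, baudrate).write(frame)`; the port name and baud rate depend on your setup.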