Speech generation and servo control for my Bender figure (https://www.thingiverse.com/thing:6200580).
Speech output is built from phonemes: each entered word is broken down into phonemes, and the matching audio segments are combined and played back. The mouth moves to match the sound being played. For example, it opens for an "a" and closes for an "m". In addition, there are some prerecorded audio snippets, such as laughing or "hmmmm" sounds.
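
The phoneme lookup described above can be illustrated with a small Python sketch. It is not the code that runs on the figure. The table entries, the file names, the mouth-opening percentages and the function names `to_phonemes` and `plan_playback` are all assumptions made for the example, and the greedy longest-match split is just one plausible way to break a word into known segments.

```python
# Hypothetical table: grapheme -> (audio segment file, mouth opening in percent).
# Entries and file names are illustrative only, not the project's real data.
PHONEME_TABLE = {
    "a": ("a.wav", 100),   # mouth fully open
    "e": ("e.wav", 60),
    "o": ("o.wav", 80),
    "m": ("m.wav", 0),     # mouth closed
    "b": ("b.wav", 0),
    "n": ("n.wav", 20),
    "d": ("d.wav", 20),
    "r": ("r.wav", 30),
    "sh": ("sh.wav", 30),  # multi-letter entry, matched before single letters
}

MAX_KEY_LEN = max(len(key) for key in PHONEME_TABLE)


def to_phonemes(word):
    """Split a word into table keys using a greedy longest-match scan."""
    word = word.lower()
    result = []
    i = 0
    while i < len(word):
        # Try the longest possible chunk first, then shorter ones.
        for length in range(min(MAX_KEY_LEN, len(word) - i), 0, -1):
            chunk = word[i:i + length]
            if chunk in PHONEME_TABLE:
                result.append(chunk)
                i += length
                break
        else:
            i += 1  # no entry for this character, skip it
    return result


def plan_playback(word):
    """Return the (audio file, mouth opening) pairs to play in order."""
    return [PHONEME_TABLE[p] for p in to_phonemes(word)]
```

With this table, `plan_playback("ma")` yields `[("m.wav", 0), ("a.wav", 100)]`: the mouth is closed for the "m" and fully open for the "a". A real implementation would then play each segment and set the servo position for its duration.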
Audio sample generation: github:audioconvert
Web server with ESP8266: github:BenderLAN