Notes: Avatar Systems often come bundled with an AI Engine; however, I prefer unbundled, modularized, plug and play.
Speech (Voice-Out / Voice-In): Generally an Avatar System will include text-to-speech (TTS) (aka speech synthesis) with lip-sync. Speech-to-text (STT) (aka speech recognition) may currently be provided (1) on the desktop by Windows 7 speech tools (see Microsoft Speech API), (2) in the Chrome browser with Google Speech API, or (3) on the mobile platform with iPhone or Android speech tools.
Advanced applications: Face tracking (see Avatar Kinect), Facial recognition