This system enables real-time voice translation between two users, with animated 3D avatars that lip-sync to the translated audio. The entire pipeline runs on-device using WebAssembly models for ...