A Modular Prototype of Emotion-Aware Proactive Voice Agent with Live2D Embodiment
DOI:
https://doi.org/10.31224/5993Keywords:
Proactive Conversational Agent, Live2D Embodied Dialogue, KoreanAbstract
We present a voice-based conversational agent that advances beyond reactive dialogue by integrating speech-to-text transcription with Whisper, emotion recognition, simple policy mechanisms, and Live2D embodiment. The system delivers supportive guidance either as inline prompts or card-style recommendations, while empathetic dialogue and expressive avatar cues enhance both transparency and user engagement. A log-based evaluation across ten sessions showed consistent stability, with an average latency of 7.1 seconds. This prototype illustrates a practical foundation for developing emotion-aware, proactive companions aligned with the vision of human-centered dialogue systems.
Downloads
Downloads
Posted
License
Copyright (c) 2025 Jae Young Suh, Mingyu Jeon

This work is licensed under a Creative Commons Attribution 4.0 International License.