Voice (STT/TTS)

abTARS transcribes voice messages and can respond with synthesized speech.

Speech-to-Text (STT)

Provider: Groq Whisper (whisper-large-v3-turbo).

bash

# ~/.abtars/config/.env
STT_ENABLED=true
GROQ_API_KEY=<secret>

LANGUAGE_HINT_PROMPT guides Whisper (e.g. "ez egy magyar szöveg. or English")
Whisper returns detected language code (hu, en, ja, etc.)
users.json defines expected languages per user: "languages": ["hu", "en"]
If detected language isn't in the user's list → agent asks back (likely hallucination on short audio)
Soft check via prompt — no hard rejection

When enabled, the agent can respond with voice messages on Telegram.

bash

TTS_ENABLED=true
TTS_PROVIDER=openai    # or other supported provider

Voice is Telegram-only — it's the only platform that sends voice note file IDs the bridge can download.