gemini-3.1-flash-tts
Controllable TTS across 70+ languages, with 200+ inline emotion tags. Audio output tokens.
Multi-turn dialogue, system prompts.
SSE chunks for incremental output.
Structured arguments via JSON schemas.
Pass images alongside the prompt.
Text and image to short video, async.
Coming soon.
This example calls the selected Google AI Studio route with gemini-3.1-flash-tts.
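import OpenAI from 'openai';

// Point the OpenAI SDK at the gateway's OpenAI-compatible endpoint.
const client = new OpenAI({
  baseURL: 'https://api.miavo.xyz/v1',
  // The API key is read from the MACAW_API_KEY environment variable.
  apiKey: process.env.MACAW_API_KEY!,
});

const res = await client.chat.completions.create({
  model: 'gemini-3.1-flash-tts',
  messages: [
    { role: 'user', content: 'Write me a haiku about gateways.' },
  ],
});

// Print the content of the first choice.
console.log(res.choices[0].message.content);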
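The capability list above mentions SSE chunks for incremental output. As a rough sketch of what consuming such a stream might involve, the helper below pulls text deltas out of a raw SSE body. It assumes the route emits OpenAI-style chat completion chunks (one `data: <json>` line per event, ending with `data: [DONE]`), which the page does not confirm. `parseSseText`, `StreamChunk`, and the sample payload are hypothetical names and data introduced for illustration, and audio output may arrive in a different field that is not shown here.

```typescript
// Minimal shape of one streamed chunk, ASSUMING the OpenAI-compatible format.
interface StreamChunk {
  choices: { delta?: { content?: string | null } }[];
}

// Parse a raw SSE body into the text deltas it carries.
// Assumption: each event is a single `data: <json>` line and the
// stream is terminated by a `data: [DONE]` sentinel.
function parseSseText(raw: string): string[] {
  const deltas: string[] = [];
  for (const line of raw.split(/\r?\n/)) {
    // Skip blank separator lines, comments, and non-data fields.
    if (!line.startsWith('data:')) continue;
    const payload = line.slice('data:'.length).trim();
    if (payload === '[DONE]') break; // end-of-stream sentinel
    const chunk = JSON.parse(payload) as StreamChunk;
    const text = chunk.choices[0]?.delta?.content;
    if (text) deltas.push(text);
  }
  return deltas;
}

// Hypothetical sample body; real payloads carry more fields.
const sample = [
  'data: {"choices":[{"delta":{"content":"Gate"}}]}',
  '',
  'data: {"choices":[{"delta":{"content":"ways"}}]}',
  '',
  'data: [DONE]',
  '',
].join('\n');

console.log(parseSseText(sample).join('')); // prints "Gateways"
```

With the OpenAI SDK, passing `stream: true` to `client.chat.completions.create` returns an async iterable of already-parsed chunks, so manual parsing like this is mainly useful when reading the HTTP response directly. Whether this particular route supports streaming for gemini-3.1-flash-tts is not stated on the page.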