Elevenlabs Speech to Text
Overview
Elevenlabs Speech to Text is a speech-to-text (transcription) model served via Elevenlabs. Access it with the same Bearer token and OpenAI-compatible request shape as the other 287 models in the catalog; no provider-specific SDK is required. It is priced at $0.0245 per minute, billed per use from your credit balance.
Use via API
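# Placeholder input: "audio_url" is a hypothetical field name, not confirmed by this page.
# Check the model's input schema for the real field before sending.
curl https://aimarcusimage.eu/api/v1/jobs/createTask \
  -H "Authorization: Bearer sk-aig-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "elevenlabs-elevenlabs-speech-to-text",
    "input": {
      "audio_url": "https://example.com/sample.mp3"
    }
  }'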
Async jobs (video / image / music) return a taskId. Poll /api/v1/jobs/recordInfo?taskId=... or use a webhook to get the result URLs.
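The polling loop described above can be sketched in Python as follows. This is a minimal sketch, not the gateway's official client: the host and the `recordInfo` path come from this page, but the response schema is not documented here, so the `state` field and its `"success"` / `"fail"` values are assumptions to adjust against a real response. The helper names `fetch_record` and `poll_task` are made up for illustration.

```python
import json
import time
import urllib.parse
import urllib.request

BASE_URL = "https://aimarcusimage.eu"  # host taken from the curl example


def fetch_record(task_id, api_key):
    """GET /api/v1/jobs/recordInfo?taskId=... and decode the JSON body."""
    query = urllib.parse.urlencode({"taskId": task_id})
    req = urllib.request.Request(
        f"{BASE_URL}/api/v1/jobs/recordInfo?{query}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def poll_task(task_id, fetch, interval=2.0, timeout=300.0,
              sleep=time.sleep, clock=time.monotonic):
    """Call fetch(task_id) until the job reaches a terminal state.

    ASSUMPTION: the record has a "state" field whose terminal values are
    "success" and "fail". Any other value is treated as still pending.
    """
    deadline = clock() + timeout
    while True:
        record = fetch(task_id)
        state = record.get("state")
        if state == "success":
            return record
        if state == "fail":
            raise RuntimeError(f"task {task_id} failed: {record}")
        if clock() >= deadline:
            raise TimeoutError(f"task {task_id} still pending after {timeout}s")
        sleep(interval)
```

A call would look like `poll_task(task_id, lambda t: fetch_record(t, api_key))`. Passing `fetch` in as a parameter keeps the loop testable without network access; a webhook avoids the loop entirely if your service can receive callbacks.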
Pricing comparison
| Provider | Price per minute | Notes |
|---|---|---|
| AI Generate | $0.0245 | This site, pay-as-you-go, $10 free on signup |
| fal.ai | $0.0300 | Usually the upstream reference rate |
| Direct from Elevenlabs | varies | Requires separate account and higher minimum commitment |
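To make the table concrete, here is a small worked calculation in Python using only the two listed rates and the $10 signup credit. It assumes billing is strictly linear per minute with no rounding to whole minutes, which this page does not confirm; the function names are illustrative.

```python
from decimal import Decimal

RATE_AI_GENERATE = Decimal("0.0245")  # USD per minute, from the table
RATE_FAL = Decimal("0.0300")          # USD per minute, from the table
FREE_CREDIT = Decimal("10")           # USD signup credit, from the table


def cost(minutes, rate):
    """Cost in USD for a number of minutes, assuming linear billing."""
    return (Decimal(str(minutes)) * rate).quantize(Decimal("0.0001"))


def free_minutes(credit=FREE_CREDIT, rate=RATE_AI_GENERATE):
    """Whole minutes covered by a credit balance at the given rate."""
    return int(credit / rate)


def saving_percent(rate=RATE_AI_GENERATE, reference=RATE_FAL):
    """Relative saving versus a reference rate, in percent."""
    return ((reference - rate) / reference * 100).quantize(Decimal("0.1"))


print(cost(60, RATE_AI_GENERATE))  # 1.4700
print(cost(60, RATE_FAL))          # 1.8000
print(free_minutes())              # 408
print(saving_percent())            # 18.3
```

Under that assumption, one hour costs $1.47 here versus $1.80 at the fal.ai rate, and the signup credit covers about 408 minutes.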
Related models
Suno, Replace Music Section
Suno, Generate Lyrics
Elevenlabs Text to Speech, turbo 2.5
Elevenlabs Sound Effect V2
Suno, Generate Music
Suno, Mashup
FAQ
How much does Elevenlabs Speech to Text cost?
$0.0245 per minute. You pay from your credit balance, with no monthly subscription and no minimum commitment. The first $10 of credit on signup is free.
Is Elevenlabs Speech to Text a music model?
No. Elevenlabs Speech to Text is a speech-to-text (transcription) model from Elevenlabs, served through the AI Generate gateway. For music generation, see the Suno entries under Related models.
How do I call Elevenlabs Speech to Text from my code?
Send a standard HTTPS POST with a Bearer token to our /api/v1/ endpoint. The request shape matches the OpenAI-compatible convention, so no provider-specific SDK is needed. See the "Use via API" section above for a working curl example.
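For callers who prefer Python over curl, the same request can be sketched with the standard library alone. The URL, headers, and model ID are copied from the "Use via API" example; the keys inside `input` are model-specific and not documented on this page, so the caller passes them in. `build_create_task` and `create_task` are illustrative names, not part of any SDK.

```python
import json
import urllib.request

API_URL = "https://aimarcusimage.eu/api/v1/jobs/createTask"  # from the curl example
MODEL_ID = "elevenlabs-elevenlabs-speech-to-text"            # from the curl example


def build_create_task(api_key, model_input):
    """Build (but do not send) the POST request for a new task."""
    body = json.dumps({"model": MODEL_ID, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def create_task(api_key, model_input):
    """Send the request and return the decoded JSON response."""
    req = build_create_task(api_key, model_input)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)
```

Separating request construction from sending makes it easy to inspect or log the exact payload before spending credits.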
Can I use Elevenlabs Speech to Text in production?
Yes. The endpoint is rate-limited to 20 requests per 10 seconds per API key, retries are idempotent via taskId, and async results are delivered via polling or webhook. Set a daily spend cap in dashboard settings to protect against runaway usage.
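One way to stay under the stated limit of 20 requests per 10 seconds is to throttle on the client side. The sketch below uses a sliding window; how the server actually counts its window is not specified here, so treat this as a conservative approximation rather than a mirror of the gateway's logic. The class name is illustrative.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most max_calls acquisitions in any period-second window."""

    def __init__(self, max_calls=20, period=10.0,
                 clock=time.monotonic, sleep=time.sleep):
        self._max_calls = max_calls
        self._period = period
        self._clock = clock
        self._sleep = sleep
        self._stamps = deque()  # timestamps of recent calls, oldest first

    def acquire(self):
        """Block until one more call fits inside the window, then record it."""
        while True:
            now = self._clock()
            # Drop timestamps that have aged out of the window.
            while self._stamps and now - self._stamps[0] >= self._period:
                self._stamps.popleft()
            if len(self._stamps) < self._max_calls:
                self._stamps.append(now)
                return
            # Wait until the oldest recorded call leaves the window.
            self._sleep(self._period - (now - self._stamps[0]))
```

Call `limiter.acquire()` immediately before each API request made with a given key. This limiter is per process and not thread-safe; several workers sharing one key would need a shared limiter or a lower per-worker budget.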
What is the markup compared to the upstream provider?
We route through a mix of direct provider integrations (OpenAI, Anthropic) and wholesale aggregators (kie.ai, OpenRouter), and pricing is typically 20-40% above raw provider cost. Volume tiers at $50 / $200 / $1000 / $5000 monthly spend reduce the markup to as low as 10%.
Ready to try Elevenlabs Speech to Text?
Sign up in 30 seconds. You get $10 in free credits, enough for roughly 400 minutes of audio at the listed $0.0245 per-minute rate.