mic
API

Speech to Text API

Transcribe uploaded audio with task polling, subtitle output, and async delivery.

What is Speech to Text API?

Upload supported audio formats, create a speech-to-text task, and retrieve transcript artifacts such as JSON and SRT once the task completes.

Supported Models

Async STT tasks

Submit audio and check completion through the common task endpoint

Transcript artifacts

JSON and subtitle outputs are returned when the task is done

Multiple audio formats

Use common uploaded audio formats within the supported limits

Quick Start

Bash
api.cheapaiapi.com/v1
curl -X POST "https://api.cheapaiapi.com/v1/task/speech-to-text" \
  -H "Authorization: Bearer sk_your_api_key" \
  -F file=@meeting.mp3

Why CheapAI?

schedule

Async Processing

Designed for task submission and later retrieval rather than immediate sync output.

subtitles

Transcript Files

Receive JSON and subtitle outputs when available.

upload_file

File Upload

Submit common audio file types using multipart form data.

webhook

Webhook Optional

You can provide a callback URL instead of polling manually.

Async

Flow

JSON + SRT

Output

Multipart

Upload

/v1/task/:id

Task Polling

Frequently Asked Questions

How do I get the final transcript?expand_more

Poll the common task endpoint and read the transcript URLs from the returned metadata once the task is done.

What file formats are supported?expand_more

Use the file types documented on the speech-to-text docs page for the current route.

Can I skip polling?expand_more

Yes. If the route supports receive_url, you can ask CheapAI to POST the result to your webhook.

Start Building Today

Get $2 in free credits. Create an API key, set spend controls if needed, and start testing the current task-based endpoints.

Get API Key arrow_forward