Speech to Text

POST

audio

transcriptions

curl --request POST \
  --url https://geekai.dev/api/v1/audio/transcriptions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form model=whisper-1 \
  --form 'prompt=<string>' \
  --form 'language=<string>' \
  --form response_format=text \
  --form temperature=0 \
  --form retries=0

"<string>"

Note: For speech model name settings, refer to the System Supported Transcription Speech Model List. The request/response parameter structure is fully compatible with OpenAI. When switching models, you only need to modify the corresponding model name. If the model request/response parameters are inconsistent with OpenAI, GeekAI will automatically convert and align them at the backend.

The response data format is fully compatible with the OpenAI speech transcription text interface.

cURL Request Example

curl --location 'https://geekai.dev/api/v1/audio/transcriptions' \
    --header 'Authorization: {YOUR_GEEKAI_API_KEY}' \
    --form 'file=@"/C:/User/GeekAI/Downloads/2cd359fe642c33deaa943f7306d73f30.mp3"' \
    --form 'model="whisper-1"' \
    --form 'response_format="text"'

Authorizations

Authorization

string

header

required

token

Body

multipart/form-data

Response

200

text/plain

successful response

text format transcription result

Image Generation Text to Speech

curl --request POST \
  --url https://geekai.dev/api/v1/audio/transcriptions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form model=whisper-1 \
  --form 'prompt=<string>' \
  --form 'language=<string>' \
  --form response_format=text \
  --form temperature=0 \
  --form retries=0

"<string>"

Chat

Image

Audio

Video

Models

Agent

File

OCR

API KEY

cURL Request Example

Authorizations

Body

Response

Chat

Image

Audio

Video

Models

Agent

File

OCR

API KEY

​cURL Request Example

Authorizations

Body

Response

cURL Request Example