Alignment Timestamps

Extract word-level timestamps by aligning a transcript to an audio file.


POST/api/v1/alignment/extract-timestamps

Request Body

ParameterTypeRequiredDescription
modelstringYesModel key (see models below)
paramsobjectYesModel-specific parameters
attachmentMediaIdsstring[]YesAudio file to align against

Response

{
  "inferenceJobId": "ij_..."
}

Example

POST/api/v1/alignment/extract-timestamps
{
  "model": "elevenlabs-extract-timestamps",
  "params": {
    "text": "Welcome to Wonda, the AI-powered video editing platform."
  },
  "attachmentMediaIds": ["media_abc123"]
}

Models

elevenlabs-extract-timestamps

Align a transcript to audio and extract word-level timestamps via ElevenLabs.

ParameterTypeRequiredDescription
textstringYesThe transcript text to align against the audio

Attachments: 1 required audio file (wav, mp3, mpeg, m4a).