Timestamp From Audio

post https://api.play.ht/api/v2/transcriptions

Creates a new transcription that extracts text and time information from the audio generated by a text-to-speech (TTS) generation.

For example, you can generate SRT subtitles with the timestamps at word or sentence level.

Similarly, you can also generate VRT or JSON outputs, similarly, with the timestamps at word or sentence level.

The audio to be transcribed comes from a text-to-speech (TTS) generation. The TTS job ID is specified in the request body.

The API responds with information about the newly created Transcription job:

Status: 201 - Created
Header: Location: /api/v2/transcriptions/{id} - the URL for the recently-created job
Body: Job data (job ID and status)

When the job completes, the transcription result is delivered through the webhook sent to the webhook_url specified in the request body.