Convert Plain Text To Speech

This endpoint is used to trigger text to speech conversion.

It can generate text for Standard & Premium (S&P) voices. The identifiers for these voices look like en-US-JennyNeural.

If you are using PlayHT voices (their identifiers look like larry or a URL), please refer to the Generate Audio From Text endpoint page.


The request body for this /v1/convert endpoint must contain the text to be converted to speech, along with the voice to be used for the conversion. The response will contain data about the conversion job created in JSON format.

Please provide the text to be converted in the content field. It is an array of plain text strings.

Use the transcriptionId in the response to check the conversion status in the Get Article Conversion Status endpoint endpoint.

Body Params
content
array of strings
required
Defaults to Hey you!

Array of strings, where each string represents a paragraph in plain text format.

content*
string
required
Defaults to en-US-JennyNeural

Voice used to synthesize the text. Refer to the List Standard & Premium Voices endpoint for a list of supported voices.

string

A name to your file. You can use this name to find the audio in your Play.ht dashboard.

string

String representing the tone and accent of the voice to read the text. Make sure the value for narrationStyle is supported by the voice in your request. Refer to the List Standard & Premium Voices endpoint for more details.

string

String in the format <number>%, where <number> is in the closed interval of [20, 200]. Use this to speed-up, or slow-down the speaking rate of the speech.

pronunciations
array of objects
pronunciations
boolean

When enabled, the audio will be trimmed to remove any silence from the end of the file.

string

Pass this to update an existing audio file.

Responses

Language
Credentials
Response
Click Try It! to start a request and see the response here! Or choose an example:
application/json