Represents a verbose json transcription response returned by model, based on the provided input.
Name | Type | Description | Notes |
---|---|---|---|
language | String | The language of the input audio. | |
duration | String | The duration of the input audio. | |
text | String | The transcribed text. | |
words | List<TranscriptionWord> | Extracted words and their corresponding timestamps. | [optional] |
segments | List<TranscriptionSegment> | Segments of the transcribed text and their corresponding details. | [optional] |