Languages

VoiceBase supports several spoken languages and dialects.

Languages supported in the V2 (Beta) API:

Language Code Supported Frequencies (kHz)
Dutch nl-NL 8, 11
English US en-US 8, 16
English UK en-UK 8, 16
English Australian en-AU 8, 16
French fr-FR 8, 11
German de-DE 8, 11
Italian it-IT 8, 11
Portuguese, Brazil pt-BR 8
Spanish, Latin Am. es-LA 8, 16
Spanish, Spain es-ES 8, 11

Feature Support by Language:

Note: en-UK and en-AU use the en-US functions for Keywords, Topics, Number Formatting and PCI.

Configuring Language Support

Use the language configuration parameter to set the language. Omitting the parameter defaults the language to U.S. English (en-US).

For example, to transcribe a recording in Australian English:

{
  "configuration": {
    "language":"en-AU"
  }
}
  • configuration : The root configuration section.
    • language : The language code.
      • en-US : US English
      • en-UK : UK English
      • en-AU : Australian English
      • es-LA : Latin American Spanish
      • pt-BR : Brazilian Portuguese

Disabling Semantic Keywords and Topics

Semantic keywords and topics are not currently supported with Spanish and Portuguese. When transcribing in these languages, disable these options in your configuration.

{
  "configuration": {
  "language": "es-LA",
    "keywords": {
      "semantic": false
    },
    "topics": {
      "semantic": false
    }
  }
}
  • configuration : The root configuration section.
    • language : The language code.
      • es-LA : Latin American Spanish
      • pt-BR : Brazilian Portuguese
    • keywords : The keywords sub-section
      • semantic : The semantic keywords sub-section
        • false : The value is false to indicate no semantic keywords.
    • topics : The topics sub-section
      • semantic : The semantic topics sub-section
        • false : The value is false to indicate no semantic topics.

Examples

** Note: Export your api TOKEN prior to running any of the following examples.

export TOKEN='Your Api Token'

U.S. English

curl https://apis.voicebase.com/v2-beta/media \
    --form media=@recording.mp3 \
    --form 'configuration={
      "configuration": {
        "language": "en-US",
      }
    }' \
    --header "Authorization: Bearer ${TOKEN}"

U.K. English

curl https://apis.voicebase.com/v2-beta/media \
    --form media=@recording.mp3 \
    --form 'configuration={
      "configuration": {
        "language": "en-UK",
      }
    }' \
    --header "Authorization: Bearer ${TOKEN}"

Australian English

curl https://apis.voicebase.com/v2-beta/media \
    --form media=@recording.mp3 \
    --form 'configuration={
      "configuration": {
        "language": "en-AU",
      }
    }' \
    --header "Authorization: Bearer ${TOKEN}"

Latin American Spanish

curl https://apis.voicebase.com/v2-beta/media  \
    --form media=@recording.mp3 \
    --form 'configuration={
      "configuration": {
        "language": "es-LA",
        "keywords": {
          "semantic": false
        },
        "topics": {
          "semantic": false
        }
      }
    }' \
    --header "Authorization: Bearer ${TOKEN}"

Brazilian Portuguese

curl https://apis.voicebase.com/v2-beta/media  \
    --form media=@recording.mp3 \
    --form 'configuration={
      "configuration": {
        "language": "pt-BR",
        "keywords": {
          "semantic": false
        },
        "topics": {
          "semantic": false
        }
      }
    }' \
    --header "Authorization: Bearer ${TOKEN}"