> ## Documentation Index
> Fetch the complete documentation index at: https://developers.deepl.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Create a voice translation job

> Creates an async voice translation job. The response includes an upload URL for the source audio file.



## OpenAPI

````yaml post /v1/jobs/voice/translate
openapi: 3.0.3
info:
  title: DeepL API Documentation
  description: >-
    The DeepL API provides programmatic access to DeepL’s language AI
    technology.


    Note: this OpenAPI spec is embedded into our API documentation and has
    shortened descriptions.
  termsOfService: https://www.deepl.com/pro-license
  contact:
    name: DeepL - Contact us
    url: https://www.deepl.com/contact-us
  version: 3.11.0
servers:
  - url: https://api.deepl.com
    description: DeepL API Pro
  - url: https://api-free.deepl.com
    description: DeepL API Free
security: []
tags:
  - name: beta
    description: >-
      Experimental features that are under testing and not yet intended for
      production use.
  - name: TranslateText
    description: >-
      The text-translation API currently consists of a single endpoint,
      `translate`, which is described below.
  - name: TranslateDocuments
    description: >-
      The document translation API allows you to translate whole documents and
      supports the following file types and extensions:
        * `docx` - Microsoft Word Document
        * `pptx` - Microsoft PowerPoint Document
        * `xlsx` - Microsoft Excel Document
        * `pdf` - Portable Document Format
        * `htm / html` - HTML Document
        * `txt` - Plain Text Document
        * `xlf / xliff` - XLIFF Document, version 2.1
        * `srt` - SRT Document
        * `jpeg` / `jpg` / `png` - Image (currently in beta)
  - name: RephraseText
    description: >-
      The `rephrase` endpoint  is used to make corrections and adjustments to
      texts based on style or tone.
  - name: CorrectText
    description: >-
      The `correct` endpoint fixes spelling and grammar errors without broader
      rephrasing. Use it when you want

      a minimal-change correction pass rather than the broader rewriting
      performed by `rephrase`.
  - name: ManageMultilingualGlossaries
    description: >-
      The *glossary* functions allow you to create, inspect, edit and delete
      glossaries.

      Glossaries created with the glossary function can be used in translate
      requests by specifying the

      `glossary_id` parameter. A glossary contains (several) dictionaries.

      A dictionary is a mapping of source phrases to target phrases for a single
      language pair.

      If you encounter issues, please let us know at support@DeepL.com.


      Currently you can create glossaries with any of the languages DeepL
      supports (with the exception of Thai).


      The maximum size limit for a glossary is 10 MiB = 10485760 bytes and each
      source/target text,

      as well as the name of the glossary, is limited to 1024 UTF-8 bytes.

      A total of 1000 glossaries are allowed per account.


      When creating a dictionary with target language `EN`, `PT`, or `ZH`, it's
      not necessary to specify a variant

      (e.g. `EN-US`, `EN-GB`, `PT-PT`, `PT-BR`, or `ZH-HANS`).

      Dictionaries with target language `EN` can be used in translations with
      either English variant.

      Similarly `PT`, and `ZH` dictionaries can be used in translations with
      their corresponding variants.

      (When you provide the ID of a glossary to a translation, the appropriate
      dictionary is automatically applied. Currently glossaries can not yet be
      used with source language detection.)


      Glossaries created via the DeepL API are now unified with glossaries
      created via the DeepL website and DeepL apps.

      Please only use the v3 glossary API in conjunction with multilingual or
      edited glossaries from the website.
  - name: ManageGlossaries
    description: >-
      Please note that this is the spec for the (old) v2 glossary endpoint.

      We recommend users switch to the newer v3 glossary endpoints, which
      support editability and multilinguality.


      The *glossary* functions allow you to create, inspect, and delete
      glossaries.

      Glossaries created with the glossary function can be used in translate
      requests by specifying the

      `glossary_id` parameter.

      If you encounter issues, please let us know at support@DeepL.com.


      Currently you can create glossaries with any of the languages DeepL
      supports (with the exception of Thai).
  - name: MetaInformation
    description: Information about API usage and value ranges
  - name: TranslationMemories
    description: >-
      The translation memory endpoints allow you to interact with your account's
      translation memories, used to store

      and reuse previously created translations. Translation memories can be
      used in text translation requests by

      specifying the `translation_memory_id` parameter to denote a specific
      translation memory and the

      `translation_memory_threshold` which defines the minimum matching
      percentage required for a translation memory

      segment to be applied (recommended to be 75% or higher).
  - name: VoiceAPI
    description: >-
      The Voice API provides real-time voice transcription and translation
      services.

      Use a two-step flow: first request a streaming URL via REST, then
      establish a WebSocket connection for streaming audio and receiving
      transcriptions.
  - name: VoiceTranslateJob
    description: >-
      **Alpha.** Async voice translation jobs. This API may change without
      notice.
  - name: AdminApi
    description: >-
      Endpoints for organization administrators to manage API keys and retrieve
      usage analytics.
  - name: QualityEvaluation
    description: >-
      **Closed alpha.** Evaluate translation quality. Submit source/target
      segment pairs and retrieve per-segment quality issues categorized by error
      type and severity, with character spans pointing to where each issue
      occurs.
externalDocs:
  description: DeepL Pro - Plans and pricing
  url: https://www.deepl.com/pro#developer
paths:
  /v1/jobs/voice/translate:
    post:
      tags:
        - VoiceTranslateJob
      summary: Create a voice translation job
      description: >-
        Creates an async voice translation job. The response includes an upload
        URL for the source audio file.
      operationId: createVoiceTranslateJob
      parameters:
        - $ref: '#/components/parameters/VoiceTranslateJobIncludeParam'
      requestBody:
        required: true
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/VoiceTranslateCreateJobRequest'
            example:
              source_file:
                name: podcast-episode-42.mp3
                content_type: audio/mpeg
                content_length: 15728640
              parameters:
                source_language: en
              targets:
                - language: de
                  type: text/plain
                - language: es
                  type: audio/pcm;encoding=s16le;rate=16000
      responses:
        '201':
          description: Job created successfully.
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/VoiceTranslateCreateJobResponse'
              example:
                job_id: a74d88fb-ed2a-4943-a664-a4512398b994
                upload_url: >-
                  https://assets.deepl.com/collections/a74d88fb-ed2a-4943-a664-a4512398b994/assets/b1c2d3e4-f5a6-7890-abcd-ef1234567890
                signature: eyJhbGciOiJIUzI1NiIs...
        '400':
          $ref: '#/components/responses/VoiceTranslateJobBadRequest'
        '401':
          $ref: '#/components/responses/Unauthorized'
        '403':
          $ref: '#/components/responses/Forbidden'
        '429':
          $ref: '#/components/responses/TooManyRequests'
      security:
        - auth_header: []
components:
  parameters:
    VoiceTranslateJobIncludeParam:
      name: include
      in: query
      required: false
      description: >-
        Additional fields to include in the response.

        - `signed_url`: Include pre-signed URLs (`signed_upload_url` on create,
        `signed_download_url` on status) that can be used without an
        authorization header.
      schema:
        type: array
        items:
          type: string
          enum:
            - signed_url
      style: form
      explode: true
  schemas:
    VoiceTranslateCreateJobRequest:
      type: object
      required:
        - source_file
        - targets
      properties:
        source_file:
          $ref: '#/components/schemas/JobSourceFileRequest'
        parameters:
          $ref: '#/components/schemas/VoiceTranslateJobParametersRequest'
        targets:
          type: array
          minItems: 1
          description: >-
            One or more translation targets. Each target produces a separate
            result.
          items:
            $ref: '#/components/schemas/VoiceTranslateJobTargetRequest'
    VoiceTranslateCreateJobResponse:
      type: object
      required:
        - job_id
        - upload_url
        - signature
      properties:
        job_id:
          type: string
          format: uuid
          description: The unique identifier of the created job.
        upload_url:
          type: string
          format: uri
          description: >-
            The URL to upload the source audio file to via `PUT` with
            `Content-Type: application/octet-stream`.

            Requires the `Authorization: DeepL-Signature {signature}` header.

            See [Upload File](/api-reference/jobs-voice-translate#upload-file)
            for details.
        signature:
          type: string
          description: >-
            A short-lived token used to authorize the file upload. Pass it via
            the `Authorization` header as `DeepL-Signature {signature}` when
            uploading to the `upload_url`.
        signed_upload_url:
          type: string
          format: uri
          description: >-
            A pre-signed upload URL that does not require an authorization
            header. Only present when `?include=signed_url` is specified.
    JobSourceFileRequest:
      type: object
      description: Metadata about the source audio file to be uploaded.
      required:
        - name
        - content_type
        - content_length
      properties:
        name:
          type: string
          minLength: 1
          description: The file name of the source audio file.
          example: podcast-episode-42.mp3
        content_type:
          $ref: '#/components/schemas/VoiceTranslateJobSourceContentType'
        content_length:
          type: integer
          format: int64
          minimum: 1
          maximum: 1073741824
          description: >-
            The size of the source audio file in bytes. Maximum 1 GB
            (1,073,741,824 bytes).
          example: 15728640
    VoiceTranslateJobParametersRequest:
      type: object
      description: Processing parameters for the voice translation job.
      properties:
        source_language:
          allOf:
            - $ref: '#/components/schemas/SourceLanguage'
            - description: >-
                Language of the source audio. If omitted, the language is
                detected automatically.
    VoiceTranslateJobTargetRequest:
      type: object
      description: A translation target specifying the desired output language and format.
      required:
        - language
        - type
      properties:
        language:
          $ref: '#/components/schemas/TargetLanguage'
        type:
          $ref: '#/components/schemas/VoiceTranslateJobTargetOutputType'
    ErrorResponse:
      type: object
      required:
        - message
      properties:
        message:
          type: string
          description: A human-readable description of the error.
        code:
          type: string
          description: >-
            A machine-readable identifier for the error, when available. Clients
            should match on this value rather than on `message` when branching
            on error types.
          example: invalid_content_type
    VoiceTranslateJobSourceContentType:
      type: string
      description: The MIME type of the source audio file.
      enum:
        - audio/mpeg
        - audio/wav
        - audio/ogg
        - audio/flac
        - audio/mp4
        - audio/webm
    SourceLanguage:
      type: string
      description: >-
        Language of the text to be translated. If this parameter is omitted, the
        API will attempt to

        detect the language of the text and translate it.


        For the full list of supported source languages, see [supported
        languages](https://developers.deepl.com/docs/getting-started/supported-languages)
        or query the [`GET /v3/languages`
        endpoint](https://developers.deepl.com/api-reference/languages/retrieve-supported-languages-by-resource).
      example: EN
    TargetLanguage:
      type: string
      description: >-
        The language into which the text should be translated.


        For the full list of supported target languages, see [supported
        languages](https://developers.deepl.com/docs/getting-started/supported-languages)
        or query the [`GET /v3/languages`
        endpoint](https://developers.deepl.com/api-reference/languages/retrieve-supported-languages-by-resource).
      example: DE
    VoiceTranslateJobTargetOutputType:
      type: string
      description: The desired output format for the translation target.
      enum:
        - text/plain
        - application/x-subrip
        - audio/opus
        - audio/flac
        - audio/pcm;encoding=s16le;rate=16000
        - audio/pcm;encoding=s16le;rate=24000
        - audio/pcm;encoding=ulaw;rate=8000
        - audio/pcm;encoding=alaw;rate=8000
        - audio/x-matroska;codecs=aac
        - audio/x-matroska;codecs=flac
        - audio/x-matroska;codecs=opus
        - audio/x-matroska;codecs=pcm_s16le;rate=16000
        - audio/x-matroska;codecs=pcm_s16le;rate=24000
        - video/mp2t;codecs=aac
        - video/mp2t;codecs=opus
        - audio/ogg;codecs=flac
        - audio/ogg;codecs=opus
        - audio/webm;codecs=opus
  responses:
    VoiceTranslateJobBadRequest:
      description: Bad request. Check the error message and request parameters.
      content:
        application/json:
          schema:
            $ref: '#/components/schemas/ErrorResponse'
          example:
            message: '/targets/0/type: Value is not valid'
    Unauthorized:
      description: >-
        Authorization failed. Please supply a valid `DeepL-Auth-Key` via the
        `Authorization` header.
      content:
        application/json:
          schema:
            $ref: '#/components/schemas/ErrorResponse'
    Forbidden:
      description: >-
        Authorization failed. Please supply a valid `DeepL-Auth-Key` via the
        `Authorization` header.
      content:
        application/json:
          schema:
            $ref: '#/components/schemas/ErrorResponse'
    TooManyRequests:
      description: Too many requests. Please wait and resend your request.
      content:
        application/json:
          schema:
            $ref: '#/components/schemas/ErrorResponse'
  securitySchemes:
    auth_header:
      type: apiKey
      description: >
        Authentication with `Authorization` header and `DeepL-Auth-Key`
        authentication scheme. Example: `DeepL-Auth-Key <api-key>`
      name: Authorization
      in: header
      x-default: 'DeepL-Auth-Key '

````