OpenAI Whisper

Effortlessly transcribe spoken audio into text using OpenAI Whisper.

What is OpenAI Whisper?

The image shows a software interface labeled 'OpenAI Whisper Tool'. It includes fields for entering an API key, setting temperature with a default value of 0, uploading an audio file, and providing a text prompt. There are input and output nodes labeled 'File', 'String', and 'Output'.

The OpenAI Whisper tool transcribes spoken audio to text using OpenAI's Whisper model. It requires an audio file and an OpenAI API Key. You can also set the model temperature and supply an optional prompt to steer the style of the output. Transcriptions can be returned as Text, JSON, Verbose JSON, SRT, or VTT.

How to use it?

Using the OpenAI Whisper tool involves the following steps:

  1. Set OpenAI API Key:

    • Enter your OpenAI API Key. This is mandatory and is used to authenticate requests to OpenAI's API.
  2. Set Model Temperature:

    • Specify the model temperature. This sampling parameter controls how random the output is: lower values make the transcription more deterministic, while higher values allow more variation. The default value is 0.
  3. Upload Audio File:

    • Provide the audio file you wish to transcribe. This is a required input.
  4. Optional Prompt:

    • Optionally, provide a prompt to guide the transcription, for example to set the style or to supply the correct spelling of names and terms that occur in the audio.
  5. Select Output Format:

    • Choose the desired output format for the transcription. Options include Text, JSON, SRT, VTT, and Verbose JSON. The default is set to Text.
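
For readers who want to see what these settings correspond to outside the visual editor, the following is a minimal sketch that maps them onto OpenAI's transcription endpoint. It assumes the official `openai` Python package and the `whisper-1` model; whether the tool uses either of these internally is not stated here, and the helper names `build_transcription_params` and `transcribe` are hypothetical.

```python
from pathlib import Path

# Dropdown labels from the tool, mapped to the response_format values
# accepted by OpenAI's transcription endpoint.
RESPONSE_FORMATS = {
    "Text": "text",
    "JSON": "json",
    "SRT": "srt",
    "VTT": "vtt",
    "Verbose JSON": "verbose_json",
}


def build_transcription_params(temperature=0.0, prompt=None, output_format="Text"):
    """Translate the tool's settings into keyword arguments for the API call."""
    if not 0.0 <= temperature <= 1.0:
        raise ValueError("temperature must be between 0 and 1")
    if output_format not in RESPONSE_FORMATS:
        raise ValueError(f"unknown output format: {output_format!r}")
    params = {
        "model": "whisper-1",  # assumption: the hosted Whisper model name
        "temperature": temperature,
        "response_format": RESPONSE_FORMATS[output_format],
    }
    if prompt:  # the prompt is optional, so it is only sent when provided
        params["prompt"] = prompt
    return params


def transcribe(audio_path, api_key, **settings):
    """Send one audio file to the endpoint (requires `pip install openai`)."""
    from openai import OpenAI  # imported lazily so the helper above works without it

    client = OpenAI(api_key=api_key)
    with Path(audio_path).open("rb") as audio_file:
        return client.audio.transcriptions.create(
            file=audio_file, **build_transcription_params(**settings)
        )
```

With no arguments, `build_transcription_params()` reproduces the tool's defaults: temperature 0 and Text output.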

Example of usage

OpenAI Whisper can be used to transcribe audio interviews into text for further analysis.

A flowchart with three modules connected by arrows. The first module, 'File Reader', shows settings for AWS S3, region set to Frankfurt, and file path 'fairy-tale.mp3'. The second, 'OpenAI Whisper', indicates audio file input and output as text. The third, 'Output', displays the output format as text.

Example Task: Transcribe an Audio Interview

Objective: Convert an audio file of an interview into text for documentation and analysis.

Step-by-Step Setup

  1. Configure OpenAI Whisper:

    • Select the OpenAI Credentials.
    • Define the model temperature. For this example, we use 1.
  2. Upload the Audio File:

    • Connect the audio file to the "Audio File Reader" input anchor.
  3. Choose Output Format:

    • Select the desired output format from the "Output" dropdown. For this example, choose "Text".
  4. Connect to Output:

    • Connect the "Transcription" output anchor to the Text Output to display the transcribed text.
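
The wiring in this setup can be pictured as three functions chained together. The sketch below is only an illustration of that data flow, not the platform's implementation: every function name is hypothetical, a local file stands in for the AWS S3 source shown in the flowchart, and `fake_transcriber` returns placeholder text so the flow runs without an API key.

```python
from pathlib import Path


def file_reader(path):
    """Stand-in for the File Reader node: return the raw audio bytes."""
    return Path(path).read_bytes()


def whisper_node(audio, transcriber):
    """Stand-in for the OpenAI Whisper node: audio in, transcription out."""
    return transcriber(audio)


def text_output(text, sink=print):
    """Stand-in for the Output node: hand the text to a display function."""
    sink(text)
    return text


def run_flow(path, transcriber, sink=print):
    """Connect the three nodes in the same order as the flowchart."""
    audio = file_reader(path)
    text = whisper_node(audio, transcriber)
    return text_output(text, sink)


def fake_transcriber(audio):
    """Placeholder transcriber that returns fixed text so the flow runs offline."""
    return f"[transcript of {len(audio)} bytes of audio]"
```

Replacing `fake_transcriber` with a function that calls the real transcription service would leave the rest of the wiring unchanged, which is the point of connecting nodes through anchors.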

Additional Information

Ensure that your OpenAI API Key has permissions to access the Whisper transcription service. For best results, use clear audio recordings and, if possible, provide a guiding prompt to improve the accuracy and relevance of the transcriptions.
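
Some of this advice can be checked before a file is ever uploaded. The following is a minimal sketch under stated assumptions: the extension list and the 25 MB size limit reflect OpenAI's published limits for its transcription endpoint at the time of writing and may differ from what this tool enforces, and the helper names `check_audio_file` and `build_prompt` are hypothetical.

```python
from pathlib import Path

# Assumptions taken from OpenAI's transcription documentation, not from this tool.
SUPPORTED_EXTENSIONS = {
    ".flac", ".m4a", ".mp3", ".mp4", ".mpeg", ".mpga", ".ogg", ".wav", ".webm",
}
MAX_BYTES = 25 * 1024 * 1024  # 25 MB upload limit


def check_audio_file(path):
    """Return a list of problems; an empty list means the file looks acceptable."""
    file = Path(path)
    if not file.is_file():
        return [f"{file} does not exist"]
    problems = []
    if file.suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {file.suffix or '(none)'}")
    if file.stat().st_size > MAX_BYTES:
        problems.append("file is larger than 25 MB")
    return problems


def build_prompt(terms):
    """Join names and jargon from the recording into a short guiding prompt."""
    cleaned = [term.strip() for term in terms if term.strip()]
    if not cleaned:
        return ""
    return "Terms used in this recording: " + ", ".join(cleaned) + "."
```

A prompt built this way gives the model the intended spelling of names and technical terms, which is one common use of the optional prompt field. Neither helper can judge how clear a recording is; that still needs a listen.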

By following these steps, you can effectively use OpenAI Whisper to transcribe audio files into various text formats, enabling seamless integration into your workflows and applications.