Create a Job
A Job is a set of audio/video files to transcribe that have the same language and model requirements. Click Create Job under All Jobs to open the Create Job slide-out.
Step 1 – Upload Files: Start uploading files by clicking on the file upload zone or dragging and dropping files to it. You can delete the files that you have uploaded by mistake. We have the following limits of the files that you upload:
They must be common audio or video files (mp3, wav, mp4, ogg, mov, etc).
The service is designed for simple conversational content. The models might not perform well on non-conversational content, such as audible footsteps or a job site.
Size limit: 1GB per file.
Duration limit: 6 hours total.
Some files, especially video files, can be incompatible with web browsers and may be rejected. If that happens, please extract the audio from your video file and/or convert the audio to a popular format such as wav or mp3 for compatibility.
Step 2 – Select a model: You currently have 2 models to choose from:
Google Gemini, which is Google's flagship AI model that is capable of handling audio transcription tasks. Though currently experimental, it can produce surprisingly good results.
OpenAI Whisper, though created by OpenAI, is run on a Brown-maintained service, not on OpenAI servers.
Note: Unlike gemini.google.com, which the Brown community also has access to, the Gemini models that we provide in Transcribe is served differently and can only handle Risk Level 2 data.
Step 3 – Name: Name the Job. If not supplied, a name is automatically determined from the names of the files.
Step 4 – Language: selects a language or dialect. Different models may have different options.
Step 5: Click on Start to send the files for transcription - the Start button will appear faded if not all steps are complete.
Remember: The Job does not begin until you press Start!
Last updated
Was this helpful?