Freelance Opportunity Transcription Specialist Remote
Transcription Specialist
Uber AI Solutions is seeking detail-oriented transcription specialists to support a large-scale generative AI training program. In this engagement, you will transcribe and annotate audio files (Single & Multitrack) with accuracy, capturing utterance, stutter, and linguistic nuance exactly as spoken.
Supported Languages & Dialects
We are looking for freelancers in the following languages:
- Arabic: (ar-001 | ar-MSA), (ar-SA), (ar-AE | ar-UAE)
- Bengali: (bn-BD | bn-IN)
- Catalan: (ca-ES)
- Chinese: (zh-CN | zh-Hans), (zh-Hant), (zh-HK), (zh-TW)
- Croatian: (hr-HR)
- Czech: (cs-CZ)
- Danish: (da-DK)
- Dutch: (nl-NL)
- English: (en-US), (en-GB)
- Estonian: (et-EE)
- Finnish: (fi-FI)
- French: (fr-FR), (fr-CA)
- German: (de-DE), (de-CH)
- Greek: (el-GR)
- Hebrew: (he-IL)
- Hindi: (hi-IN)
- Hungarian: (hu-HU)
- Indonesian: (id-ID)
- Italian: (it-IT)
- Japanese: (ja-JP)
- Kannada: (kn-IN)
- Korean: (ko-KR)
- Lithuanian: (lt-LT)
- Maithili: (mai-IN)
- Malay: (ms-MY)
- Malayalam: (ml-IN)
- Norwegian: (no-NO)
- Polish: (pl-PL)
- Portuguese: (pt-PT), (pt-BR)
- Romanian: (ro-RO)
- Russian: (ru-RU)
- Sinhala: (si-LK)
- Slovak: (sk-SK)
- Spanish: (es-ES), (es-US), (es-419 | es-LATAM), (es-MX)
- Swedish: (sv-SE)
- Tagalog/Filipino: (tl-PH)
- Tamil: (ta-IN)
- Telugu: (te-IN)
- Thai: (th-TH)
- Turkish: (tr-TR)
- Ukrainian: (uk-UA)
- Urdu: (ur-PK)
- Vietnamese: (vi-VN)
What you’ll work on
- Transcription: Transcribe audio with 98% accuracy, capturing every disfluency, filler word (um, uh), false start, and stutter exactly as heard.
- Precision Timestamping: Align text segments to the audio waveform with millisecond precision (max gap <500ms).
- Speaker Identification: Accurately identify and label speakers in multi-speaker audio files (2–8 interlocutors).
- Tagging and Annotation: Apply correct tags for non-speech events—like (laughs) or (applause)—and unintelligible segments.
Skills and Qualifications
- Native-level fluency: You must be a native speaker of the assigned language with a deep understanding of cultural nuances and regional accents.
- Attention to Detail: You can distinguish between "clean" speech and "verbatim" speech (e.g., typing "I- I- I don't know" instead of "I don't know").
- Tech Savvy: You are comfortable learning and navigating new web-based annotation tools.
Engagement Details
- Location: Remote (Global)
- Volume: Steady task flow available for high-quality contributors. (Note: Additional details around the project will be provided as they become available.).
- Flexibility: Work on your own schedule, provided quality, consistency, and deadline standards are met.
- Type: Freelance/Independent Contractor
Why this matters
Your expertise will guide how AI systems handle complex logic and human-centered communication. By transcribing and refining audio and text and responses, you’ll help ensure that AI is not only accurate but also clear, safe, and engaging for professional use.