
Closed
Posted
I am preparing a large batch of English conversation recordings for machine-learning training and need extra hands right away. Your job is to listen to each clip, split or merge the audio where necessary, and add precise conversational tags—specifically turn-taking cues and backchanneling. The content is mostly casual chat, so there is no need to identify speakers, emotions, or specialized domains such as technology, healthcare, or education. You will work inside our browser-based interface that lets you zoom the waveform, set boundaries, and choose labels from a predefined menu. I will share a short style guide that explains when to mark an utterance as a backchannel (“uh-huh”, “right”) versus a speaker hand-off, plus a set of sample files so you can practice before the real work begins. Key details • Applications close in 3 days. • I need at least 10 hours of work from you before June 7th; more is welcome if you have the time. • Accuracy is critical—every file you finish will be spot-checked against the guidelines, and any corrections must be applied promptly. Deliverables 1. Fully segmented audio files with turn-taking and backchanneling tags applied. 2. A brief note on any edge cases or guideline ambiguities you encounter. If you’re comfortable with English conversational nuances and can start immediately, I’d love to bring you on.
Project ID: 40487583
9 proposals
Remote project
Active 4 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
9 freelancers are bidding on average $33 USD/hour for this job

With 25 years of technical audio location and post-production experience, including a Masters degree in Audio Production, I am the most seasoned and qualified candidate for your project. Unlike others who may claim to be skilled in audio editing based on tangential fields like video editing or translation, I have dedicated my career to audio. This specialization is crucial when it comes to precisely handling and annotating conversation clips, which require an intricate understanding of turn-taking cues and backchanneling - skills that I possess thanks to my extensive expertise working with AI voices. Beyond my impressive background, every single project I've undertaken has been completed to the utmost satisfaction without the need for any revisions. You mentioned that accuracy is critical for this task, so let me assure you - it's synonymous with my work ethic. Hiring me means hiring not only the best but also someone who understands the importance of deadlines. Despite being in high demand across various platforms (ranking within the top 900 freelancers from over 85 million users), I'm ready to prioritize your project and deliver no less than outstanding quality within your timeline. Let's talk more – take a chance on proven excellence today!
$50 USD in 40 days
8.0
8.0

Torochō, Japan
Payment method verified
Member since Jun 3, 2026
$10-5000 USD
₹600-1500 INR
₹750-1250 INR / hour
$30-250 CAD
$10-30 USD
₹12500-37500 INR
$250-750 USD
₹750-1250 INR / hour
$30-250 USD
₹12500-37500 INR
$30-250 USD
₹1500-12500 INR
$10-30 CAD
₹600-1500 INR
$25-50 USD / hour
£20-250 GBP
$15-25 AUD / hour
$15-25 USD / hour
₹750-1250 INR / hour
₹750-1250 INR / hour