
Suljettu
Julkaistu
Maksettu toimituksen yhteydessä
Looking for someone who can fine-tune an ASR (speech-to-text) model. This project is specifically for the Yiddish language, so I am looking for someone who has familiarity with Yiddish or has previously worked on Yiddish-related projects (such as speech, NLP, or text processing). Please share relevant experience, especially any work involving Yiddish.
Projektin tunnus (ID): 40330600
53 ehdotukset
Etäprojekti
Aktiivinen 15 päivää sitten
Aseta budjettisi ja aikataulu
Saa maksu työstäsi
Kuvaile ehdotustasi
Rekisteröinti ja töihin tarjoaminen on ilmaista
53 freelancerit tarjoavat keskimäärin $2 194 USD tätä projektia

⭐⭐⭐⭐⭐ Create a Fine-Tuned Whisper Model for Yiddish Transcription ❇️ Hi My Friend, I hope you're doing well. I've reviewed your project requirements and see you are looking for a fine-tuned Whisper model for Yiddish lecture transcriptions. You don't need to look any further; Zohaib is here to help you! My team has successfully completed 50+ similar projects for audio transcription. I will use the best Whisper implementation to achieve high accuracy and reliable performance for both live and archived audio. ➡️ Why Me? I can easily create the fine-tuned Whisper model as I have 5 years of experience in speech recognition and machine learning. My expertise includes audio processing, model training, and performance evaluation. Additionally, I have a strong grip on related technologies like PyTorch and NLP, ensuring a comprehensive approach to your project. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. Looking forward to discussing this with you in chat. ➡️ Skills & Experience: ✅ Speech Recognition ✅ Machine Learning ✅ Audio Processing ✅ Model Training ✅ Performance Evaluation ✅ Data Alignment ✅ Python Programming ✅ PyTorch ✅ Whisper Implementation ✅ Timestamp Preservation ✅ Error Analysis ✅ Report Writing Waiting for your response! Best Regards, Zohaib
$1 800 USD 2 päivässä
7,9
7,9

Hello, As the need for machine learning and deep learning solutions continues to grow, my team at Live Experts has positioned itself as a reputable provider of these services. With our high-level proficiency in these areas; highlighted in our broad range of successful projects: image & voice recognition, NLP, and more, we are confident in developing a fine-tuned Whisper model applicable to your Yiddish lecture collection. Complementing the employ of models like Whisper, we utilize powerful tools such as TensorFlow and Keras, ensuring the highest levels of precision in your outputs. Our expertise in Python further underscores our ability to work with whichever implementation you prefer, OpenAI's official repo or Hugging Face Transformers, and our wealth of experience in ML will ensure your hold-out set data is dealt with exactly as desired. We understand the significance of maintaining speaker nuances and accurate timestamps for subtitling purposes, as you've requested. Our team's meticulous nature ensures that final deliverables will be top-notch, a fact corroborated by our consistently low WERs and CERs on previous projects. Lastly, exhibiting our skill with model deployment and optimization, your single A100 GPU infrastructure will be more than sufficient for running the system we produce, yet it can also be quantized for CPU inference when convenient. Cambelieve vous ambition de m'engager et mon equipe?! Thanks!
$3 000 USD 1 päivässä
7,4
7,4

Hi, This is Elias from Miami. I checked your project description and understand you’re looking to fine-tune Whisper for Yiddish lecture transcription so it performs well on both archived audio and live-stream style input. The goal is not just better raw transcription, but strong domain vocabulary handling, reliable timestamps, and a setup that can train on a single A100 and later run efficiently on CPU after quantization. I’ve worked on model training and inference pipelines before, including evaluation, reproducibility, and deployment-focused optimization. My approach would be to start from a solid Whisper baseline, fine-tune on your aligned Yiddish pairs, validate against the hold-out set with WER/CER tracking, then package reproducible training and inference scripts plus a clear comparison report. I’d be happy to go through the details and suggest the best technical approach. I have a few questions to get a better understanding: Q1 – Which Whisper size are you expecting to start from, and do you already have a preferred implementation between OpenAI, Hugging Face, or faster-whisper? Q2 – Are your transcripts already normalized consistently for Yiddish spelling, punctuation, and speaker markers, or will part of the work include text cleanup rules before training? Q3 – For the live Zoom validation, do you want true streaming transcription output, or is short-window near-real-time batching acceptable? Looking forward to hearing from you.
$2 250 USD 7 päivässä
6,9
6,9

✅ Proposal for Fine-Tune Whisper for Yiddish Le With a robust background in AI and speech recognition technologies, I am uniquely qualified to enhance the Whisper model for Yiddish lectures. My experience includes extensive work with OpenAI’s models, Hugging Face Transformers, and PyTorch, ensuring a seamless integration and optimization process. I have successfully fine-tuned models for domain-specific vocabulary retention and improved error rates in multiple languages. My technical skills are complemented by my ability to run models efficiently on GPUs and adapt them for CPU usage. I am committed to delivering a fine-tuned Whisper model that meets your stringent accuracy and efficiency standards for both live and archival audio transcription. Lets collaborate to achieve exceptional transcription fidelity.
$3 000 USD 30 päivässä
7,0
7,0

Hi, As a individual developer and I can jump into on your suitable time. I can help in your project (most important in this project libraries, modules, and relative issue during this project fix, improve, development) With my expertise in full-stack development and experience working with modern web technologies like Python, PyTorch, Whisper fine-tuning, Hugging Face Transformers, audio preprocessing, timestamp alignment, quantization, and WER/CER evaluation pipelines, i can fine-tune a Yiddish speech-to-text model that improves domain vocabulary accuracy, preserves subtitle-ready timestamps, and runs efficiently on a single A100 with a path for CPU inference. You can expect clear communication, fast turnaround, and a high-quality result that fits seamlessly into your existing workflow. Best regards, Juan
$1 500 USD 7 päivässä
5,8
5,8

Hello Sir, Have you considered how a tailored ASR solution can enhance your Yiddish lectures? I specialize in fine-tuning models for specific languages, and my previous work with Yiddish-language projects ensures a nuanced approach. Let's connect to explore how we can elevate your lecture experience with advanced speech-to-text capabilities. Best, Smith
$2 250 USD 7 päivässä
5,7
5,7

I get how transcripts that drop Yiddish terms or blur speaker turns make recordings unusable for publication. I just finished fine-tuning Whisper for a minority language lecture series and hit single-digit WER while preserving word-level timestamps. The best thing about me is I’ve worked on a very similar project recently. I fine-tuned a Whisper variant using aligned audio–text pairs, added language-specific token handling, kept accurate timestamps for subtitling, and produced reproducible training and inference notebooks that ran on a single A100 and were later quantized for CPU. I understand your flow: ingest live stream or archive, preprocess and align, train on A100 with PyTorch/Hugging Face or OpenAI Whisper, validate with your eval script, export checkpoints, and provide quantized inference (ONNX/INT8) with timestamped output. I can save you time by reusing my training recipes, logging, and evaluation pipelines. Quick questions: are your alignments word-level or only utterance-level, and do you want integrated speaker diarization/IDs or just timestamps per utterance? If that fits, let’s chat or hop on a short call. Regards Ali Zain!!
$2 250 USD 7 päivässä
4,8
4,8

Hello, With over 7 years of experience in Machine Learning (ML) and Python, I have carefully reviewed your requirement for fine-tuning Whisper for Yiddish lectures. To achieve near-publishing accuracy, I propose to fine-tune the Whisper model using aligned Yiddish audio-text pairs. I will use a PyTorch-based implementation to ensure the model archives a single-digit word-error-rate on the hold-out set, preserves accurate timestamps for subtitling, and can run on a single A100 GPU while being quantized for CPU inference. The deliverables will include the fine-tuned model checkpoint(s), reproducible training and inference scripts with clear hyper-parameter logging, and a detailed report comparing baseline versus tuned performance in terms of WER, CER, and latency. Upon successful testing through live Zoom feed and batch file transcription, the project will be considered complete. I would like to discuss this project further with you. Please connect with me for a detailed chat. You can visit my Profile: https://www.freelancer.com/u/HiraMahmood4072 Thank you.
$1 550 USD 7 päivässä
4,6
4,6

Hi there, I understand you need a fine-tuned Whisper model that can handle Yiddish lectures with high accuracy, preserving domain-specific vocabulary, speaker nuances, and precise timestamps for both real-time and batch transcription. The key challenge here is reducing WER while maintaining alignment quality and ensuring the model is efficient enough for A100 deployment and optional CPU quantization. My approach is to fine-tune Whisper using your aligned audio-text pairs with careful preprocessing, domain adaptation, and tokenization adjustments for Yiddish. I will optimize training to minimize WER on your hold-out set, implement timestamp alignment validation, and prepare an inference pipeline that supports both streaming and offline transcription. I’ll also apply quantization (e.g., INT8) to enable efficient CPU inference without significant accuracy loss. You’ll receive a trained model, reproducible training pipeline, and an optimized inference setup that delivers near-publishing quality transcripts with reliable timestamps. This ensures your lecture content can be transcribed consistently at scale while meeting your accuracy and performance requirements. Regards, Ahmad
$1 500 USD 7 päivässä
4,4
4,4

Hello, I understand the scope of your project and the need for a fine-tuned Whisper model to accurately transcribe your Yiddish lecture recordings with domain-specific vocabulary and speaker nuances. The goal is to achieve near-publishing accuracy in transcriptions, whether the audio is streamed live or from an archive. I have prior experience developing a similar project involving Python, Machine Learning (ML), and Deep Learning. In that project, I encountered challenges with domain-specific vocabulary and speaker nuances, which I addressed by fine-tuning the model to improve transcription accuracy. I would love to discuss this project further with you to understand your specific requirements and how I can tailor the solution to meet your needs. Please let me know a convenient time for a call to delve deeper into the project details. Looking forward to the opportunity to collaborate on this exciting project. Regards, Jayabrata Bhaduri
$2 300 USD 7 päivässä
4,0
4,0

⭐⭐⭐⭐⭐ ✅Hi there, hope you are doing well! I have fine-tuned speech-to-text models before using domain-specific audio-text pairs to achieve high transcription accuracy efficiently. From my experience, the most critical aspect to successfully completing this project is preserving timestamp alignment while reducing word error rate during fine-tuning. Approach: ⭕ I will fine-tune Whisper using the aligned Yiddish audio-text pairs you provide. ⭕ Optimize hyperparameters to minimize error rates on your hold-out set. ⭕ Ensure timestamp accuracy for subtitling. ⭕ Prepare reproducible scripts for training and inference with clear logging. ⭕ Quantize the model to run effectively on CPU post-training. ❓ Could you please share the size and diversity of your aligned training dataset? ❓ Is there a preferred Whisper implementation you favor for this project? ❓ Do you have specific latency targets for real-time transcription? I am confident in delivering a fine-tuned Whisper model checkpoint and full pipeline that meets your accuracy and performance goals efficiently. Best regards, Nam
$2 500 USD 15 päivässä
3,8
3,8

Hello! This is James from Hollywood. I've carefully read your project description about fine-tuning Whisper for your Yiddish lecture recordings. I understand the importance of this task and I'm excited about the opportunity to assist you with it. With over 15 years of experience as a senior Full-Stack & AI Engineer, I have a strong background in Python, Machine Learning, and Audio Processing. I’ve worked on similar projects, including optimizing audio transcription systems and enhancing NLP capabilities for diverse applications. My focus is always on delivering solutions that are not only technically sound but also practical and maintainable. To ensure I fully grasp your needs, could you please clarify the following questions to help me better understand the project? 1. What specific enhancements are you looking for in the Whisper model regarding Yiddish audio? 2. Are there particular audio quality considerations or formats I should be aware of? 3. Do you have any existing datasets or resources that can aid in the fine-tuning process? I’m committed to ensuring that this project meets its goals effectively. I propose a phased approach, starting with an assessment of your current recordings, followed by model fine-tuning and iterative testing to ensure the best results. Looking forward to your response!
$2 500 USD 10 päivässä
3,2
3,2

Hi there, I’m excited about the opportunity to fine-tune the Whisper model for your collection of Yiddish lecture recordings. I've reviewed your requirements and am confident in my ability to deliver a high-quality solution that meets your needs. With over 9 years of experience in Python and deep learning, I've successfully implemented similar projects and achieved single-digit word error rates using custom models. I specialize in audio processing and NLP, and I will ensure that the final model not only accurately transcribes the lectures but also maintains critical speaker nuances and timestamps for subtitling. I can start this project immediately and look forward to collaborating closely to align on your specific requirements. Best regards, Sadam
$2 700 USD 30 päivässä
2,5
2,5

Hi there, Fine-tuning Whisper to achieve the transcription accuracy you need for your Yiddish lectures is critical. Having worked on similar projects for the past 4 years, I understand how to enhance models to maintain domain-specific vocabulary and subtleties. I will utilize aligned Yiddish audio-text pairs to train the model on either OpenAI's official repo or Hugging Face Transformers. My plan includes: - Implementing a training pipeline that ensures a single-digit word-error-rate on your hold-out set. - Preserving timestamps essential for subtitling. - Ensuring compatibility with a single A100 GPU, along with quantization for CPU inference. Upon project completion, you'll receive the fine-tuned model checkpoint(s), reproducible scripts with clearly logged hyper-parameters, and a concise report comparing the baseline and tuned performances (WER, CER, latency) to ensure everything meets your standards. Thanks,
$2 500 USD 20 päivässä
0,0
0,0

Hey , I just went through your job description and noticed you need someone skilled in Audio Processing, Machine Learning (ML), Natural Language Processing, Deep Learning and Python. That’s right up my alley. You can check my profile —I’m Software engineer working at large-scale apps as a lead developer with U.S. and European teams. I’ve handled several projects using these exact tools and technologies. Before we proceed, I’d like to clarify a few things: Are these all the project requirements or is there more to it? Do you already have any work done, or will this start from scratch? What’s your preferred deadline for completion? Why Work With Me? 1) Over 230 successful projects completed. 2) I have not received a single bad feedback since the last 5-6 years. 3) You will find 5 star feedback on the last 100+ major projects which shows my clients are happy with my work. 4) Long-term track record of happy clients and repeat work. I prioritize quality, deadlines, and clear communication. Availability: 9am – 9pm Eastern Time (Full-time freelancer) I can share recent examples of similar projects in chat. Let’s connect and discuss your vision in detail. Kind Regards, Imran Haider
$1 500 USD 5 päivässä
0,0
0,0

Hi there! I’m excited about your project on fine-tuning Whisper for Yiddish lectures. Transcribing nuanced content with domain-specific vocabulary is crucial, and I understand the importance of achieving near-publishing accuracy. With my experience in Python and deep learning, I can effectively implement the fine-tuning process using your provided audio-text pairs. I have a strong background in Natural Language Processing and have worked extensively with Whisper models. Ensuring a low word-error rate and maintaining accurate timestamps for subtitling aligns perfectly with my expertise. I’ve previously developed models that efficiently run on A100 GPUs and are quantized for CPU inference, guaranteeing a streamlined performance. You can expect clear documentation on hyper-parameters along with a comprehensive report comparing the baseline and fine-tuned performance metrics. Let’s discuss how we can make this a success together!
$1 500 USD 7 päivässä
0,0
0,0

Hi there, I’m Simon, excited to help you turn your growing Yiddish lecture collection into a reliable, near-publishable transcript system. I love languages in the wild, your live streams and archives present a real enrichment challenge, and I’m keen to tailor a Whisper fine-tune that preserves speaker nuance and domain vocabulary while keeping latency in check for subtitling. Section 2: I’d approach this with a reproducible recipe: data alignment, domain-adaptive fine-tuning on your paired Yiddish audio-text, and a lightweight eval harness that reports WER/CER, latency, and timestamps. I’ll provide scripts for training and inference, with hyperparameters logged and clear guidance on quantization for CPU speed. The result: a single-GPU workflow that hits your hold-out WER target and scales to streaming input. Section 3: I’ll attach concise, relevant experiment snapshots (baseline vs. tuned) and a compact report. Ready for a quick call to align on your hold-out set and timeline?
$1 500 USD 5 päivässä
0,0
0,0

Hello, With proficiency in Python and experience building predictive models using TensorFlow, PyTorch, and scikit-learn, I am confident in my ability to deliver a finely-tuned Whisper model for your Yiddish lecture transcriptions. My expertise extends to NLP, an essential aspect of this project that will allow me to incorporate domain-specific vocabulary and capture speaker nuances effectively. I can utilize OpenAI's official repo or Hugging Face Transformers, as per your preference, ensuring the final system achieves single-digit word-error-rate on your hold-out set. Being well-versed in backend solutions, I can ensure the solution runs optimally on a single A100 GPU while also being able to quantize it for CPU inference. My meticulousness with comprehensive documentation during development - including hyperparameters - will give you the ability to reproduce and maintain the solution even after project completion. Moreover, my proficiency with AWS services like Lambda and ECS combined with Docker and Kubernetes make me uniquely equipped for AI model deployment at scale - which undoubtedly aligns with your need for live transcription as well as batch file transcriptions. Let's work together to not only provide a reliable and accurate speech-to-text solution but also deliver a sustainable and future-ready product. Thanks!
$1 500 USD 15 päivässä
0,0
0,0

Spring Valley, United States
Liittynyt maalisk. 27, 2026
$250-750 USD
£20-250 GBP
€30-250 EUR
₹1500-12500 INR
₹600-1500 INR
$30-250 USD
$1500-3000 USD
$8-15 USD/ tunnissa
$250-750 USD
$250-750 USD
₹400-750 INR/ tunnissa
₹1500-12500 INR
$30-250 AUD
$2-8 USD/ tunnissa
$10-30 USD
$2-8 USD/ tunnissa
$30-250 USD
$1500-3000 USD
₹250000-500000 INR
€250-750 EUR