
Closed
Posted
Paid on delivery
Looking for a simple Python script for a voice pipeline that is fully local (offline), runs on Windows (Core i7), processes microphone audio in real time, uses VAD to detect end-of-speech, performs streaming STT, and immediately streams the recognized text into TTS for echo playback. End-to-end latency must be under 1 second. Any open-source stack that meets this latency is welcome.
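To make the end-of-speech requirement above concrete, here is a minimal sketch of hangover-based detection. It uses a plain RMS energy threshold as a stand-in for a real VAD model, and the sample rate, frame length, threshold and 300 ms hangover are all assumed values rather than anything the brief specifies. The point it illustrates is that the hangover (how long silence must last before the utterance is declared finished) is spent out of the 1-second latency budget.

```python
import math
import struct

SAMPLE_RATE = 16000        # Hz (assumed)
FRAME_MS = 30              # frame length in ms (assumed)
HANGOVER_MS = 300          # silence required to declare end-of-speech (assumed)
ENERGY_THRESHOLD = 500.0   # RMS threshold on 16-bit samples (assumed)


def rms(frame: bytes) -> float:
    """Root-mean-square amplitude of a 16-bit little-endian mono PCM frame."""
    count = len(frame) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack(f"<{count}h", frame[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


class EndOfSpeechDetector:
    """Reports end-of-speech after a run of silent frames that follows speech."""

    def __init__(self, frame_ms=FRAME_MS, hangover_ms=HANGOVER_MS,
                 threshold=ENERGY_THRESHOLD):
        self.frames_needed = max(1, hangover_ms // frame_ms)
        self.threshold = threshold
        self.in_speech = False
        self.silent_frames = 0

    def process(self, frame: bytes) -> bool:
        """Feed one frame; return True only on the frame that ends an utterance."""
        if rms(frame) >= self.threshold:
            self.in_speech = True
            self.silent_frames = 0
            return False
        if not self.in_speech:
            return False  # silence before any speech is ignored
        self.silent_frames += 1
        if self.silent_frames >= self.frames_needed:
            self.in_speech = False
            self.silent_frames = 0
            return True
        return False


def make_frame(amplitude: int, frame_ms: int = FRAME_MS) -> bytes:
    """Build a constant-amplitude frame (hypothetical helper for testing)."""
    n = SAMPLE_RATE * frame_ms // 1000
    return struct.pack(f"<{n}h", *([amplitude] * n))


if __name__ == "__main__":
    detector = EndOfSpeechDetector()
    stream = [make_frame(2000)] * 5 + [make_frame(0)] * 12
    for index, frame in enumerate(stream):
        if detector.process(frame):
            print(f"end of speech detected at frame {index}")
```

A shorter hangover lowers latency but risks cutting the speaker off at natural pauses, so this value would need tuning against real recordings.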
Project ID: 40197631
28 proposals
Remote project
Active 9 days ago
28 freelancers are bidding an average of $19 USD for this project

Hello client, I'm Denis Redzepovic, an experienced developer with expertise in Software Architecture, Natural Language Processing, C Programming, Linux, Python, Deep Learning, Audio Engineering and Audio Processing. I have worked extensively on diverse Python projects, ranging from backend development and automation to data processing and API integrations. My deep understanding of Python’s libraries and frameworks allows me to build efficient, scalable, and maintainable solutions. I pay close attention to code quality and performance to ensure your project runs flawlessly. With my solid experience, I’m confident I can deliver results that exceed your expectations. I focus on writing clean, maintainable, and scalable code because I know the difference between 99% and 100%. If you hire me, I’ll do my best until you’re completely satisfied with the result. Let’s discuss your project details so I can tailor the perfect Python solution for you. Thanks, Denis
$30 USD in 1 day
5.6

Greetings, I see you're looking for a Python script that can process microphone audio in real-time, all while ensuring it's fully offline and runs smoothly on a Windows system. Your requirements for voice activity detection, speech-to-text conversion, and immediate text-to-speech playback, all under a one-second latency, are quite clear. To tackle this, I would leverage an open-source stack that efficiently integrates these components. My experience with audio processing and deep learning will help create a robust solution tailored to your needs. I have worked on similar projects, optimizing audio pipelines and ensuring seamless performance, which will be beneficial in achieving the low latency you require. I’m excited about the opportunity to help bring your vision to life. Best regards, Saba Ehsan
$15 USD in 30 days
5.4

Hi there, I’ve carefully reviewed your project requirements, and with my extensive experience in developing Python scripts and applications, I’m confident that I can deliver a high-quality solution tailored to your needs. Whether it’s automation, data processing, or custom application development, I have the skills to ensure your project’s success. I’d love to discuss how I can contribute and help bring your vision to life. Feel free to check out my portfolio for more examples of my work: Portfolio: https://www.freelancer.com/u/webmasters486 Looking forward to hearing from you! Best regards, Muhammad Adil
$30 USD in 1 day
5.1

Hi, I can build a fully offline, real-time voice pipeline for Windows that listens to your microphone, detects speech, converts it to text, and immediately plays it back via TTS (“echo playback”) with expected end-to-end latency. The solution will use open-source tools only — sounddevice for audio I/O, Silero VAD for speech detection, Vosk for streaming STT, and Piper TTS for fast offline speech synthesis. The pipeline will be threaded and streaming, ensuring smooth real-time performance on a Core i7 CPU without any cloud services or paid APIs. I can deliver a working Python prototype that demonstrates the full flow, along with clear instructions for running and extending it locally. Looking forward to helping you bring this low-latency voice system to life. Best regards, Sameer
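The "threaded and streaming" design described in this proposal could be sketched roughly as follows. This is only an illustration of the queue-per-stage pattern: `fake_vad`, `fake_stt` and `fake_tts` are hypothetical stand-ins, not calls into Silero VAD, Vosk or Piper, and text strings play the role of audio frames so the example runs with the standard library alone.

```python
import queue
import threading

STOP = object()  # sentinel that tells each stage to shut down


def stage(fn, inbox, outbox):
    """Apply fn to every item from inbox and forward non-None results to outbox."""
    while True:
        item = inbox.get()
        if item is STOP:
            outbox.put(STOP)  # propagate shutdown to the next stage
            break
        result = fn(item)
        if result is not None:
            outbox.put(result)


# Hypothetical stand-ins; a real build would call the actual engines here.
def fake_vad(frame):
    return frame if frame.strip() else None  # drop "silent" (blank) frames


def fake_stt(frame):
    return frame.upper()  # pretend transcription


def fake_tts(text):
    return f"<audio:{text}>"  # pretend synthesis


def run_pipeline(frames):
    """Push frames through VAD -> STT -> TTS, one thread per stage."""
    fns = [fake_vad, fake_stt, fake_tts]
    queues = [queue.Queue() for _ in range(len(fns) + 1)]
    threads = [
        threading.Thread(target=stage, args=(fn, queues[i], queues[i + 1]),
                         daemon=True)
        for i, fn in enumerate(fns)
    ]
    for thread in threads:
        thread.start()
    for frame in frames:
        queues[0].put(frame)
    queues[0].put(STOP)

    output = []
    while True:
        item = queues[-1].get()
        if item is STOP:
            break
        output.append(item)
    for thread in threads:
        thread.join()
    return output


if __name__ == "__main__":
    print(run_pipeline(["hello", "   ", "world"]))
```

Because each stage has its own thread and queue, a slow stage delays only the items behind it instead of blocking microphone capture. In a real pipeline the first queue would be fed from the audio callback.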
$50 USD in 7 days
2.9

Hi there, I am ready to start the project immediately and provide high-quality work. I have 10 years of professional experience in video editing and creation, and I have completed 350+ similar projects. You can check an example of one of those projects in my portfolio here: https://www.freelancer.com/u/Vsion2 I'm interested in discussing your project. If you have any questions or special requirements, please don’t hesitate to message me. I'd be pleased to have the chance to assist you further with your project. Best regards, Alema Akter
$15 USD in 1 day
2.5

Hi Perdeep, I am Vasyl, a seasoned developer with 8+ years of experience in Python, React, Angular, and Node.js. I have carefully reviewed your project requirements for an Offline Real-Time Voice Echo Pipeline. I specialize in backend development with Node.js and frontend technologies like React, Vue, and Angular, particularly in B2B/B2C SAAS and marketing tools. For this project, I propose implementing a Node.js Express backend and a React frontend, with a MySQL database. I will analyze the project requirements, set up an agile environment, design the database schema, and integrate backend APIs with the frontend UI. My proactive approach and attention to detail ensure successful project completion. Let's discuss further details in chat. Thanks, Vasyl
$14 USD in 7 days
1.6

I appreciate the opportunity to work on your Python voice pipeline project. Your requirement for a seamless, fully local offline solution with real-time microphone processing and sub-1-second end-to-end latency is clear and essential for a smooth user experience. I may be new to Freelancer, but I bring solid experience to the table, including expertise in VAD, streaming STT, and integrated TTS systems optimized for performance on Windows. I’m happy to offer a free call to go over the project if you’d like. Regards, Blaze Nicholas
$10 USD in 14 days
0.3

Hi there, I understand that your main goal is to develop an efficient offline real-time voice echo pipeline that enhances audio quality and reduces latency. In my previous role, I successfully implemented a real-time audio processing system, which decreased echo artifacts by 40% and improved overall audio clarity for a leading communication platform. Additionally, I optimized algorithms that enhanced processing speed, resulting in a 25% reduction in latency during voice transmission. To address your requirements, I will design a robust voice echo pipeline that leverages advanced audio processing techniques to mitigate echo effectively. I will also ensure compatibility with offline environments to maintain high performance without relying on continuous internet connectivity. I would be happy to discuss your needs and get started right away. Best regards, Adrian
$15 USD in 7 days
0.0

Hello, With over 11 years of experience as a Full Stack Developer, my focus on robust architectural design combined with my experience working with real-time applications, AI-driven systems, and voice-based projects will be invaluable to your Offline Real-Time Voice Echo Pipeline project. My expertise in Python, which I have used for similar projects involving voice processing, aligns perfectly with what you need. I have demonstrated this in my work on the Axion VOIP Phone System from October 2018 to June 2025 as the Full Stack Developer and AI Expert. During my time at Axion VOIP, I developed scalable backend services for a full-stack VOIP system that included speech-to-text transcription. I used FastAPI and Core PHP while integrating AssemblyAI and Amazon Transcribe for AI-powered speech processing. This experience has equipped me with the in-depth knowledge and skillset needed to process microphone audio in real time, detect end-of-speech using VAD, and implement streaming STT and TTS tools, all within an offline environment. I'm also well-versed in modern CI/CD pipelines and automation using GitHub Actions and Docker. This skill can be applied to ensure smooth integration and deployment of the proposed script. With me, you can expect thorough documentation alongside clean and reliable architecture, keeping maintenance to a minimum. Let's build an efficient offline voice echo pipeline within your latency requirements together! Thanks!
$10 USD in 3 days
0.0

I work on projects where we help clients reach their goals or improve their online presence, focusing on creating efficient and user-friendly solutions that enhance interaction. We’ll help you build a fully local Python voice pipeline that runs smoothly on Windows, capturing real-time microphone audio, detecting speech boundaries with VAD, and streaming recognized text into TTS with minimal latency. I bring strong off-platform experience working with integrated voice technologies and real-time processing, ensuring a clean and seamless pipeline that meets your under-1-second latency requirement. I’d be glad to chat more about your specific needs and how to keep things buzzing efficiently—after all, who doesn’t love instant feedback? Let's have a chat, Alicia
$12 USD in 14 days
0.0

I've done related case projects for my bachelor's final assignment. I'm used to scraping and normalizing audio data for my ML model implementation.
$17 USD in 4 days
0.0

Hello, I am excited to apply for the Offline Real-Time Voice Echo Pipeline project. With extensive experience in Python and real-time audio processing, I can develop a local solution that meets your requirements. My approach includes utilizing Voice Activity Detection (VAD) for accurate end-of-speech detection and implementing a robust Speech-to-Text (STT) and Text-to-Speech (TTS) pipeline to ensure under 1-second latency. I am committed to delivering a seamless user experience and can provide ongoing support post-development. Let’s discuss how I can contribute to your project’s success! Regards,
$15 USD in 7 days
0.0

Hello. I can build a fully offline Python voice pipeline for Windows that captures microphone audio in real time. I’ll integrate VAD to detect end-of-speech, perform streaming STT, and immediately stream recognized text to TTS for echo playback. I have experience optimizing end-to-end latency to under 1 second using open-source libraries. The solution will run entirely locally, require no internet, and be stable on a Core i7 system. I’ll deliver a clean, documented Python script that’s easy to run and extend. Looking forward to hearing from you! Best regards.
$20 USD in 1 day
0.0

I specialize in building low-latency, fully offline voice pipelines in Python on Windows. I have hands-on experience with real-time audio processing, VAD, streaming STT, and TTS integration, focusing on performance optimization to achieve sub-1-second end-to-end latency. I deliver clean, well-documented code and ensure the system runs reliably on standard hardware like Core i7 machines.
$15 USD in 7 days
0.0

Hi, as a machine learning expert with expertise in deep learning, I am interested in this project. I am proficient in PyTorch and audio engineering and hope to complete this project. Looking forward to working on this project.
$20 USD in 1 day
0.0

Hi, I can build this real-time voice echo pipeline for Windows. I've worked with audio processing and NLP tools. Proposed stack for <1s latency:
• VAD: Silero VAD (lightweight, fast)
• STT: Faster-Whisper (optimized Whisper, streaming capable)
• TTS: Piper TTS or Coqui TTS (fast local inference)
• Audio: PyAudio for mic input, sounddevice for playback
All fully offline, no cloud APIs. Runs on Core i7 Windows. The pipeline: Mic → VAD (detect speech end) → STT (transcribe) → TTS (synthesize) → Speaker
Can deliver in 3-5 days with working Python script + setup instructions. Best, Shashank
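Whether a Mic → VAD → STT → TTS → Speaker chain like the one above stays under the 1-second target can only be settled by measurement on the target machine. The sketch below shows one way to time the stages with `time.perf_counter`. The `stt` and `tts` callables are hypothetical placeholders for whichever engines are chosen, and the 0.3 s VAD hangover is an assumed value. The stages are run back to back, so the total is a conservative figure for a pipeline that overlaps them.

```python
import time

LATENCY_BUDGET_S = 1.0  # end-to-end limit from the project brief


def timed(label, fn, arg, timings):
    """Call fn(arg), store the elapsed wall-clock seconds under label."""
    start = time.perf_counter()
    result = fn(arg)
    timings[label] = time.perf_counter() - start
    return result


def measure(audio, stt, tts, vad_hangover_s=0.3):
    """Return (per-stage timings, total seconds) for one utterance.

    vad_hangover_s is the silence the VAD waits for before declaring
    end-of-speech; it is counted as fixed waiting time (assumed value).
    """
    timings = {"vad_hangover": vad_hangover_s}
    text = timed("stt", stt, audio, timings)
    timed("tts", tts, text, timings)
    return timings, sum(timings.values())


def within_budget(total_s, budget_s=LATENCY_BUDGET_S):
    """True when the measured total fits inside the latency budget."""
    return total_s < budget_s


if __name__ == "__main__":
    # Placeholder engines so the harness runs without any models installed.
    stage_times, total = measure(b"\x00" * 320,
                                 stt=lambda audio: "hello",
                                 tts=lambda text: b"")
    for name, seconds in stage_times.items():
        print(f"{name}: {seconds * 1000:.1f} ms")
    print(f"total: {total * 1000:.1f} ms, within budget: {within_budget(total)}")
```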
$15 USD in 7 days
0.0

Hi, I can build a fully local, offline Python voice pipeline that runs on Windows and meets your low-latency requirement. How I’d implement it (under 1s end-to-end):
Audio capture: PyAudio / sounddevice for low-latency mic input
VAD: WebRTC VAD to reliably detect end-of-speech in real time
Streaming STT: Vosk or faster-whisper (offline, CPU-friendly on i7) with chunked/streaming recognition
TTS: Piper or Coqui TTS running locally for immediate echo playback
Pipeline design: Non-blocking threads/async queues so VAD → STT → TTS flows continuously without stalls
Feel free to message me
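One practical detail behind the WebRTC VAD step above: that VAD works on fixed frames of 10, 20 or 30 ms of 16-bit mono PCM, while audio callbacks may deliver chunks of other sizes. A small re-chunking buffer along these lines could sit between capture and VAD. This is a sketch only; the 16 kHz rate and 30 ms frame are assumed choices, and `FrameSplitter` is a hypothetical helper name.

```python
SAMPLE_RATE = 16000      # Hz (assumed; a rate WebRTC VAD supports)
FRAME_MS = 30            # frame duration in ms (10, 20 or 30 are accepted)
BYTES_PER_SAMPLE = 2     # 16-bit mono PCM
FRAME_BYTES = SAMPLE_RATE * FRAME_MS // 1000 * BYTES_PER_SAMPLE  # 960 bytes


class FrameSplitter:
    """Accumulates PCM chunks of any size and yields fixed-size frames."""

    def __init__(self, frame_bytes=FRAME_BYTES):
        self.frame_bytes = frame_bytes
        self._buffer = bytearray()

    def feed(self, chunk: bytes):
        """Generator: add chunk, then yield every complete frame available.

        Leftover bytes stay buffered until the next call. Because this is a
        generator, the caller must iterate over it for the chunk to be added.
        """
        self._buffer.extend(chunk)
        while len(self._buffer) >= self.frame_bytes:
            frame = bytes(self._buffer[: self.frame_bytes])
            del self._buffer[: self.frame_bytes]
            yield frame

    def pending(self) -> int:
        """Number of buffered bytes that do not yet form a full frame."""
        return len(self._buffer)


if __name__ == "__main__":
    splitter = FrameSplitter()
    for size in (1000, 1000, 880):
        frames = list(splitter.feed(b"\x00" * size))
        print(f"fed {size} bytes -> {len(frames)} frame(s), "
              f"{splitter.pending()} bytes pending")
```

Each yielded frame could then be passed to the VAD together with the sample rate.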
$15 USD in 7 days
0.0

Hi, I can build this offline voice pipeline in Python for Windows. It will run fully local and process microphone audio in real time. I’ll use VAD for end-of-speech detection, streaming STT, and immediate TTS echo playback. The focus will be sub-1-second end-to-end latency on an i7 system. I’ve worked on real-time audio pipelines before and know how to keep buffering and inference fast. You’ll get a clean, easy-to-run script using proven open-source tools. Delivery will take 3-5 days. The total cost will be $60 USD. Happy to start right away.
$60 USD in 3 days
0.0

NO SATISFACTION, NO PAYMENT. Failing to correctly implement a real-time, low-latency voice pipeline risks user frustration, lost productivity, and undermines the value of the solution. Our deep understanding of audio processing and real-time systems uniquely positions us to deliver a seamless experience that meets your stringent latency and offline requirements. We’ve successfully deployed similar offline voice pipelines beyond this platform, and as we build our platform presence, we offer you a strategically discounted rate that reflects our commitment to quality and long-term partnership. If you’d like to discuss your specific workflow or clarify any details, a quick reply is all that’s needed. Warm regards Liam Jasson
$10 USD in 14 days
0.0

Hi, I can build a fully local, sub-1s voice pipeline on Windows that does exactly what you described: real-time mic capture, VAD-based end-of-speech, streaming STT, and immediate TTS echo playback—no cloud, no internet. A proven stack here is WebRTC VAD for low-latency speech detection, Vosk (Kaldi) for streaming offline STT, and Coqui TTS or eSpeak NG for fast local synthesis, wired together with PyAudio/SoundDevice and a small ring-buffer architecture. This runs comfortably on a Core i7 and keeps end-to-end latency well under a second with proper chunking. Do you want partial (word-by-word) TTS echo, or only after final end-of-speech detection?
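The "small ring-buffer architecture" mentioned above is commonly used to keep a short pre-roll of audio, so the first syllable is not lost while the VAD is still deciding that speech has started. A minimal sketch with `collections.deque` follows. `PreRollBuffer` and `collect_utterance` are hypothetical names, string labels stand in for audio frames, and the utterance is closed at the first non-speech frame for brevity, where a real pipeline would wait out a hangover period.

```python
from collections import deque


class PreRollBuffer:
    """Fixed-size ring buffer holding the most recent frames."""

    def __init__(self, max_frames=10):
        self._ring = deque(maxlen=max_frames)

    def push(self, frame):
        self._ring.append(frame)  # the oldest frame is dropped when full

    def drain(self):
        """Return the buffered frames in arrival order and empty the buffer."""
        frames = list(self._ring)
        self._ring.clear()
        return frames


def collect_utterance(frames, is_speech, pre_roll=3):
    """Return the first utterance, prefixed with up to pre_roll earlier frames.

    Simplification: the utterance ends at the first non-speech frame.
    """
    ring = PreRollBuffer(pre_roll)
    utterance = []
    started = False
    for frame in frames:
        if not started:
            if is_speech(frame):
                started = True
                utterance = ring.drain()  # recover audio just before onset
                utterance.append(frame)
            else:
                ring.push(frame)
        elif is_speech(frame):
            utterance.append(frame)
        else:
            break
    return utterance


if __name__ == "__main__":
    # Lowercase labels play the role of silence, uppercase ones of speech.
    stream = ["s1", "s2", "s3", "s4", "A", "B", "s5", "C"]
    print(collect_utterance(stream, is_speech=str.isupper))
```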
$15 USD in 1 day
0.0

Texas, United States
Payment method verified
Member since Aug 9, 2019