
In Progress
Posted
Paid on delivery
We're building a screenless voice communication device for kids, built on ESP32. We have a working prototype that can make and receive calls. Looking for an embedded/firmware developer to help push the prototype further. What's working ESP32-based device making real calls via SIP Basic audio in/out Core call flow functional What we need help with Latency — end-to-end delay is too high. Get it under 150ms. Start with OPUS codec and jitter buffer tuning. Echo — calls have echo. Implement AEC (look at esp-sr, built into ESP-IDF). Contact storage — need to store a small list of names and numbers on-device via a config file (SPIFFS or NVS). On-device UI to scroll and select. ESP32-to-ESP32 calling — alongside calling real numbers, add support for device-to-device calls over SIP across different WiFi networks. This means setting up a SIP proxy (Kamailio) on a VPS and registering each device to it. Stack ESP32 (ESP-IDF) SIP / VoIP Ideal candidate Strong ESP32 / ESP-IDF experience Comfortable with SIP/VoIP Has done audio work on embedded hardware before
Project ID: 40373919
9 proposals
Remote project
Active 1 mo ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs

Hi I can help optimize and scale your ESP32-based voice device with a strong focus on low-latency audio and SIP reliability. I have solid experience with ESP32, ESP-IDF, and real-time embedded audio systems. For latency, I will optimize OPUS settings and fine-tune jitter buffers to achieve sub-150 ms delay. For echo, I will implement AEC using ESP-SR within ESP-IDF. I will design efficient contact storage using SPIFFS/NVS with a simple on-device UI flow. For device-to-device calling, I will configure SIP routing via Kamailio on a VPS. I have prior experience handling SIP registration, NAT traversal, and multi-device communication. The solution will be stable, scalable, and production-ready. Available to start immediately and iterate quickly on your prototype.
$240 USD in 5 days
6.2
6.2
9 freelancers are bidding on average $152 USD for this job

Hi, I’m an embedded systems engineer with 7+ years of experience in ESP32/ESP-IDF, VoIP, and real-time audio, and I’ve reduced latency by 45% (to <120 ms) while deploying SIP devices with stable AEC across 5k+ units. In my opinion, the best way to improve your device is to optimize the full audio chain, not just parts of it—by tuning OPUS, jitter buffer, and task timing together. I would also implement a clean AEC setup using ESP-SR with proper gain and buffering to eliminate echo reliably. This project is very similar to my previous work. I’ve built ESP32 SIP devices where I cut latency from ~250 ms to under 120 ms, and I’ve set up Kamailio-based systems for stable device-to-device calling across networks. The most important skill here is real-time audio optimization on limited hardware. I handle this by profiling each stage and tuning timing, buffers, and CPU usage carefully. ✅ So, I will divide your project into three major steps. 1️⃣ Optimize audio pipeline for low latency and stable streaming. 2️⃣ Implement and tune AEC for clear full-duplex audio. 3️⃣ Set up SIP (Kamailio) and add device-to-device calling with contacts. I will provide the best technical solution for your project. Best regards. Yaroslav
$240 USD in 7 days
4.9
4.9

Hello sir, Did go through your job description and glad to share that I have enormous experience in working with ESP32 SIP Voice Calling - Refinement I'm a seasoned programmer and Engineer with quality experience in Flutter, React, Node.JS, SpringBoot, Frontend and Backend Development, Python, Matlab, R studio, C, C++, C#, OpenCV, OpenGL, Tesseract OCR, google vision, Statistical programming/R progamming data analysis Computing for Data Analysis Time Series & Econometric, Machine learning, AI, Deep learning, Matlab and Mathematica, 3D modeling, CAD/CAM,AutoCAD, 2D, Architectural Engineering, SolidWorks, Unity 3D, PCB, Electronics, Arduino, Automation, Embedded and Firmware , IOT, Electrical/Mechanical Engineering I am a TOP Rated Freelancer, and you can check my reviews here as well: https://www.freelancer.com/u/mzdesmag. Looking forward to potentially working together on this project. Thanks and Best regards, Adekunle.
$30 USD in 1 day
4.7
4.7

Hey , I just went through the project description, and I see you are looking for someone experienced in Embedded Systems, Arduino, VoIP, WiFi, SIP, Audio Processing and Debugging. It instantly reminded me of a client who faced similar challenges, and I knew I had a tailor-made solution for it. Please review my profile to confirm that I have great experience working with these tech stacks. While I have few questions: • Is there anything else you’d like to add to the project details? • What’s the top hurdle you’re facing with this project? • What is the timeline to get this done? Why Choose Me? 250+ Projects. 5 Years. Zero Misses. My reputation is built on a single metric: Flawless Execution. While others promise quality, my last 100+ consecutive 5-star reviews prove it. I don’t just finish the job; I set the standard. Timings: 9am - 9pm Eastern Time (I work as a full time freelancer) The portfolio here is just the tip of the iceberg. To respect client confidentiality, my recent heavy-hitters aren't public, but I can share them 1-on-1. Click the 'CHAT' button, and I’ll send over the relevant samples immediately for your review. Regards, Abdul Haseeb Siddiqui.
$30 USD in 3 days
0.0
0.0

Hello, I hope you are doing well. I have strong experience with ESP32, ESP-IDF, and SIP-based voice systems, including embedded audio optimization. I can reduce latency below 150 milliseconds by tuning the OPUS codec and jitter buffer, implement acoustic echo cancellation using ESP-SR, and improve overall call quality. I will also set up secure contact storage using SPIFFS or NVS with a simple on-device selection flow, and configure a SIP proxy using Kamailio to enable reliable ESP32-to-ESP32 calling across networks. I am comfortable debugging real-time audio issues and optimizing embedded performance. I am ready to start immediately and refine your prototype into a stable, production-ready system. Thank you very much for your time and consideration.
$150 USD in 3 days
0.0
0.0

Achieving sub-150ms latency in your ESP32 device is feasible with optimized OPUS codec settings and precise jitter buffer adjustments. Given the necessity for Acoustic Echo Cancellation, integrating esp-sr from ESP-IDF will be pivotal in minimizing echo during calls. For contact storage, utilizing SPIFFS for a small config file will allow seamless retrieval and display. Additionally, implementing Kamailio as a SIP proxy will facilitate direct ESP32-to-ESP32 communication across varying WiFi networks. Expect initial deliverables in 30 days. Want me to sketch a quick action plan so you can see the approach?
$110 USD in 30 days
0.0
0.0

Hello, This is a very compelling product, and it’s great to see you already have real SIP calling working on ESP32—that’s a solid foundation. I’ve worked on ESP32 firmware, real-time audio systems, and VoIP integrations, including latency optimization and embedded audio processing, which are critical for a device like this. I’m comfortable working iteratively and collaboratively to refine this into a reliable, production-ready device. Best regards, Ljubinka
$100 USD in 4 days
0.0
0.0

Providence, United States
Payment method verified
Member since Mar 8, 2026
$30-250 USD
$30-250 USD
₹37500-75000 INR
₹600-1500 INR
₹750-1250 INR / hour
$10-30 USD
€250-750 EUR
$30-250 USD
$30-250 USD
$10-1000 USD
$10-30 USD
€6-12 EUR / hour
₹600-1500 INR
$10-30 USD
₹15000-25000 INR
₹1500-12500 INR
€250-750 EUR
₹1500-4000 INR
$250-750 USD
$30-250 AUD
$10-30 USD