Opportunity for MLOPS Engineer: Serve a wav2vec Speech Recognition Model through Triton Server

Job Description:

We are looking for a talented MLOPS engineer to work on a challenging speech recognition project. The project has a tight deadline of 5 days. The tasks involved are:

Based on the wav2vec2 model available in the repository lgris/wav2vec2-large-xlsr-open-brazilian-portuguese-v2, convert it to ONNX and TensorRT

Evaluate the WER of the model in TensorRT compared to the original model in Hugginface

Create a Dockerfile with the Triton server configured with an endpoint to consume the model in TensorRT

Create a Dockerfile with a Python server using [login to view URL] to send audio to the Triton server for inference

Create a Dockerfile with a JavaScript client sending audio from the microphone to the Python server, from Python to the Triton server through GRPC, and back to the browser with the transcription

Create a Docker Compose file with the three services communicating with each other and ready for testing

Compare the inference times of the PyTorch model served directly from Python, the TensorRT model served directly from Python, and the model served through the TensorRT server

Evaluate the latency of the communication between the Python server and the TensorRT server

The goal is to perform audio inference captured from the user's microphone in browser through [login to view URL] communication with the Python server and then from this to the Triton server to be able to receive multiple concurrent requests from different users

Attention should be paid in the Python server to have a session for each user, so that the streaming audio can be returned to the user who sent the audio.

If you have the skills and experience to tackle this project, we would love to hear from you. Please apply with your portfolio and relevant experience. Time is of the essence, so apply as soon as possible.

Taidot: Python, Tietojärjestelmäarkkitehtuuri, JavaScript, NLP, DevOps

Tietoa asiakkaasta:
( 0 arvostelua ) Petrópolis, Brazil

Projektin tunnus: #35917970

14 freelanceria on tarjonnut keskimäärin $650 tähän työhön


Hello Good evening , I just finished reading the job description . I see you are looking for someone experienced in developing products using NLP, Python, DevOps, Software Architecture and JavaScript. This is something Lisää

$750 USD 18 päivässä
(116 arvostelua)

Nice to talk you felipeniren, After reading in detail the requirements of your project and concluding that they match my areas of knowledge and skills, I would like to introduce myself. My name is Anthony Muñoz and I Lisää

$624 USD 7 päivässä
(5 arvostelua)

Hello, I read your project details and really interested in your mentioned job. I have 5+ years’ experience doing similar jobs related to these skills NLP, Python, DevOps, Software Architecture and JavaScript. I think Lisää

$750 USD 6 päivässä
(22 arvostelua)

Hello. As a Professional NLP Engineer, I have strong knowledge and rich experience with Python, Pytorch, Tensorflow, NLP, ChatBot, OpenAI ChatGPT, Fine-tuning the OpenAI API model, ASR(Automatic Speech Recognition usin Lisää

$500 USD 7 päivässä
(0 arvostelua)

Offer! Offer! Offer! Greetings and well wishes! Welcome to my profile, where quality and client satisfaction are of paramount importance. I have worked with many clients in other platforms across the world and whom I w Lisää

$250 USD 1 päivässä
(0 arvostelua)