

Backend Engineer – Real-Time Audio & Voice Systems
Role Overview
We are seeking a Senior Backend Engineer – Real-Time Audio & Voice Systems to design, build, and operate low-latency, highly reliable voice and audio pipelines that power Interactly.ai's conversational AI platform. This role requires deep hands-on experience with real-time audio streaming, speech-to-text (STT), text-to-speech (TTS), and telephony integrations, using Python and Node.js in production environments.
You will play a critical role in building scalable, fault-tolerant systems for high-volume voice interactions.
Key Responsibilities
● Design, develop, and operate real-time audio processing pipelines for conversational voice agents.
● Build and maintain backend services using Python (FastAPI/async frameworks) and Node.js for low-latency workflows.
● Implement bi-directional audio streaming using WebSockets, WebRTC, or similar protocols.
● Integrate and optimize Speech-to-Text (STT) and Text-to-Speech (TTS) engines (cloud-based or self-hosted).
● Build and maintain telephony integrations (inbound/outbound calling, call routing, call recording, DTMF, conferencing).
● Optimize audio latency, jitter, and reliability across distributed systems.
● Handle high-concurrency workloads and real-time session state management.
● Implement monitoring, logging, and alerting for voice pipelines and call flows.
● Collaborate with AI/LLM teams to integrate speech pipelines with conversational logic.
● Own production deployments, on-call rotations, and incident response for voice systems.
Required Qualifications
● 4+ years of backend engineering experience, with a strong focus on real-time systems.
● Strong hands-on experience with Python and Node.js in production environments.
● Proven experience building real-time audio or voice applications.
● Deep understanding of STT and TTS pipelines, audio codecs, and streaming concepts.
● Solid experience integrating telephony platforms (inbound/outbound calls, SIP, PSTN).
● Experience with WebSockets, WebRTC, or RTP-based streaming.
● Strong understanding of asynchronous programming and event-driven architectures.
● Experience with Docker and CI/CD pipelines.
● Ability to debug and optimize distributed, latency-sensitive systems.
Nice-to-Have Skills
● Experience with LLM-powered voice assistants and conversational AI systems.
● Familiarity with audio codecs (Opus, PCM, μ-law) and sampling strategies.
● Experience with cloud infrastructure (AWS/GCP) and autoscaling real-time services.
● Exposure to message queues, Redis, or real-time state stores.
● Prior experience in healthcare, contact centers, or telecom domains.
Job ID: 148920689