Bangalore : On Site
Experience: 4–5 Years
About Lyzr
Lyzr is revolutionizing enterprise AI adoption with its cutting-edge Agent API Studio and Universal Agents. As winners of the Accenture Ventures 2024 GenAI Challenge, Lyzr is leading the charge in AI-driven business transformation. Our solutions empower organizations to leverage intelligent automation and drive meaningful impact.
About The Role
We are building low-latency, high-reliability voice agents powered by the Lyzr platform. You will own the architecture and core systems that enable live voice conversations. Key focus ares will be on end-to-end latency, robustness, and scalability.
What You'll Do
- Architect and build the real-time voice pipeline
- Drive latency down across the stack
- Optimize LLM inference
- Collaborate with research and product on model selection and training/finetuning
- Ensure reliability and safety with guardrails
- Mentor engineers, set best practices for streaming service design, and contribute to technical roadmaps.
Minimum Qualifications
- 3+ years building production distributed systems with a focus on real-time, low-latency, or high-throughput services.
- Proficiency in Python and Go or Rust.
- Hands-on experience with streaming audio/video and protocols such as WebRTC, RTP/SRTP, Opus, or gRPC streaming.
- Experience with using different speech models and using LLMs
- Strong understanding of performance engineering: profiling, async I/O, batching, and cache design.
- Track record of shipping reliable services