Job Description
MLE Responsibilities
Design, implement, and maintain back-end API endpoints to support Gen AI and RAG (Retrieval-Augmented Generation) applications.
Manage and optimize chat history storage in DynamoDB, ensuring scalability and efficient retrieval.
Develop and test integrations with front-end UI components for seamless user experience. (React and others)
Update and refine back-end APIs to align with evolving product requirements and deployment architecture.
Build and maintain retrieval pipelines leveraging vector databases for RAG.
Implement document ingestion pipelines, including text preprocessing, embeddings generation, and indexing.
Ensure system performance, scalability, and reliability, including scalability and consistency testing across enterprise users.
Collaborate with data scientists, product managers, and front-end engineers to deliver end-to-end AI-powered solutions.
Write unit, integration, load, and consistency tests to validate system robustness under enterprise-scale usage.
Deploy and maintain applications on cloud platforms (AWS preferred) using services like Lambda, ECS, S3, DynamoDB, and API Gateway.
Implement observability and monitoring for model and API health..