Search by job, company or skills

KPG99 INC

LLM Engineer

Fresher
new job description bg glownew job description bg glownew job description bg svg
  • Posted 3 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Job Title:LLM Engineer

Location: Remote (Offshore)

Contract Type: Contract

Job Description

Role Overview

We are seeking a skilled LLM Engineer proficient in Python programming and experienced in developing, deploying, and optimizing large language models (LLMs). The ideal candidate will have hands-on experience with FastAPI or Flask frameworks, Lang Chain implementation, and building Retrieval-Augmented Generation (RAG) pipelines. You will play a key role in integrating cutting-edge AI technologies to solve complex business problems, focusing on vector stores and retrievers while deploying scalable solutions on AWS.

Key Responsibilities

Python Development:

  1. Design, develop, and maintain scalable web services using FastAPI or Flask frameworks.
  2. Write efficient, reusable, and modular Python code to support API-driven LLM applications.
  3. Lang Chain & Supporting Frameworks:
  4. Implement Lang Chain to build custom pipelines for document indexing, retrieval, and summarization.
  5. Integrate Lang Chain's RAG capabilities with other components like vector stores and retrievers to support real-time querying and document processing.
  6. RAG Pipelines:
  7. Architect and deploy Retrieval-Augmented Generation (RAG) systems for chatbots, knowledge systems, and other generative AI applications.
  8. Optimize RAG systems for speed, accuracy, and scalability across multiple use cases.
  9. Vector Stores & Retrievers:
  10. Work with vector databases like Pinecone, Chroma, FAISS, or Milvus to store and manage embeddings.
  11. Implement retrievers and re-rankers to improve query efficiency, ensuring high-quality and relevant outputs for users.
  12. AWS Cloud Deployment:
  13. Deploy and manage LLM-based applications on AWS, leveraging services such as Lambda, EC2, S3, EKS, and RDS.
  14. Ensure the scalability, availability, and reliability of deployed applications.
  15. Dashboards and Monitoring (Optional):
  16. Create monitoring dashboards using tools like Grafana or Tableau for real-time system monitoring, analytics, and performance insights.
  17. Experimentation with Generative AI:
  18. Research and integrate the latest advancements in generative AI technologies.
  19. Experiment with fine-tuning and adapting large language models (like GPT, BERT) for new, innovative use cases.

Required Technical Skills

  • Python proficiency, especially with web frameworks like FastAPI or Flask.
  • Strong experience with Lang Chain and associated libraries.
  • Proven expertise in building and optimizing RAG pipelines.
  • Proficiency in using vector databases (e.g., Pinecone, FAISS).
  • Experience with retrievers and re-rankers.
  • Solid understanding of AWS services (Lambda, EC2, RDS, etc.).
  • Knowledge of SQL and NoSQL databases.
  • Familiarity with dashboarding tools such as Grafana and Tableau.

Soft Skills

  • Problem-solving: Ability to handle complex and dynamic challenges with AI solutions.
  • Collaboration: Experience working in multidisciplinary teams (data scientists, DevOps, etc.).
  • Adaptability: Eagerness and passion to keep up with the latest AI advancements and incorporate them into solutions.

Communication: Excellent verbal and written communication skills to convey technical information to both technical and non-technical stakeholders

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 144653057