Role Overview
We are seeking a Senior AI Engineer with strong expertise in Prompt Engineering, LLM fine-tuning, and Small Language Model (SLM) development to design, train, optimize, and deploy domain-specialised language models.
A key focus of this role will be engineering high-performance prompts for 8B-class models (such as LLaMA, Mistral, and Qwen) and transitioning these prompts into fine-tuned models for production reliability.
You will design prompt architectures, instruction schemas, and evaluation pipelines that ensure models produce accurate, structured, and deterministic outputs suitable for enterprise automation workflows.
Key Responsibilities
- Design production-grade prompt architectures for 8B-class models.
- Develop structured prompts for enterprise tasks such as classification, extraction, reasoning, and summarization.
- Optimize prompts for accuracy, latency, and cost efficiency.
- Build prompt evaluation frameworks to measure accuracy, hallucination rates, and consistency.
- Design reusable prompt libraries and prompt templates for enterprise workflows.
- Develop prompt-to-model migration strategies converting high-performing prompts into fine-tuned SLMs.
- Design and fine-tune LLMs for domain-specific enterprise tasks.
- Develop Small Language Models (SLMs) optimized for enterprise deployment.
- Build instruction tuning and supervised fine-tuning (SFT) pipelines.
- Design evaluation datasets and automated benchmarking frameworks.
- Implement retrieval augmented generation (RAG) pipelines and tool-augmented workflows.
- Collaborate with speech AI and document AI teams to build multimodal systems.
- Deploy models in private cloud or on-premise environments with strong security controls.
Required Qualifications
Education
Master's degree or PhD in Computer Science, AI, Machine Learning, or a related field.
Experience & Technical Skills
- Strong expertise in Prompt Engineering for 7B13B models (especially 8B models).
- Experience designing prompts for structured enterprise outputs.
- Experience building prompt evaluation datasets and benchmarking frameworks.
- Ability to convert prompt workflows into fine-tuned models.
- 46 years of experience in ML/NLP with 3+ years focused on LLMs or foundation models.
- Hands-on experience fine-tuning open-source models such as LLaMA, Mistral, Falcon, or Qwen.
- Experience with LoRA, QLoRA, adapters, and model distillation techniques.
- Strong understanding of transformers, tokenization, embeddings, and attention mechanisms.
- Strong Python engineering skills and experience with PyTorch.
AI Platform & Infrastructure
- Experience with GPU-based training and inference.
- Familiarity with Hugging Face, Accelerate, DeepSpeed, and Triton.
- Experience with vector databases and RAG architectures.
- Experience deploying models using Docker, Kubernetes, and cloud platforms.
Compliance & Enterprise Readiness
- Experience working in regulated environments.
- Understanding of data privacy, access controls, and AI auditability.
- Ability to design AI guardrails and human-in-the-loop workflows.
Nice to Have
- Experience applying LLMs in healthcare, insurance, or financial services.
- Exposure to speech-to-text or document AI pipelines.
- Experience building agentic AI systems.
- Experience optimizing models for low latency enterprise workloads.
Why Join Contiinex
- Build specialised enterprise-grade language models.
- Work on real-world automation problems beyond generic chatbots.
- Help shape next-generation AI systems for regulated industries.
- Collaborate directly with deep-tech leadership and domain experts.