
Search by job, company or skills
Job Title: AI Architect Vision Language Models (VLM) & Computer Vision
Company: People Tech Group
Location: Hyderabad
Employment Type: Initially it will be Contract
About People Tech Group
People Tech Group is a technology-first digital innovation company specializing in AI Engineering, Data Platforms, Cloud Modernization, Enterprise Applications, and Digital Product Development. We partner with global enterprises to build scalable, intelligent, and future-ready systems powered by next-gen AI and modern engineering.
Role Overview
We are looking for an AI Architect with strong expertise in Vision-Language Models (VLMs), Multimodal AI, and advanced Computer Vision to lead design and development of AI-driven solutions. This role requires a hands-on architect who can design, prototype, optimize, and deploy multimodal AI pipelines, while owning architecture decisions, performance tuning, and integration strategies.
Key Responsibilities:
Architect end-to-end multimodal AI solutions using Vision-Language Models (VLMs).
Build scalable architectures for image, video, OCR, object detection, and vision-language workflows.
Develop custom Computer Vision and VLM pipelines for real-time or batch inference.
Fine-tune transformer-based models using LoRA, QLoRA, and multi-GPU training.
Implement data pipelines for image/video datasets and evaluation frameworks.
Build high-quality POCs, prototypes, and production-grade AI solutions.
Evaluate and experiment with SOTA models in VLMs, diffusion models, and multimodal AI.
Required Skills:
Strong background in Computer Vision (object detection, segmentation, OCR, embedding models).
Hands-on experience with VLMs (LLaVA, GPT-4o, BLIP-2, PaLI, Kosmos, etc.).
Strong proficiency in Python, PyTorch, TensorFlow, OpenCV, HuggingFace.
Experience deploying models on cloud (Azure/AWS/GCP) with GPU acceleration.
Deep knowledge of transformer architectures and multimodal fusion.
Experience with MLOps, model optimization, and distributed training.
Why People Tech Group:
Opportunity to architect next-gen AI-first enterprise solutions.
Work with cutting-edge LLMs, VLMs, GenAI & CV models.
Fast-paced, innovation-led environment with global enterprise customers.
Consulting flexibility with high-impact delivery ownership.
Job ID: 133681675