A leading provider of AI training data and annotation services operating in the Artificial Intelligence & Machine Learning data-labeling sector. We prepare high-quality ground-truth datasets for NLP, computer vision, and speech models that power enterprise and research-grade AI systems.
We are hiring remote Data Annotators across India to join a distributed team delivering annotation at scale with strict quality and turnaround SLAs.
Primary title: Data Labeling Specialist
Role & Responsibilities
- Label, tag, and annotate text, image, and audio data to project-specific guidelines with high accuracy.
- Create bounding boxes, segmentation masks, and fine-grained labels for computer vision datasets.
- Follow annotation taxonomies and updated guidelines; document ambiguities and propose clarifications.
- Perform quality checks, resolve feedback from QA leads, and maintain target inter-annotator agreement (IAA).
- Use annotation platforms (Labelbox, CVAT, Doccano) efficiently and meet daily productivity and SLA targets.
- Collaborate with project managers to estimate effort, flag issues, and suggest improvements to annotation workflows.
Skills & Qualifications Must-Have
- Master's degree in Computer Science, Linguistics, Cognitive Science, Statistics, or related field.
- Prior experience in data annotation or data labeling for NLP, CV, or speech projects.
- Hands-on familiarity with Labelbox, CVAT, or Doccano (or equivalent annotation tools).
- Practical knowledge of bounding boxes and segmentation workflows for image/video data.
- Strong written English for following guidelines and documenting edge cases.
- Reliable remote work setup (high-speed internet, quiet workspace) and ability to meet deadlines.
Preferred
- Experience with Amazon SageMaker Ground Truth or other managed annotation platforms.
- Understanding of inter-annotator agreement metrics and QA processes.
- Basic scripting ability (Python) to assist with small data-prep or validation tasks.
Benefits & Culture Highlights
- Remote-first, flexible hours with project-based and performance incentives.
- Opportunity to work on cutting-edge AI training datasets across NLP, CV, and speech domains.
- Mentorship, clear career paths into QA lead or data operations roles, and skills development.
This role is best suited for detail-oriented candidates comfortable working in structured annotation environments and delivering consistent, high-quality labels that directly improve ML model performance. Applicants based in India only fully remote.
Skills: project,strong english communication skills,content writer,seo,datasets,data,speech,annotation