  • Posted 18 days ago

Job Description

About the Role

Build and own the complete data infrastructure for TourIQ: ETL pipelines, third-party platform integrations, external data ingestion, ML feature preparation, and time-series data management. You ensure clean, timely data flows from every source into the platform and ML models.

Key Responsibilities

Integration Pipelines

  • Build and maintain connectors to third-party platforms: bi-directional data sync for products, schedules, and transactions
  • Implement configurable sync frequencies and adaptive polling
  • Build a reusable connector architecture so new integrations can be added quickly
  • Handle API failures gracefully: rate limiting, exponential backoff, and error tracking
  • Map fields between source and target schemas
  • Document all data schemas, field mappings, and pipeline designs

External Data

  • Integrate multiple industry-relevant external data sources (weather, events, demand signals, and others)

ML Data Infrastructure

  • Compute and maintain ML features across multiple signal categories
  • Manage time-series data with proper retention and compression policies
  • Build data pipelines supporting new customer onboarding scenarios
  • Schedule feature computation and data quality jobs

Data Quality & Operations

  • Monitor data quality and alert on issues; manage connections with error tracking
  • Run database backups, recovery testing, and zero-downtime migrations; assist with RDS management

Must-Have

  • 2+ years in Data Engineering or data-heavy backend roles
  • Strong Python for ETL/ELT pipelines, including PySpark for large-scale processing; solid PostgreSQL
  • REST APIs, webhooks, event-driven data ingestion; data modeling and quality practices
  • API failure handling (rate limits, retries, partial data)
  • Preparing data for ML training: feature engineering, validation, train/test splits

Good to Have

  • TimescaleDB; travel/hospitality platforms; task schedulers; Alembic; Kubernetes basics

Mindset

Owns data reliability. Meticulous about quality and edge cases. Comfortable working closely with ML engineers. Startup ownership.

Why Join

Own core data infrastructure from day one. Build pipelines powering ML-driven pricing. High ownership in an early-stage B2B startup. Onsite in Jaipur.

More Info

Job ID: 142408963
