
Search by job, company or skills
Role: Data Lake architect
Location: Bangalore
Experience: 10 + years
Work Mode: Remote
JOB RESPONSIBILITIES:
Architect and manage Data Lake (Spark-based) platform.
Design and govern:
1. Data ingestion (batch and streaming)
2. Transformation and enrichment pipelines
3. Use Airflow (or equivalent) for orchestration and scheduling.
Integrate data from:
4. DMS and enterprise applications
5. Event streams (Kafka)
6. External and partner systems
Define data models and schemas optimized for:
7. Analytics & reporting
8. AI / ML use cases
9. Agentic workflows
10. Ensure data quality, lineage, performance, and scalability.
11. Work closely with Framework and Enterprise Architects to align platforms.
Required Skills & Experience
812 years in data engineering / data architecture.
Strong hands-on expertise in:
12. Spark-based data lakes
13. Airflow or similar orchestration tools
14. Large-scale ETL / ELT pipelines
15. Experience handling enterprise-scale, multi-source data
Job ID: 138124287