Location: PAN India / Remote
Years of Experience: 8-12 Years
Data Pipeline & Platform Development
- Design, develop, and maintain scalable ETL/ELT data pipelines using AWS Glue, dbt, Airflow/Astronomer, and AWS HealthLake.
- Build and optimize data warehousing solutions using Snowflake, including schema design, performance tuning, and cost optimization.
- Knowledge of, or willingness to learn, Azure data pipelines.
Data Quality & Validation
- Implement and manage data validation frameworks (e.g., dbt tests, custom Python validations).
- Establish and automate data quality checks, anomaly detection, schema validation, and reconciliation processes.
- Ensure data accuracy, completeness, timeliness, and alignment with healthcare regulatory standards.
Data Governance & Compliance
- Work with HIPAA-compliant processes to ensure PHI/PII protection.
- Maintain metadata, lineage, and data documentation in alignment with governance best practices.