
Search by job, company or skills
Why Mizuho
At Mizuho, we provide the stability of an international industry leader with the career trajectory of a growing business. Our steady, strategic growth gives our people at all levels rewarding degrees of responsibility and richer work experience than a boutique firm or an established giant could offer alone.
It's the local expertise of our employees that makes our global network so powerful. By collaborating with colleagues and clients who have the same ambition and drive, you can amplify your sphere of influence and base of knowledge as part of one of the largest—and growing—banks in the world
Cloud Data Engineer will lead and support end-to-end data engineering initiatives with a strong focus on cloud data integration and enterprise data warehouse solutions. This role encompasses data design, architecture, development, testing, deployment, and production support.
The ideal candidate will bring deep expertise in building scalable data pipelines and integration frameworks using PySpark, Python, SQL, and Airflow, along with strong experience in data warehousing and cloud-based architectures.
Key Responsibilities:
• Design, develop, and optimize scalable data pipelines on Databricks using PySpark for large-scale data processing and analytics.
• Lead cloud data integration initiatives, enabling ingestion and transformation of structured and semi-structured data across enterprise systems (batch and streaming).
• Build and enhance enterprise data warehouse solutions, including dimensional modeling (facts, dimensions) and curated data layers.
• Drive Gold-layer transformations to deliver high-quality, business-ready datasets for analytics and reporting.
• Collaborate with business stakeholders, data scientists, and analysts to translate requirements into scalable data solutions.
• Establish engineering best practices, including CI/CD pipelines, testing frameworks, and code quality standards.
• Ensure data quality, governance, and regulatory compliance across data pipelines.
• Lead performance optimization efforts for Spark jobs and SQL queries for scalability and cost efficiency.
• Drive architectural decisions for cloud data platforms, integration patterns (batch/streaming), and data warehouse evolution (Bronze/Silver/Gold layers).
Requirements:
Core Competencies:
Preferred Qualifications:
Organization Overview:
Mizuho Global Services (MGS), Pune is an integral part of Mizuho Financial Group, one of the world's leading financial institutions with a strong global presence across the Americas, EMEA, and Asia. Based in India, MGS Pune supports Mizuho's international businesses by delivering high-quality, scalable, and resilient services across multiple functions.
MGS Pune plays a critical role in driving operational excellence, standardization, and innovation for Mizuho Americas. By combining deep domain expertise with strong process, technology, and analytical capabilities, it partners closely with regional and global teams to support corporate and investment banking, capital markets, and corporate services functions, while adhering to the highest standards of risk management, regulatory compliance, and control.
MGS Pune offers competitive compensation and benefits package aligned with industry standards and local market practices.
MGS Pune is an equal opportunity employer and is committed to fostering an inclusive and diverse workplace.
Employment is subject to applicable background verification checks in accordance with Indian laws and company policies.
https://www.mizuhogroup.com/asia-pacific/mizuho-global-services/careers
Job ID: 149110267
Skills:
snowflake , Sql, Pl-sql, Aws S3, AWS Glue, API-based data integration, Data warehousing concepts and architecture, ETL ELT design patterns
Skills:
snowflake , Spark, Sql, S3, Apache Airflow, Aws Services, Emr, AWS Glue, Windows, Lambda, Yarn, Hadoop, Pyspark, Linux, Hive, Python, Redshift, Ec2, Scala, AWS Aurora, Shell Scripts, Athena, Step Functions, Singlestore
Skills:
Azure Sql, Azure Data Factory, Azure Synapse, Etl, Power Bi
Skills:
Data Engineer, Data Warehousing, Big Data, Cloud Technologies, Docker, Kubernetes, DataFlow, Airflow, Analytics, Business Intelligence
Skills:
Java, Hadoop, PostgreSQL, Prometheus, SQL Server, Kafka, Node.js, Sql, Datadog, ELT, Numpy, Pandas, Cloudwatch, Gcp, Terraform, MySQL, Ansible, Spark, Oracle, Azure, Python, AWS, Etl
We don’t charge any money for job offers