Senior Python Data Engineer (Pandas, NumPy)

Volga Infotech

Mumbai, India

2-4 Years

Save

Posted 19 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Role Overview

We are seeking a Senior Python Data Engineer with exceptional hands-on expertise in Python-based data processing, Pandas, NumPy, and data modeling.

This role is centered around building high-quality data transformation pipelines and designing robust data models that turn raw data into reliable, analytics-ready datasets.

This is a hands-on, individual contributor role for someone who enjoys working deeply with data and writing clean, efficient Python code.

Key Responsibilities

Design and implement data models including analytical, reporting, and transformation-ready schemas.
Build and maintain data transformation pipelines using Python (Pandas, NumPy).
Transform raw, semi-structured, and structured data into clean, validated, analytics-ready datasets.
Apply best practices for data normalization, denormalization, and aggregation.
Optimize Python data workflows for performance, memory usage, and scalability.
Implement data validation, reconciliation, and quality checks within transformation pipelines.
Collaborate with analytics, product, and business teams to understand data requirements.
Write clean, maintainable, well-documented Python code.
Review and improve existing transformation logic and data models.

Required Skills & Experience

Strong expertise in Python with heavy, daily use.
Deep hands-on experience with Pandas and NumPy (non-negotiable).
Strong understanding of data modeling concepts:
Fact & dimension modeling
Star / Snowflake schemas
Analytical vs transactional models
Solid experience in data transformation, data wrangling, and feature preparation.
Good proficiency in SQL and relational databases.
Experience working with large datasets and complex joins/aggregations.
Strong problem-solving skills and attention to detail.

Nice to Have

Experience building custom ETL / ELT pipelines in Python.
Exposure to data quality frameworks or validation techniques.
Familiarity with Git and collaborative development workflows.
Experience supporting BI, reporting, or analytics teams.
Understanding of performance optimization techniques in Pandas (vectorization, chunking, memory tuning).

Experience

2 to 4 years of relevant experience in Python data engineering, analytics engineering, or data processing roles.