
Search by job, company or skills
510 Years
We are seeking a Senior Databricks & Data Science Engineer with strong hands-on experience in building scalable data engineering, analytics, and machine learning solutions using Databricks on Azure and AWS. The role involves working with large-scale datasets, advanced analytics, and ML workflows while following Agile delivery practices and ITIL service management processes.
- Develop and maintain Databricks notebooks, workflows, and jobs
- Build ETL / ELT pipelines using Apache Spark, PySpark, Databricks SQL, and Delta Lake
- Ingest and process data from Azure Data Lake Gen2, AWS S3, relational databases, APIs, and streaming sources
- Optimize Spark workloads for performance, scalability, and cost
- Implement data validation, cleansing, and error-handling mechanisms
- Perform exploratory data analysis (EDA) using Databricks notebooks
- Perform feature engineering and feature selection
- Build, train, evaluate, and tune machine learning models
- Use Python libraries such as Pandas, NumPy, Scikit-learn, and Spark MLlib
- Track experiments and models using MLflow
- Work with Azure Databricks and AWS Databricks environments
- Integrate Databricks with Azure Data Lake, Azure Synapse, and Azure Key Vault
- Integrate Databricks with AWS S3, IAM roles, and CloudWatch
- Ensure secure data access and cloud-native authentication
- Support cloud cost optimization and performance monitoring
- Create and manage Databricks Jobs and schedules
- Monitor job execution, failures, retries, and SLA adherence
- Troubleshoot Spark errors, data quality issues, and pipeline failures
- Provide production support and ensure stability of data pipelines
- Work within Agile/Scrum teams, participating in sprint planning, stand-ups, reviews, and retrospectives
- Follow ITIL processes for Incident, Problem, Change, and Release Management
- Perform root cause analysis (RCA) for production incidents and drive preventive actions
- Ensure controlled releases and smooth promotion of data pipelines and ML models
- Databricks (Notebooks, Jobs, Workflows)
- Apache Spark, PySpark, Databricks SQL
- Delta Lake
- Python for data engineering and data science
- Machine learning fundamentals
- MLflow
- Azure and/or AWS cloud data services
- Git version control
- Delta Live Tables (DLT)
- Unity Catalog
- Spark Structured Streaming
- Advanced analytics and predictive modeling
Strong analytical and problem-solving skills, ability to work with business stakeholders, good communication skills, and strong documentation practices.
Tata Communications is a digital ecosystem enabler that powers today’s fast-growing digital economy. We enable the digital transformation of enterprises globally, including 300 of the Fortune 500. We carry around 30% of the world’s internet routes and connects businesses to 60% of the world’s cloud giants.
We have been a part of the rich heritage of the internet in India. Over the last 25 years, enterprise-enabled services have been essential to the adoption of digital services in the country. Connectivity is an essential fabric of sustenance for the economy. We are committed to enabling Industry leaders in this New World of Communications™, with our unique promise of delivering secure connected digital experiences.
In 2020, we announced the launch of ‘Secure Connected Digital Experience’ (SCDx), a proposition intended to meet this growing, worldwide demand for new ways of operating, which includes far higher levels of working from home, rising security risks, a shift to digital commerce, and more contactless experiences. It will help companies currently relying on short-term fixes by providing holistic, secure, enterprise-level digital solutions that address current challenges and are fit for the long term.
Job ID: 148648119
We don’t charge any money for job offers