
. Design and develop highly scalable, real-time systems using Hadoop ecosystem components (Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink, and NiFi).
. Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting for ingesting multimodal data (image, audio, video, unstructured documents) in both batch and real-time modes.
. Develop full-stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).
. Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).
Mandatory skills
. Hadoop ecosystem (Spark, Hive, Kafka, Flink, NiFi, Iceberg, Trino)
. Java, Python, Spark (batch & real-time processing)
. Data ingestion & transformation frameworks
. Performance tuning on Hadoop platforms
. Shell scripting
. Real-time data processing systems
. ML model operationalization (CML / Spark ML)
Job ID: 149190503
Skills:
CML, Java, Ranger, Hadoop, Kafka, React, Hive, Shell scripting, XGBoost, Flask, Python, LangChain, Hugging Face, Flink, Ozone, Iceberg, Spark MLlib, Trino, NiFi, Cloudera Machine Learning