
Search by job, company or skills

Recruiters please do not reach out to me, we want to hire this role directly.
#jobvacancy #jobopening #jobs #bangalore DataFrontier Innovations pvt ltd
Data Scientist (Early Career) – Healthcare AI & Agentic Systems
Location: Bangalore (Hybrid)
Experience: 2–4 years
Company: DataFrontier Innovations Pvt. Ltd.
Email: [Confidential Information]
About DataFrontier
DataFrontier builds advanced AI, data engineering, and analytics systems that solve real-world, high-impact problems — especially in healthcare and enterprise environments.
We are currently working on:
• Falls prediction systems for elderly care using clinical and behavioral data
• Agentic AI systems that automate workflows and decision-making
• Production-grade data platforms handling sensitive, regulated datasets
If you want to build systems that actually get used — not just models that sit in notebooks — this role is for you.
Role Overview
We are looking for a hands-on Data Scientist who can work across the full lifecycle — from data understanding to model deployment — and is excited about applying AI in healthcare and automation.
This is not a pure research role. You will be building, shipping, and improving real systems.
What You'll Work On
• Build and improve predictive models for falls risk using clinical, behavioral, and time-series data
• Design features from messy real-world healthcare datasets (EHR, sensor data, logs)
• Work on Agentic AI pipelines (LLMs + tools + workflows) to automate decision systems
• Develop and test models like XGBoost, Random Forest, time-series models, and hybrid approaches
• Collaborate with data engineers to build robust pipelines and feature stores
• Implement model explainability (SHAP, feature importance) for clinical usability
• Evaluate data quality and completeness for new customers (critical for healthcare deployments)
• Work closely with product and clients to translate business problems into ML solutions
Required Skills
• Strong fundamentals in Python (pandas, numpy, scikit-learn)
• Experience with ML models (classification, regression, tree-based models)
• Solid understanding of feature engineering and data preprocessing
• Familiarity with SQL and working with structured datasets
• Basic understanding of model evaluation metrics and validation techniques
• Exposure to real-world datasets (not just Kaggle-level clean data)
Good to Have (High Impact)
• Experience with healthcare data / EHR / clinical datasets
• Exposure to LLMs / LangChain / agentic frameworks
• Knowledge of time-series modeling
• Experience with cloud platforms (AWS / GCP)
• Understanding of data pipelines / ETL workflows
• Familiarity with model deployment or APIs
What We're Looking For
• Someone who can think, not just code
• Comfortable working with unclean, incomplete, real-world data
• Willing to own problems end-to-end, not just tasks
• Curious about AI beyond standard ML — especially agentic systems
• Strong communication skills — ability to explain models to non-technical stakeholders
What This Role Is NOT
• Not a pure research role
• Not a train model and forget role
• Not limited to notebooks — you will work on production systems
Why Join Us
• Work on real healthcare impact problems
• Exposure to international clients and deployments
• Build next-gen systems in Agentic AI + predictive analytics
• Small team → high ownership and fast growth
How to Apply
Send your resume + 2–3 projects (GitHub / case studies) to:
[HIDDEN TEXT]
Subject: Data Scientist – Early Career Application
If you're looking for comfort, this role is not for you.
If you're looking to build something meaningful, let's talk.
Job ID: 148903947
Skills:
Pyspark, Power Bi, SAS, Microsoft Office, Sql, Multivariate Analysis, XGBoost, Clustering, Python, Random Forests, SVMs, ML models, Light GBM, dimensionality reduction
Skills:
Tensorflow, Nlp, Pytorch, Docker, Python, Azure ML, Kubernetes, vector search, SAM, Pinecone, CI CD, Whisper, cloud ML stacks, Vertex AI, video transformers, LLaVA, LangChain, multimodal LLM frameworks, ViT, speech models, BLIP, SageMaker, Cv, FAISS, Weaviate, LlamaIndex
Skills:
, Sql, Github, Hadoop, Clustering, Python, Big Data, Hypothesis Testing, Spark, Generative AI, experimentation, Time Series Forecasting, RAG Prompt Engineering, Classification, Traditional ML, Agentic AI, Regression
Skills:
Matplotlib, Machine Learning, Tableau, Sql, Nosql, Tensorflow, Pytorch, Data Visualization, Seaborn, Python, T5, Generative AI, Hugging Face Transformers, GPT, R, Statistics, BERT
Skills:
Machine Learning, Python Programming, Deep Learning, ML Ops, Generative AI, Multi model Agent LLMs, Azure AI foundry services
We don’t charge any money for job offers