Sr Principal Cybersecurity Architect

Providence India

India

7-10 Years

Posted 5 days ago

Job Description

Position Overview

Providence is seeking an exceptional Principal Cybersecurity Architect to lead the research, development, and deployment of Small Language Models (SLMs) specialized for healthcare and cybersecurity applications. This role combines cutting-edge machine learning research with practical implementation, requiring deep expertise in model architecture design, advanced fine-tuning techniques, graph-based knowledge systems, and vector-based retrieval augmentation.

The ideal candidate will architect and train efficient, domain-specialized language models using state-of-the-art techniques including LoRA/QLoRA, graph neural networks, vector embeddings, and hybrid retrieval systems. This position requires both depth in machine learning and hands-on implementation skills to deliver high-impact solutions for healthcare operations, clinical workflows, and security applications.

Key Responsibilities

Design, develop, and optimize Small Language Models for healthcare and cybersecurity domains using advanced architectures (efficient transformers, MoE, sparse attention), compression techniques (quantization: INT8/INT4/GPTQ/AWQ, pruning, knowledge distillation), and emerging SLM architectures (Phi-3, Gemma, Mistral 7B variants) with rigorous ablation studies balancing model size, inference speed, accuracy, and resource requirements

Implement parameter-efficient fine-tuning (PEFT) techniques including LoRA, QLoRA, prefix tuning, prompt tuning, and adapter layers; design instruction tuning pipelines aligning SLMs with clinical terminology and security protocols; develop multi-task learning frameworks; and deploy continuous learning strategies using DPO, RLHF, and active learning while optimizing hyperparameters to prevent overfitting on limited healthcare datasets

Implement knowledge graphs capturing domain relationships and build graph-augmented language models integrating Neo4j/Cosmos DB with unstructured text processing, leveraging GNN components (PyTorch Geometric, DGL, GraphSAGE) for entity relationship modeling and complex graph traversal reasoning

Create entity extraction and relation classification pipelines that automatically populate and maintain knowledge graphs from clinical documents, security logs, and operational data to enable graph-based reasoning across patient journeys, care coordination workflows, and security incident chains

Optimize model inference using vLLM, TensorRT-LLM, llama.cpp, GGML, and Optimum for maximum throughput and minimum latency; implement scalable model serving architectures (Triton, TorchServe, FastAPI) with batching, caching, and load balancing; design A/B testing frameworks and champion-challenger patterns for safe production model updates

Design and implement vector database architectures for semantic search and RAG applications using specialized embeddings optimized for cyber text, with hybrid search combining dense vector retrieval, sparse BM25 matching, and graph-based context expansion

Build advanced RAG architectures implementing multi-vector retrieval, reranking strategies (Cohere, Cross-Encoders), context compression, and query decomposition; optimize vector indexing (HNSW, IVF, Product Quantization) balancing accuracy, latency, and memory footprint; and develop chunking pipelines for diverse cyber data formats

Experience in end-to-end data pipelines for cyber text corpora processing, implementing data quality frameworks through de-identification and synthetic data generation (Synthea), bias detection, and representativeness metrics while building experiment tracking systems (MLflow) for versioning and reproducibility

Develop distributed training workflows using DeepSpeed, FSDP, or Megatron-LM for efficient multi-GPU/multi-node training on Azure ML; create automated data augmentation pipelines (back-translation, paraphrasing, entity substitution, synthetic generation) to expand limited healthcare training datasets while maintaining clinical validity

Design comprehensive evaluation frameworks measuring healthcare-specific task performance (NER, relation extraction, summarization, QA, reasoning) with custom benchmark datasets aligned to clinical accuracy requirements, regulatory compliance, and operational impact; implement automated robustness testing for adversarial attacks, edge cases, OOD detection, and hallucination detection

Conduct comparative analysis of SLM variants across accuracy, latency, throughput, memory, and cost metrics; build interpretability tools using attention visualization, SHAP values, and integrated gradients to ensure clinical safety and explainability for healthcare stakeholders and regulatory requirements

Understand monitoring and observability dashboards tracking model performance drift, data distribution shifts, embedding quality degradation, and retrieval accuracy in production; collaborate with MLOps teams on containerization (Docker), Kubernetes/AKS deployment, and CI/CD pipelines for automated testing and deployment workflows

Build security-focused SLMs for threat detection, log analysis, vulnerability assessment, incident response automation, and policy compliance checking

Partner with clinical stakeholders, security teams, and product managers to identify high-impact use cases and translate domain requirements into technical specifications; mentor junior data scientists on SLM development, fine-tuning techniques, and healthcare AI; contribute to knowledge sharing through documentation, tech talks, and research communities; stay current with emerging research through literature review, conference participation, and experimentation

Required Qualifications

Education & Experience

7-10 years of experience in data science, machine learning research, or AI engineering with a demonstrated track record of developing and deploying language models

3+ years hands-on experience with transformer architectures, fine-tuning methodologies, and production NLP system development

Proven experience applying machine learning in healthcare, life sciences, or highly regulated industries with understanding of domain-specific challenges

Core Technical Skills

Expert-level proficiency in Python and core ML libraries including PyTorch, Hugging Face Transformers, PEFT, TRL (Transformer Reinforcement Learning), and BitsAndBytes

Deep hands-on experience with parameter-efficient fine-tuning: LoRA, QLoRA, AdaLoRA, prefix tuning, prompt tuning, P-Tuning, IA3, and adapter methods

Strong knowledge of model quantization techniques and tools: GPTQ, AWQ, GGUF/GGML, bitsandbytes (4-bit/8-bit quantization), and post-training quantization strategies

Expertise in transformer architectures including attention mechanisms (multi-head, grouped-query, sliding window), positional encodings (RoPE, ALiBi), and architectural variants (encoder-only, decoder-only, encoder-decoder)

Practical experience optimizing model training: mixed precision (FP16, BF16), gradient checkpointing, gradient accumulation, distributed training (DDP, FSDP, DeepSpeed ZeRO), and memory-efficient techniques

Strong foundation in machine learning fundamentals including optimization algorithms, regularization techniques, loss functions, evaluation metrics, and experimental design

Graph & Vector Technologies

Hands-on experience with graph databases: Neo4j, Amazon Neptune, Azure Cosmos DB (Gremlin API), or TigerGraph, including Cypher/Gremlin query languages and graph data modelling

Proficiency in graph neural networks using PyTorch Geometric, DGL (Deep Graph Library), Spektral, or GraphSAGE for node classification, link prediction, and graph classification tasks

Deep experience with vector databases and similarity search: Pinecone, Weaviate, Qdrant, Milvus, Chroma, FAISS, or Annoy, including index optimization and hybrid search implementations

Expertise in embedding models and techniques: sentence transformers, bi-encoders, cross-encoders, domain-adapted embeddings, and multi-modal embeddings

Strong knowledge of RAG architectures: naive RAG, advanced RAG (with reranking, query transformation), modular RAG, and evaluation frameworks (RAGAS, TruLens)

Understanding of knowledge graph construction techniques including entity extraction, relation extraction, entity linking, and ontology alignment

Data & Infrastructure Skills

Proficiency with cloud ML platforms, preferably Azure ML, Azure Databricks, or equivalent (AWS SageMaker, Google Vertex AI) for model training and deployment

Strong SQL skills and experience with data warehouses (Snowflake preferred) and data lakes (Azure Data Lake, Delta Lake) for working with healthcare datasets

Experience with experiment tracking and model management: MLflow, Weights & Biases, Comet.ml, Neptune.ai, or similar platforms

Familiarity with containerization (Docker), orchestration (Kubernetes), and version control (Git) for reproducible ML workflows

Knowledge of data processing frameworks: Pandas, Polars, Dask, Spark (PySpark) for large-scale data manipulation and feature engineering

Domain Knowledge

Understanding of healthcare data standards (HL7, FHIR), clinical terminologies (SNOMED CT, ICD-10, LOINC, RxNorm), and EHR systems (Epic, Cerner, Oracle Health)

Knowledge of healthcare privacy regulations (HIPAA, HITECH) and data security requirements including de-identification standards (Safe Harbor, Expert Determination)

Familiarity with clinical workflows, medical documentation practices, and healthcare operational challenges that AI systems must address

Understanding of responsible AI principles including bias detection, fairness metrics, model interpretability, and ethical considerations for healthcare AI

Professional Competencies

Strong analytical and problem-solving skills with ability to formulate complex business problems as machine learning tasks

Excellent communication skills with ability to explain technical concepts to clinical, business, and executive stakeholders

Demonstrated ability to manage end-to-end data science projects from problem definition through production deployment

Strong collaboration skills with experience working across data engineering, ML engineering, product, and clinical teams

Self-directed learner who stays current with rapidly evolving AI/ML research and proactively experiments with emerging techniques

Preferred Qualifications

PhD in Machine Learning, Natural Language Processing, Computer Science, or related field with publications in top-tier conferences (NeurIPS, ICML, ACL, EMNLP, ICLR)

Research contributions to efficient language models, parameter-efficient fine-tuning, knowledge graphs, or retrieval-augmented generation domains

Experience with instruction tuning datasets and methodologies including self-instruct, Alpaca-style datasets, FLAN, and human preference alignment (DPO, PPO)

Hands-on experience with inference optimization frameworks: vLLM, TensorRT-LLM, text-generation-inference, CTranslate2, or ONNX Runtime for production deployment

Familiarity with agentic AI frameworks including LangChain, LangGraph, LlamaIndex, AutoGen, CrewAI, or Semantic Kernel for building multi-step AI workflows

Experience with prompt engineering techniques, prompt optimization frameworks (DSPy, PromptSource), and evaluation of prompt effectiveness

Knowledge of clinical NLP benchmarks and datasets: i2b2, MIMIC, n2c2, BioASQ, PubMedQA, and clinical outcome prediction tasks

Familiarity with multi-modal models combining text with clinical imaging, genomics, or time-series physiological data

Experience with federated learning or privacy-preserving ML techniques (differential privacy, secure multi-party computation) for sensitive healthcare data

Contributions to open-source ML/AI projects (Hugging Face, PyTorch, LangChain ecosystem) or active participation in ML research communities

Prior experience in healthcare technology organizations, clinical research institutions, or regulated industries with complex compliance requirements

Success Metrics & Objectives

Performance in this role will be evaluated based on the following key metrics aligned with Providence's AI Data Science and Engineering OKR framework:

Model Performance & Quality

Achieve target accuracy, F1 scores, and domain-specific metrics for SLMs on healthcare tasks (e.g., 85% F1 for clinical NER, 90% accuracy for classification tasks)

Deliver inference latency meeting real-time requirements (target: 100 ms p95 latency) while maintaining model quality through optimization techniques

Reduce model size by 50%+ through quantization and compression while retaining 95%+ of original model performance

Technical Innovation

Successfully implement and deploy 3+ novel fine-tuning or optimization techniques advancing Providence's SLM capabilities within first year

Build production-grade graph-augmented RAG system improving retrieval accuracy by 30%+ over baseline vector-only approaches

Business Impact

Deploy 2-3 production SLM applications within first 12 months demonstrating measurable clinical workflow improvement or operational efficiency gains

Achieve 20%+ reduction in manual clinical documentation time or 30%+ improvement in security analyst productivity through AI automation

Optimize inference costs achieving 40-60% reduction through model compression, efficient serving, and resource optimization strategies

Collaboration & Knowledge Sharing

Maintain high stakeholder satisfaction scores (80%) from clinical and operational partners on model performance and collaboration quality

Deliver technical documentation, training materials, and knowledge-sharing sessions enabling team members to leverage SLM capabilities

Mentor 2-3 junior team members contributing to skill development in fine-tuning techniques, graph learning, or vector retrieval systems

Key Technologies & Tools

This role requires hands-on expertise with the following technology stack:

Core ML Frameworks & Libraries

PyTorch, Hugging Face (Transformers, PEFT, TRL, Accelerate, Datasets, Tokenizers, Diffusers), TensorFlow/Keras, scikit-learn, XGBoost, LightGBM, BitsAndBytes, Optimum, ONNX Runtime

Fine-Tuning & Optimization

LoRA, QLoRA, AdaLoRA, Prefix Tuning, P-Tuning v2, IA3, Adapters, GPTQ, AWQ, GGUF/GGML, AutoGPTQ, llama.cpp, vLLM, TensorRT-LLM, DeepSpeed, FSDP, Flash Attention, Axolotl, Unsloth AI

Graph Technologies

Neo4j, Amazon Neptune, Azure Cosmos DB (Gremlin), PyTorch Geometric, DGL (Deep Graph Library), NetworkX, GraphSAGE, Node2Vec, Graph Attention Networks (GAT), Knowledge Graph Embeddings (TransE, DistMult, ComplEx)

Vector Stores & Embeddings

Pinecone, Weaviate, Qdrant, Milvus, Chroma, FAISS, Annoy, Sentence Transformers, all-MiniLM, BGE Embeddings, E5 Embeddings, Instructor Embeddings, BioBERT, ClinicalBERT, PubMedBERT, SciBERT

RAG & Retrieval Frameworks

LangChain, LlamaIndex, Haystack, RAGAS, TruLens, Cohere Rerank, Cross-Encoders, ColBERT, BM25 (rank-bm25, Elasticsearch), Hybrid Search, Query Expansion, HyDE (Hypothetical Document Embeddings)

Agentic & Orchestration

LangGraph, LangChain Agents, AutoGen, CrewAI, Semantic Kernel, LlamaIndex Workflows, DSPy, Guidance, LMQL, ReAct (Reasoning + Acting), Chain-of-Thought prompting

Cloud & Infrastructure

Azure ML, Azure Databricks, Azure OpenAI Service, Azure Cognitive Services, Azure Data Lake, Azure AKS (Kubernetes), Snowflake, Docker, Kubernetes, Ray, Terraform

MLOps & Experiment Tracking

MLflow, Weights & Biases, Comet.ml, Neptune.ai, DVC (Data Version Control), ClearML, Kubeflow, Airflow, Prefect, Great Expectations, Evidently AI

Model Serving & Inference

vLLM, TensorRT-LLM, Text Generation Inference (TGI), Triton Inference Server, TorchServe, FastAPI, Ray Serve, BentoML, Seldon Core

Data Processing & NLP

spaCy, NLTK, Gensim, Pandas, Polars, Dask, PySpark, Unstructured.io, LangChain Document Loaders, PDFPlumber, Tesseract OCR, Regex, Beautiful Soup

Healthcare & Clinical

FHIR (Python FHIR Client), HL7 (python-hl7), ClinicalBERT, BioBERT, MedCAT, ScispaCy, QuickUMLS, UMLS APIs, SNOMED CT, ICD-10, LOINC, RxNorm, Synthea (synthetic patient data)

More Info

Job Type:

Permanent Job

Industry:

Hospitals / Healthcare / Diagnostics

Function:

Cybersecurity and Healthcare AI

Employment Type:

Full time

About Company

Providence India

Job Source: careers.providence.in

Providence, one of the US's largest not-for-profit healthcare systems, is committed to high quality, compassionate healthcare for all. Driven by the belief that health is a human right and the vision, ‘Health for a better world’, Providence and its 121,000 caregivers strive to provide everyone access to affordable quality care and services.

Job ID: 149142197

Last Updated: 12-06-2026 05:37:38 AM