Production-level Python — pandas and modern data transformation patterns not ad-hoc scripting.
End-to-end pipeline ownership — has personally built and operated production pipelines (ingest → transform → publish) with scheduling, monitoring, and alerting.
Cloud storage — hands-on with Azure Data Lake Storage (ADLS) preferred AWS S3 or GCP equivalent acceptable.
Databricks or Apache Spark for scalable transformations — strongly preferred.
Strong SQL & data modeling fundamentals joining heterogeneous datasets.
Comfort with structured and semi-structured data (JSON, Parquet, CSV).
8+ years of production data engineering with hands-on API integration 2+ years specifically with Cisco DNAC APIs and Wi-Fi telemetry.
Microsoft Intune, CMDB, or other enterprise device datasets: