Position Summary:
We are seeking a talented and driven Data Scientist to join our team at Illumina. In this role, you will collaborate with cross-functional teams of scientists, engineers, and bioinformaticians to analyze complex biological data, build advanced models, and generate actionable insights that support research, product development, and commercial goals. Your work will be integral to advancing genomics, clinical applications, and the future of personalized medicine.
Key Responsibilities:
- Design, develop, and implement statistical models, machine learning algorithms, and analytical pipelines
- Collaborate with research, informatics, engineering, and product teams to define project goals and data strategies aligned with business objectives
- Apply data mining and predictive modeling techniques to uncover trends and patterns in diverse biological and clinical datasets
- Validate model performance and ensure reproducibility, scalability, and robustness of analytics solutions
- Work with software engineering teams to integrate models into production-ready internal and customer-facing tools
- Communicate technical findings and insights effectively to both technical and non-technical audiences via reports, presentations, and visualizations
- Remain current with industry trends, tools, and best practices in data science, AI, and computational biology
- Support development of LLM-driven tools and chatbot-based enterprise solutions in collaboration with AI systems teams
- Contribute to peer-reviewed publications, scientific conferences, and internal documentation
Required Qualifications:
- Bachelor's degree in Data Science, Computer Science, Statistics, Mathematics, Bioinformatics, Computational Biology, Engineering, or a related field
- Proven experience with large-scale datasets, including data wrangling, statistical analysis, and machine learning
- Proficiency in at least one programming language for data analysis (Python, R, or Julia)
- Experience with cloud platforms such as AWS or Azure, and data warehouse tools like Snowflake
- Familiarity with Kubernetes, Apache, and data visualization platforms such as Tableau
- Understanding of version control systems like Git for collaborative development
- Excellent communication skills with the ability to present technical findings clearly to diverse audiences
- Strong problem-solving skills, attention to detail, and the ability to manage multiple projects independently
Preferred Qualifications:
- Master's degree or Ph.D. in Data Science, Bioinformatics, Computational Biology, or a related field
- 2+ years of relevant experience or equivalent academic/industry project experience
- Hands-on experience with genomic or next-generation sequencing (NGS) data
- Experience in a regulated environment and knowledge of privacy regulations (e.g., HIPAA, GDPR)
- Track record of contributing to open-source projects or publishing in scientific journals
- Background in healthcare, biotechnology, or life sciences