Job Description
We're hiring a Software Engineer to build the infrastructure that powers our AI agents and ML systems end-to-end — from fine-tuning foundation models to shipping production-grade agent harnesses. You'll work across the stack: building MLOps pipelines, customizing LLMs, and deploying scalable agent systems on Kubernetes. This role sits at the intersection of ML engineering, platform engineering, and applied AI.
How You Will Contribute And What You Will Learn
- Design and build agent harnesses in Python — the runtime scaffolding that enables AI agents to perceive, reason, plan, and act reliably
- Develop and maintain a robust MLOps framework using Kubeflow and complementary tooling (MLflow, Argo, Airflow, or similar) to orchestrate training, evaluation, and deployment workflows
- Fine-tune foundation LLMs using techniques such as LoRA/QLoRA, SFT, and RLHF; manage datasets, training runs, and evaluation pipelines
- Deploy and operate services on Kubernetes, including model serving, autoscaling, and observability
- Build and integrate AI agents using modern agent frameworks (LangGraph, CrewAI, AutoGen, LlamaIndex, or similar)
- Apply software engineering rigor — SOLID principles, secure coding, static analysis, code reviews, and CI/CD — across all deliverables
- Collaborate with researchers, ML engineers, and product teams to take prototypes from notebook to production
Key Skills And Experience
Must to Have:
- Bachelor's or Master's degree in Engineering, along with around 6+ years of experience in Python development, including building and supporting production systems
- Hands-on experience working with agent-based or agentic systems, using at least one framework such as LangGraph, CrewAI, AutoGen, LangChain, or LlamaIndex
- Exposure to designing or contributing to MLOps pipelines, with familiarity with tools like Kubeflow
- Practical experience in fine-tuning large language models (for example, open-source models like Llama, Mistral, Qwen, or similar)
- Experience deploying containerized applications on Kubernetes, including areas like Helm, operators, networking, and resource management
- Familiarity with at least one major cloud platform (AWS, GCP, or Azure), including services related to compute, storage, identity access management, and machine learning
- Understanding of software engineering practices such as modular design (SOLID principles), design patterns, secure coding practices, static analysis tools (for example, mypy, ruff, Bandit, SonarQube), and testing approaches (unit and integration testing)
Nice to Have:
- Exposure to distributed training approaches, using tools such as DeepSpeed, FSDP, or Accelerate
- Familiarity with vector databases, retrieval-augmented generation (RAG) systems, and evaluation frameworks for language models
- Experience working with model serving solutions such as vLLM, TGI, KServe, or Triton
About Us
Advancing connectivity to secure a brighter world.
Nokia is a global leader in connectivity for the AI era. With expertise across fixed, mobile and transport networks, powered by the innovation of Nokia Bell Labs, we're advancing connectivity to secure a brighter world.
Learn more about life at Nokia .
Our recruitment process
We act inclusively and respect the uniqueness of people. Our employment decisions are made regardless of race, color, national or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disability, protected veteran status or other characteristics protected by law. We are committed to a culture of inclusion built upon our core value of respect.
If you're interested in this role but don't meet every listed requirement, we still encourage you to apply. Unique backgrounds, perspectives, and experiences enrich our teams, and you may be just the right candidate for this or another opportunity.
The length of the recruitment process may vary depending on the specific role's requirements. We strive to ensure a smooth and inclusive experience for all candidates. Discover more about the recruitment process at Nokia .