About the Role
We are seeking a Senior Platform Engineer (OpenStack) to design and build scalable Python-based automation services and APIs on top of our OpenStack cloud infrastructure. This is a hands-on engineering role where you will write production-grade Python code daily while ensuring platform reliability, scalability, and performance.
You will be responsible for building core platform services such as API layers, workflow automation systems, and resource management frameworks that make our cloud infrastructure programmable and efficient. We are looking for engineers who emphasize clean architecture, maintainable code, comprehensive testing, and pragmatic delivery.
Key Responsibilities
- Build and maintain Python-based services for OpenStack VM provisioning, configuration, and lifecycle management using FastAPI/Flask and openstacksdk
- Design and develop robust REST APIs for infrastructure operations (VM creation, volume management, metadata updates) with proper validation and error handling
- Implement automated workflows for cloud-init provisioning, post-deployment configuration, and resource cleanup
- Develop comprehensive test suites (unit, integration, and end-to-end) with at least 80% code coverage
- Integrate with OpenStack services such as Nova (Compute), Neutron (Networking), and Cinder (Block Storage)
- Build and manage CI/CD pipelines using GitLab CI for testing and deployment
- Implement monitoring, alerting, and logging using Prometheus, Grafana, and structured logging frameworks
- Document APIs, deployment processes, and troubleshooting guides
- Collaborate with infrastructure and cross-functional teams on requirements gathering and solution design
- Maintain production systems, including incident response, debugging, and performance optimization
- Develop storage management capabilities including snapshots, backups, and multi-attach scenarios
Required Qualifications
- 5+ years of software engineering experience, including at least 3 years in platform/backend engineering
- 3+ years of experience building production-grade REST APIs with validation, error handling, and OpenAPI documentation
- 2+ years of experience with cloud platforms (OpenStack/AWS/GCP/Azure)
- Strong expertise in Python 3, including FastAPI/Flask, async programming, type hints, and Pydantic
- Experience with architectural styles such as microservices and service-oriented architecture, and with design patterns (Repository, Factory, Dependency Injection)
- Hands-on experience with CI/CD pipelines, deployment, monitoring, and production support
- Strong collaboration and communication skills
Preferred Qualifications
- Experience with Infrastructure as Code tools (Terraform, Ansible, Pulumi)
- Hands-on expertise with OpenStack services (Nova, Neutron, Cinder)
- Experience with containerization and orchestration (Docker, Kubernetes)
- Familiarity with messaging systems (RabbitMQ, Kafka, Redis Pub/Sub)
- Experience with relational databases (PostgreSQL, MySQL)
- Knowledge of monitoring and observability tools (Prometheus, Grafana, ELK Stack)
- Strong understanding of Linux systems and networking fundamentals
- Experience with API gateways, service mesh technologies, and distributed system scalability patterns