About The Role
We are seeking a highly skilled Senior Platform Engineer to join our growing Platform Engineering team out of our Bangalore office. In this role, you will be responsible for designing, building, and maintaining scalable, secure, and reliable platform services that support development and operations across the organization.
You will work closely with engineering, security, and DevOps teams to streamline application delivery and infrastructure management using modern DevOps and cloud-native principles. This is a hands-on individual contributor role requiring deep technical expertise, strong collaboration skills to work across globally distributed teams, and the ability to operate independently during US evening maintenance windows.
Key Responsibilities
Platform Engineering & Infrastructure
- Design, implement, and maintain robust platform solutions ensuring high availability, scalability, and security across on-premises and cloud environments
- Build and operate shared platform services used by development and operations teams across the organization
- Automate infrastructure provisioning and configuration using Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation
- Support containerized workloads using Kubernetes and/or other orchestration platforms across test, UAT, pre-prod, and production environments
Cloud Infrastructure
- Drive cloud infrastructure best practices with a primary focus on AWS, including networking (VPC, Transit Gateway), identity and access (IAM, SSO), data services (S3, RDS, ElastiCache), and compute (EC2, EKS)
- Support hybrid cloud and on-premises infrastructure patterns spanning data centers and cloud-native services
- Participate in cost-aware infrastructure design, collaborating with FinOps to monitor and optimize resource utilization
Observability & Reliability
- Implement and maintain platform observability including monitoring, logging, and alerting systems using tools such as Splunk, Prometheus, Grafana, CloudWatch, or similar
- Participate in incident response, root cause analysis, and post-incident reviews for platform-related issues
- Champion SRE principles to enhance system resilience, performance, and operational excellence
Security & Compliance
- Ensure compliance with security, privacy, and governance policies across all platform services
- Integrate security scanning and DevSecOps practices into CI/CD pipelines and platform operations
- Support audit and compliance requirements, including change management, access controls, and evidence gathering
Collaboration & Mentorship
- Work closely with Platform Engineering, Server Ops, Network, DevOps, Application Engineering, and Security teams across US, UK, and India
- Mentor junior engineers and contribute to platform engineering roadmaps and knowledge sharing
- Maintain comprehensive documentation including operational runbooks, architecture diagrams, and SOPs
Required Qualifications
- 6+ years of experience in platform engineering, infrastructure, DevOps, or SRE roles
- Expertise in at least one major cloud provider (AWS, Azure, or GCP), with strong preference for AWS
- Solid experience with containerization (Docker) and orchestration (Kubernetes)
- Proficiency with Infrastructure as Code tools (Terraform, Ansible, Pulumi, or CloudFormation)
- CI/CD automation experience (e.g., GitLab CI, Jenkins, ArgoCD)
- Strong scripting and automation skills (Python, Bash, or Go)
- Experience with observability and telemetry systems (Splunk, Prometheus, Grafana, ELK, or Datadog)
- Strong understanding of networking fundamentals, security, and infrastructure architecture
- Familiarity with ITIL Change Management processes, including writing maintenance plans, risk assessments, and backout procedures
- Excellent communication and collaboration skills, with the ability to work effectively across globally distributed teams
Preferred Qualifications
General
- Experience in financial services or other regulated industries with strict SLA and security requirements
- Experience working in globally distributed teams with US-based stakeholders
- Multi-cloud or hybrid infrastructure experience
- Familiarity with secrets management tools (e.g., KeyVault, HashiCorp Vault) and policy enforcement (e.g., OPA)
- Relevant certifications (e.g., Kubernetes, AWS, Terraform) are advantageous
Expertise in One of the Following Specializations
API Management (Apigee / API Gateway)
- 3+ years of hands-on experience managing and scaling API platforms such as Apigee, Kong, or AWS API Gateway
- Strong understanding of API design principles (REST, SOAP, GraphQL), rate limiting, caching, and API security (OAuth2, JWT, mTLS)
- Experience defining and enforcing API governance, authentication/authorization models, traffic management, and analytics
- Experience with Apigee hybrid or multi-region deployments
- Experience with DevSecOps practices in API lifecycle and access control
Message Queue & Event Streaming (MQ / Kafka)
- 3+ years of hands-on experience administering and scaling enterprise messaging platforms such as IBM MQ, RabbitMQ, or Apache Kafka
- Strong understanding of message queue clustering, high availability, routing, and failover configurations.
- Experience with cloud-native messaging services (e.g., AWS MSK, Amazon SQS, Azure Service Bus).
- Experience with Kafka topic management, consumer group orchestration, schema registry, and Connect integrations
- Experience with performance tuning, capacity planning, and troubleshooting for high-throughput messaging environments.