Search by job, company or skills

Tokopedia

Principal SRE Engineer (SE5)

new job description bg glownew job description bg glownew job description bg svg
  • Posted 22 days ago
  • Be among the first 10 applicants
Early Applicant
Quick Apply

Job Description

Cloud Administration

  • Hands-on in administering cloud-based infrastructure deployment which includes tasks such as provisioning of resources, user administration, monitoring computing resource utilization, network setup, backup/restore, and incident management.

Automation

  • Hands-on in designing and building SRE tooling to automate monitoring, incident response, and alerting to reduce time-consuming functions that are still necessary.
  • Proficient to build and improve CI/CD tooling to automate and streamline deployments Proficient in design and build of GitOps practise for infrastructure management DevOps
  • Proficient in CI/CD tools like GitLab CI/CD, Jenkins, and CircleCI, with experience in infrastructure automation using Terraform, Ansible, and CloudFormation.

K8s Administration

  • Hands-on experience in deploying and managing applications on Kubernetes, with knowledge of pod and container lifecycle management, service and ingress resource management, and persistent storage solutions.
  • Skills in Kubernetes networking concepts, including services, ingress controllers, and network policies, with a focus on scalability, high availability, and security.

IaC on Cloud:

  • Experience in infrastructure provisioning and management using Infrastructure as Code (IaC) tools like Terraform, Terragrunt, CloudFormation, and Azure Resource Manager (ARM).
  • Knowledgeable in IaC best practices, including version control, testing, and continuous integration/continuous deployment (CI/CD) pipelines for infrastructure code.

Networking

  • Proficient with Cloud Load Balancers, Cloud Networking, Wireless (Aruba)Build and manage Cloud product features for Enhanced Networking like VPC, API Gateway, CloudFront, Route 53, Cloud WAN, Direct Connect, PrivateLink, Transit Gateway, Elastic Load Balancing (ELB), etc.

What you will need

  • 10+ years of experience in SRE or DevOps space (at least 8+ in a large enterprise Cloud)
  • Experience maintaining and operating large-scale applications in cloud platforms such as AWS or GCP is a must-have.
  • Strong hands-on experience in Kubernetes is a must-have.
  • Deep knowledge of Linux as a production environment, and container technologies. e.g. Docker.
  • Ability to automate repetitive tasks and familiarity with scripting languages.
  • Strong understanding of infrastructure-as-code principles and best practices such as Terraform
  • Solid understanding of networking concepts and protocols.
  • Understanding of microservices architecture, event-driven architecture, Chef/Ansible and CI/CDStrong technical aptitude including excellent troubleshooting and communication skills.

More Info

Job Type:
Function:
Employment Type:
Open to candidates from:
Indian

About Company

Tokopedia is an Indonesian technology company with a mission to democratize commerce through technology. Since its founding in 2009, Tokopedia has been a force that pioneers digital transformation in Indonesia. Consistent in building a bridge to connect millions of people, we have now reached more than 99% of districts and empowered more than 14 million registered merchants across Indonesia.

Job ID: 114472411