
Tesla

Site Reliability Engineer, Platform Engineering

  • Posted a day ago

Job Description

Position Description:
Tesla's Platform Engineering team is looking for a Site Reliability Engineer. As a member of the team, you will build and maintain Kubernetes clusters using infrastructure-as-code tools such as Ansible, Terraform, ArgoCD and Helm, and help application teams succeed on our platform. The underlying infrastructure is a mix of on-premise VMs, bare-metal hosts and public clouds such as AWS, located all around the globe, which presents unique challenges and the opportunity to work with different types of infrastructure technologies.

A successful candidate will be expected to possess expert knowledge of Linux fundamentals, architecture and performance tuning, as well as software development skills to match. Experience running Kubernetes in production is a strong plus. We prefer Golang or Python for any automation or tools we have to build along the way.

We are the team that runs production-critical workloads for every aspect of the business at Tesla and sets the standards for other teams: a group of well-rounded generalists who not only solve the hardest problems in the industry but also push other engineering teams at large to be better. Join us for a chance to work with some of the best engineers in the industry at one of the most transformative companies in the history of both the automotive and energy industries.

Responsibilities:

  • Work hands-on with developers to deploy and support their applications
  • Build new features that improve the platform's stability and update process
  • Manage our Kubernetes clusters on-prem and in the cloud to support our growing workloads
  • Participate in the architecture design process and in troubleshooting live applications with the product teams
  • Participate in a 24x7 on-call rotation
  • Influence architectural decisions with a focus on security, scalability and high performance
  • Set up and maintain monitoring, metrics and reporting systems for fine-grained observability and actionable alerting
  • Author technical documentation for workflows, processes and best practices

Requirements:

  • Experience managing web-scale infrastructure in a production *nix environment
  • Ability to prioritize tasks and work independently, with an analytical mind and a bias for action
  • Advanced or expert-level Linux administration and performance tuning skills
  • Bachelor's Degree in Computer Science, Computer Engineering, or equivalent experience or evidence of exceptional ability
  • Advanced experience with configuration management and infrastructure-as-code tools such as Ansible, Terraform or Puppet
  • Demonstrable knowledge of Linux operating system internals, the networking stack, filesystems, resource scheduling and process management
  • Exposure to AWS or other cloud infrastructure providers
  • Experience managing container-based workloads in production using Kubernetes or other orchestration software, along with related tooling (ArgoCD, Helm)
  • Proficiency in a high-level language such as Python, Go, Ruby and/or Java

    Job ID: 147326179
