

Roles and Responsibilities
- Drive the reliability and scalability of cloud-based systems while identifying and implementing improvements for operational efficiency and proactive monitoring.
- Automation and Tool Development: Continuously seek opportunities to automate workflows, develop self-sustainable tools, and improve operational efficiency.
- Incident Management: Facilitate partner inquiries and production incidents, ensuring compliance with internal SLAs. Responsibilities include responding to, investigating, and mitigating customer impact.
- Partner with Global Partner Integrations (GPI), consumer engineering teams, and PMO to support product launches and other initiatives.
- Troubleshoot production issues by reviewing source code, logs, operational metrics, stack traces, etc. to pinpoint a specific problem and then resolve it.
- Identify root causes and learnings to improve operational processes.
- Be a results-driven creative thinker who drives innovation and produces delightful experiences for our customers.
- Demonstrate data-driven, open-minded decision making; have an insatiable curiosity; love to invent and innovate to solve difficult challenges.
- Take ownership of your work and consistently deliver results in a fast-paced environment.
- Actively support hyper-care and watch party events, providing real-time operational metrics and insights.
- Perform health checks on critical applications and services, ensuring uptime and availability.
- Write complex queries and scripts, analyze datasets, and pinpoint issues efficiently.
- Effectively communicate with global partners and stakeholders.
Roles and Responsibilities
- Foster teams with a strong SRE-driven engineering culture to close the gap between operations and software engineering teams.
- Drive the observability and monitoring of cloud-based systems while identifying and implementing improvements for operational efficiency and proactive monitoring.
- Be technically strong in industry-standard operational capabilities such as alerting, monitoring, and system/platform scalability.
- Automation and Tool Development: Continuously seek opportunities to automate workflows, develop self-sustainable tools, and improve operational efficiency.
- Incident Management: Facilitate partner inquiries and production incidents, ensuring compliance with internal SLAs. Responsibilities include responding to, investigating, and mitigating customer impact.
- Partner with the software engineering teams, technical account managers, and PMO to support product launches and other initiatives.
- Lead your team in troubleshooting any production issue by reviewing source code, logs, operational metrics, stack traces, etc. to pinpoint a specific problem and then resolve it.
- Identify root causes and learnings to improve operational processes.
- Be a results-driven creative thinker who drives innovation and produces delightful experiences for our customers.
- Demonstrate data-driven, open-minded decision making; have an insatiable curiosity; love to invent and innovate to solve difficult challenges.
- Take ownership of your work and consistently deliver results in a fast-paced environment.
- Actively support hyper-care and watch party events, providing real-time operational metrics and insights.
- Perform health checks on critical applications and services, ensuring uptime and availability.
- Write complex queries and scripts, analyze datasets, and pinpoint issues efficiently.
- Effectively communicate with global partners and stakeholders.
- Exercise good judgment when balancing immediate and long-term business needs.
What to Bring
- Monitoring & Alerting: Experience implementing alerting, metrics, and logging using tools like Prometheus, CloudWatch, Elastic, and PagerDuty.
- Direct experience with at least one cloud provider (AWS, GCP, Azure, or other).
- Strong expertise in SQL and hands-on experience working with databases.
- Experience building dashboards using tools like Databricks and Grafana.
- Familiarity with the OAuth 2.0 authentication framework.
- Experience with tools such as PagerDuty and ServiceNow is a plus.
- Ability to work flexible shifts to provide global operational coverage and collaborate effectively with remote peers across disparate geographies and time zones.
Job ID: 111420861