Search by job, company or skills

Art Technology and Software

Service Reliability Engineer (Mid/Senior)

Save
new job description bg glownew job description bg glownew job description bg svg
  • Posted 14 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Service Reliability Engineers (Mid / Senior)

Location: WFO – Infopark, Kochi

Team: Engineering & Risk Operations

Role Summary

We are looking for a Mid‑level / Senior‑level Service Reliability Engineer to help ensure the resilience, performance, and integrity of our fintech platform.

In this hybrid role, you will drive efforts to maintain service uptime, while also leveraging data‑driven analysis to detect, respond to, and mitigate fraudulent activities. You will work at the intersection of engineering, risk, and operations, ensuring that both our systems and customers remain protected.

This is a high‑impact role, ideal for candidates with 3+ years of experience who thrive in fast‑paced, mission‑critical payments or fintech environments.

Key ResponsibilitiesService Reliability Engineering

  • Ensure high availability and performance of customer‑facing services and payment systems.
  • Design and implement observability and alerting strategies using Datadog, PagerDuty, and custom dashboards.
  • Troubleshoot alerts, logs, and databases across infrastructure, applications, and APIs; perform root cause analysis.
  • Define and manage SLIs, SLOs, and SLAs to measure and improve system health.
  • Develop and maintain automation scripts and tools for system provisioning, health checks, and failover mechanisms.
  • Collaborate with development and infrastructure teams to design scalable, fault‑tolerant systems.

Fraud Detection & Risk Monitoring

  • Monitor suspicious transaction patterns in real time using Splunk, Datadog, internal logs, and behavioral data.
  • Investigate alerts, anomalies, and user behaviors to detect fraud, abuse, or financial risk exposure.
  • Tune and optimize detection rules, triggers, and risk signals based on observed trends.
  • Work closely with Compliance, Fraud Operations, and Customer Support teams on investigations and resolutions.
  • Develop and maintain dashboards and reports for fraud KPIs, incident metrics, and operational trends.
  • Participate in cross‑functional fraud response processes and post‑incident reviews.

Required Qualifications

  • 3+ years of experience in a combined SRE, Fraud Analyst, or Security Operations role.
  • Strong experience with monitoring and alerting tools: Datadog, PagerDuty, Splunk.
  • Proficiency with Linux, shell scripting, and cloud infrastructure troubleshooting (AWS, GCP, or Azure).
  • Solid understanding of service observability, incident response, and CI/CD pipelines.
  • Experience with fraud detection or risk monitoring workflows in fintech, payments, or transaction‑heavy systems.
  • Comfortable writing queries (SQL, Splunk SPL) and handling large‑scale log or event data.
  • Strong collaboration and communication skills with both technical and non‑technical stakeholders.

Nice to Have

  • Exposure to KYC/AML systems or fraud frameworks in regulated environments.
  • Familiarity with secure API architecture, OAuth2, JWT, and access‑control mechanisms.
  • Knowledge of PCI‑DSS, ISO 27001, ISO 8583, or other financial compliance frameworks.
  • Hands‑on experience with rule engines or anomaly detection models.
  • Certifications such as AWS DevOps Engineer, CFE (Certified Fraud Examiner), or similar.

What You'll Get

  • A unique opportunity to work across Service Reliability Engineering and Fraud Detection.
  • High‑impact work supporting secure, high‑volume payment platforms.
  • A collaborative, data‑driven culture with a strong engineering and risk mindset.
  • Career growth through ownership of complex systems and security initiatives.

Skills: support,grafana,monitoring,datadog,fraud,risk,splunk,24/7,nagios,troubleshooting,sql

More Info

Job Type:
Industry:
Function:
Employment Type:

Job ID: 147222661