
Search by job, company or skills
At American Express, our culture is built on a 175-year history of innovation, shared and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. From delivering differentiated products to providing world-class customer service, we operate with a strong risk mindset, ensuring we continue to uphold our brand promise of trust, security, and service.
As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new skills, develop as a leader, and grow your career. Here, your voice and ideas matter, your work makes an impact, and together, you will help us define the future of American Express.
How will you make an impact in this role
We're looking for a Site Reliability/Application SupportEngineer responsible for application performance, availability, and reliability. Candidate is responsible to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. Site Reliability Engineering/Application Support(SRE/AS) is a continuous engineering discipline that effectively combines software development and systems engineering to build and run scalable, distributed, fault-tolerant systems. This role will ensure that American Express internal and external services have reliability and uptime appropriate to users needs. We also ensure a continuous improvement, while keeping an ever-watchful eye, automated, on capacity and performance.
This role will drive the SRE/ASmindset which strives to use software engineering to build and run better production systems. You will write software to optimize day to day work through better automation, monitoring, alerting, testing and deployment. You'll be expected to work with several Technology partners to identify areas of opportunity within the availability platform and build a solution to automate monitoring solutions for the modernization platform, technology, and constant innovations to drive efficiencies. You will be responsible for implementing tracing, monitoring, tooling solutions to maximize the performance and availability of our Web applications. This is an opportunity to work in one of the best Technology units to help improve customer experience for American Express digital assets and influence how millions of people interact with their cards, their merchants, and their money.
This role is a hands-on position supporting American Express Site Reliability Engineering/ Application Supportteam.
What you will be doing
Provide hands-on support for the runtime operation of our applications, ensuring high availability and performance.
Collaborate with software engineering and infrastructure teams to troubleshoot and resolve runtime issues, including performance bottlenecks, scalability challenges, and system failures.
Contribute to the design and implementation of monitoring, alerting, and logging solutions to proactively identify and address potential runtime issues.
Participate in incident response and root cause analysis efforts to ensure the stability and resilience of the applications.
Work closely with cross-functional teams to understand application requirements and provide input on runtime and operational considerations during the software development lifecycle.
Contribute to the development and maintenance of runtime automation and tooling to streamline operational processes and improve efficiency.
Mentoring your peers and demonstrate a passion for continuous learning environment for the team.
Develop common framework components (to be leveraged by enterprise applications), define standards for configuration, monitoring, reliability, and performance engineering
Drive automation and ensure automated test scripts are completed for new features.
Goodattitude,communication,willingnessto learn and collaborate.
Bring a culture of innovation, ideas, and continuousimprovement.
Challenging status quo, demonstrate risk taking, and implementcreativeideas.
Continuously improve automated remediation tasks to ensure the highest levels of availability
Qualifications
Open to work in 24.7 or on-call working environment
BS or MS degree in computer science, computer engineering, or other technical discipline, or equivalent experience inSite Reliability Engineering/Application Support(SRE/AS) supporting Full-stack applications
Development or support of Java/J2EE/REACT JS applications, and Node applications
Hands on experience with frameworks - Spring Boot, Vertex, NodeJS
Experience in designing mission critical highly available enterprise applications
Hand on experience with performance testing framework design, tuning Java applications
Experience managing relational and NoSQL databases such as DB2, Postgres, Mongo, Couchbase, Cassandra etc.
Strong knowledge of Linux internals and experience managing Linux systems in high traffic environments
Strong interpersonal communication skills and the ability to work well in a diverse team-focused environment
Experience with Splunk and/or ELK. Hands on experience on configuring Splunk, Grafana dashboards, Elastalert, OpenSearch, etc.
Good understanding of cloud technologies - Kubernetes, OpenShift, Docker etc.
Good understand of GraphQL - Query and resolver
Knowledge of Public Cloud technologies GCP, AWS, AZURE etc. would be an advantage
Monitoring and analyzing PMI data
Hands on experience on enterprise tools set such as Grafana, Dynatrace, AppDynamics, BMC, Prometheus etc.
Understanding of using Agile Practices in Operations teams
Experience in handling DDoS/BOT attack and different security remediations
Working experience with Network load balancers, Global Traffic Managers (GTMs), Local Traffic Managers (LTMs)
Working experience on network rules creation, load balancer configurations, network packet analysis
Analytical knowledge and exposure on root cause identification using analyzer tools like IBM support assistant, Splunk etc.
We back you with benefits that support your holistic well-being so you can be and deliver your best. This means caring for you and your loved ones physical, financial, and mental health, as well as providing the flexibility you need to thrive personally and professionally:
Competitive base salaries
Bonus incentives
Support for financial-well-being and retirement
Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location)
Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
Generous paid parental leave policies (depending on your location)
Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
Free and confidential counseling support through our Healthy Minds program
Career development and training opportunities
American Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law.
Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.
American Express Company (Amex) is an American multinational corporation specialized in payment card services headquartered at 200 Vesey Street in the Battery Park City neighborhood of Lower Manhattan in New York City. The company was founded in 1850 and is one of the 30 components of the Dow Jones Industrial Average. The company's logo, adopted in 1958, is a gladiator or centurion whose image appears on the company's well-known traveler's cheques, charge cards, and credit cards.
Job ID: 145408839
We don’t charge any money for job offers