Responsible for App Ops- Technical and Functional support- of a wide range of production applications/services including day to day support, weekly/monthly releases/deployments, data fixes, and other regular patching activities
Work closely with Product Infrastructure (PI) team to support ongoing maintenances e.g., migration of apps between/within Data Centers, adding new servers, updating firmware etc and ensure minimal/no impact to customers
You will be a primary contact to resolve production/hosting related issues and expected to work closely with Change Management, Release management, and extended Partner teams for bug fixes, regular deployments etc
Applies knowledge of technology and operational best practices to drive design, development and implementation of operational standards and capabilities to enable highly available , scalable & reliable customer services/experiences
Analyzes and synthesizes a variety of inputs to drive end-to-end incident management process to restore service quick and then Root Cause Analysis( RCA) for long term fixes
Developing monitoring architecture and implementing monitoring agents, dashboards, escalations and alerts
Monitor dashboards (respond to alerts by resolving directly; escalating to appropriate resource )
Answering incoming phone calls and e-mails from customers and addressing their questions and concerns regarding NCR s online banking and point of sales products
Technical and functional support- of a wide range of production applications/services including day to day support, weekly/monthly releases/deployments, data fixes, and other regular patching activities
Primary contact to resolve production/hosting related issues
Applies knowledge of technology and operational best practices to provide support to productions systems
Work assigned Pivotal incidents
DNS updates
Apache configuration updates
You will be required to have a flexible schedule, including weekend support
Facilitate incident bridge lines when application outages occur
Responsible for implementing Product upgrades/Deployment and migration by following appropriate documentation and procedures.
Maintain automation tools and processes to streamline operational tasks and improve efficiency.
Manage the installation of software updates, patches and security fixes to keep systems up to date.
Monitor and manage Jira Queues to ensure service requests and tasks are prioritized, assigned and resolved promptly, according to established SLA s.
Responsible for following appropriate Change management process and attend CAB meetings to gather approvals to implement changes in production.
Participate in incident response and troubleshooting efforts, ensuring timely resolution and minimal impact on operations.
Continuously monitor and optimize system performance, identifying areas for improvement and implementing necessary enhancements.
Stay up-to-date with industry best practices and emerging technologies, identifying opportunities to enhance our infrastructure and operational processes.
BASIC QUALIFICATIONS:
Bachelors degree in Computer Science , Engineering, or a related field.
2+ years related experience
Linux or other Unix experience
Familiar with monitoring tools (e.g., Prometheus, Grafana, ELK stack).
Knowledge in scripting and automation using languages such as Python, Bash, or PowerShell.
Solid understanding of cloud computing platforms (e.g., AWS, GCP, Azure) and containerization technologies (e.g., Docker, Kubernetes).
Troubleshooting production issues
System Admin exposure
Familiarity with configuration management tools (e.g., Ansible, Puppet, Chef).
Quick learner, innovative problem solver, and flexible to adapt new apps and technologies
Excellent communication and collaboration skills, with the ability to work effectively in cross-functional teams.