Job description
- Deploy and maintain infrastructure & Bottomline solutions hosted in the private and public cloud.
- Application release management, configuration, upgrades & support of Java microservices, open-source tools and third-party services in a SaaS environment.
- Identify, diagnose, and resolve complex technology issues efficiently in live production environment
- Work closely with the Engineering and Cloud Infrastructure team to escalate issues for triage and resolution.
- Lead Cloud automation initiatives to avoid recurrence of issues and trigger automated action to improve system availability
- Establish automated proactive monitoring of all systems/services/networks to detect and resolve problems to ensure uninterrupted operation of all infrastructure systems.
- Lead implementation of DevSecOps practices in collaboration with Security team
- Collaborate with Architects and engineering team in preparing scalable network & deployment architecture covering failover and DR setup
Essential Experience:
- Coding experience on backend using .NET Framework, .NET Core
- Design, implement, and manage CI/CD pipelines to automate and streamline the build, test, and deployment processes.
- Good Development Skills in
- Proficiency in CI/CD tools such as Jenkins, GitLab CI, ArgoCD, Helm Chart or similar tool.
- Good understanding of Operating Systems (Unix/Linux, Windows) and networking (TCP/IP, SNMP, SMTP, DNS, HTTP, SSL/TLS, VPN, routing tables),load balancer (F5) and Firewall setup
- Manage containerization and orchestration using Docker, Kubernetes, or similar technologies.
- Experience with infrastructure-as-code frameworks such as Ansible, Terraform, Chef,Puppet.
- Web Server/Application Server deployments and administration.
- Perform regular system updates and patching to maintain the health and security of systems.
- Understanding of multi-tier architecture, Web based development, and Services Oriented Architecture.
- Experience with scripting languages (Bash, Python, PowerShell) for automation.
- Proficiency with monitoring and alerting and logging solutions tools such as Prometheus, Grafana, ELK.
- Excellent communication (verbal and written) and interpersonal skills
- Ability to prioritize & balance activity between strategic projects for longer term impact and immediate production critical requirements with a customer focus.
- Ability to take on-call rotation & co-ordinate work under production critical situations is essential.
Desirable Experience :
- Passion of learning and mastering information technology
- Basic understanding of Database (postgres), IAM (Key Cloak) & Java Programming language is preferred
- Experience with a different queuing system like RabbitMQ, Kafka, SQL, etc.
- Create and maintain documentation for system configurations, processes, and procedures.
Experience and Education:
- 5+ years experience in system setup, configuration, diagnosis & monitoring of Enterprise grade SaaS services
- Bachelor s degree in computer science, Networking or related field.