Location: Bangalore
Department: Information Technology
Reports to: Senior Director- Applications and Systems
Years of Experience - 8+ Years
Position Summary:
The Senior Infrastructure Engineer - Storage and Backup is responsible for designing, implementing, and operating resilient storage, backup, and disaster recovery solutions across on-premises, hybrid, and cloud environments. This role provides deep technical expertise, leads complex infrastructure initiatives, ensures data protection, security, and compliance, and collaborates closely with cross-functional teams to align infrastructure capabilities with business continuity and performance objectives.
Key Responsibilities:
- As a Senior Infrastructure Engineer in Storage and Backup, you will play a critical role in ensuring the stability, scalability, and resilience of our organization's storage, disaster recovery (DR), and backup solutions across on-premises, hybrid, and multi-cloud environments.
Primary Responsibilities:
Solution Architecture & Design
- Collaborate with business and technical teams to design and implement scalable, secure, and high-performance storage architectures.
- Evaluate and select storage technologies (SAN, NAS, object storage, cloud-based storage) based on workload characteristics and performance requirements.
- Design and implement disaster recovery (DR), backup, and high availability (HA) solutions that align with SLAs, RPO, and RTO objectives.
- Architect efficient backup solutions that safeguard critical production systems and business data.
- Prepare detailed architecture diagrams, data flow models, and technical documentation for storage, backup, and DR strategies.
Implementation & Deployment
- Lead the deployment and integration of storage infrastructure solutions across compute, network, and cloud environments.
- Implement and maintain backup solutions (NetBackup, Rubrik and ASR) ensuring comprehensive data protection in both on-premises and cloud environments.
- Deploy disaster recovery solutions, ensuring rapid recovery of critical systems during a failure.
- Automate infrastructure provisioning, configuration, backup scheduling, and monitoring tasks using Infrastructure-as-Code (IaC) tools and scripting.
- Manage data migrations from legacy systems to modern storage platforms while maintaining data integrity and backup consistency.
- Work closely with DevOps, infrastructure, and application teams to ensure seamless deployments and disaster recovery testing.
Maintenance & Operations
- Proactively monitor storage and backup system performance to ensure optimal operation, capacity, and usage.
- Perform regular system updates, including firmware upgrades, patching, and health checks on storage and backup systems.
- Troubleshoot, investigate, and resolve incidents related to storage, backup, and disaster recovery, addressing performance bottlenecks and system failures.
- Conduct routine backup verification to ensure data integrity and test recovery readiness.
- Maintain comprehensive documentation related to backup and disaster recovery, including logs, procedures, and test results.
Security & Compliance
- Participate in regular audits, risk assessments, and security reviews to maintain compliance with both internal and external requirements.
- Lead efforts to ensure infrastructure meets cybersecurity standards such as ISO 27001, PCI, and relevant frameworks.
Capacity Planning & Performance Tuning
- Forecast future storage and backup needs based on growth trends, disaster recovery requirements, and business objectives.
- Optimize storage allocation, backup retention policies, and recovery workflows to meet both performance and cost requirements.
- Review backup schedules and retention policies, ensuring they minimize data loss in the event of a failure or disaster.
Backup & Disaster Recovery Planning & Testing
- Lead the design, implementation, and continuous improvement of disaster recovery plans to ensure business continuity.
- Perform regular testing of disaster recovery plans, including data failover, failback, and system restoration.
- Ensure off-site and cloud-based backups are implemented and managed for DR readiness.
- Establish and refine backup retention policies that balance compliance, cost, and recovery needs.
Collaboration & Support
- Provide tier 3 support for escalated issues related to storage, backup, and DR.
- Collaborate with vendors and third-party support teams to manage hardware/software issues, service renewals, and SLAs for storage, backup, and DR solutions.
- Engage with cross-functional teams (DevOps, infrastructure, application teams) to ensure storage and DR strategies align with business continuity goals and IT roadmaps.
- Foster positive vendor relationships and ensure that SLAs, performance, and compliance are met.
Documentation & Knowledge Management
- Maintain up-to-date high-level and low-level design documentation (HLD/LLD), Standard Operating Procedures (SOPs), and runbooks.
- Author and publish knowledge base articles for internal and external teams via ServiceNow or similar platforms.
- Support process optimization initiatives, helping to Shift Left on Low level tasks
Process Excellence & Automation
- Lead the adoption of IT Service Management (ITSM) best practices (Incident, Problem, Change, Knowledge, Availability).
- Drive automation and process optimization using Python, Power Automate, Azure DevOps, and other relevant tools.
- Bachelor's degree in computer science, Engineering, or a related technical field.
- Pure Storage Certified Data Storage Associate (formerly Global Enablement)
- Microsoft Certified: Azure Solutions Architect Expert
- Google Certified Solutions Architect
Key Skills & Competencies:
- 8+ years of experience in IT Infrastructure, Data Center Management, and Cloud Architecture with a focus on storage, backup, and disaster recovery.
- Expertise in cloud migration (AWS, Azure) and virtualization technologies.
- Strong cybersecurity expertise, particularly in storage and backup systems.
- Hands-on experience with cloud storage solutions, backup management tools (Rubrik, NetBackup), and disaster recovery strategies.
- Knowledge in automation and scripting using tools like Python, PowerShell, and Infrastructure-as-Code (IaC) tools.
- Exceptional troubleshooting and performance tuning skills for storage and backup solutions.
- Experience in managing vendor relationships and SLA performance.
- Strong Communication and interpersonal skills, with the ability to engage with senior stakeholders
- Excellent documentation skills for creating system designs, runbooks, and knowledge base articles.
Preferred Experience:
- Hands-on implementation experience with cloud-based storage, hybrid cloud environments, and data protection strategies.
- Exposure to ISO/ITIL-based governance frameworks and global/regional operations.
- Ability to lead and mentor junior engineers, providing technical direction and guidance.