Job description:
Key Responsibilities:
1. Azure Cloud Management
- Manage and monitor Azure resources including Virtual Machines, App Services, Azure SQL, Storage Accounts, Virtual Networks, and Load Balancers.
- Implement Infrastructure as Code (IaC) using ARM Templates, Bicep, or Terraform.
- Administer Azure Active Directory (AAD), RBAC policies, and Conditional Access.
- Configure and maintain Azure Backup, Site Recovery (ASR), and Security Center for resilience and compliance.
- Implement and monitor Azure Policy, Defender for Cloud, and Azure Monitor for governance and threat protection.
- Manage Hybrid Join and Intune policies for Windows endpoints in hybrid environments.
2. Windows Server Administration
- Administer and secure Windows Server 2016/2019/2022 environments (AD, DNS, DHCP, IIS, GPO, WSUS).
- Perform system patching, log analysis, backup validation, and performance tuning.
- Configure Active Directory replication, domain controllers, and group policies for user management.
- Manage file systems, permissions, and Windows networking components.
- Monitor event logs, troubleshoot BSODs, and perform root cause analysis for OS-level issues.
3. Automation & AI Operations
- Develop PowerShell and Python scripts to automate operational tasks such as patching, scaling, and reporting.
- Use Azure Automation Runbooks, Logic Apps, and Functions to orchestrate workflows and enforce policies.
- Integrate AI-based observability using Azure Monitor Insights, Log Analytics, and Application Insights.
- Implement predictive maintenance and anomaly detection using Azure Machine Learning / Cognitive Services.
- Work with the DevOps team to align automation pipelines using GitHub Actions / Azure DevOps.
4. Security & Compliance
- Implement MFA, Conditional Access, Defender for Cloud, and Just-in-Time (JIT) VM Access.
- Ensure compliance with DPDP Act, CERT-In guidelines, ISO 27001, and Microsoft Security Benchmarks.
- Conduct vulnerability reviews and assist in VAPT remediation.
- Enforce encryption policies (BitLocker, TLS/SSL, Key Vault management).
- Perform regular security posture assessments using Microsoft Secure Score and Compliance Manager.
5. Monitoring, Reporting & Incident Management
- Utilize Azure Monitor, Log Analytics, Sentinel, and Application Insights for performance visibility and alert management.
- Investigate and resolve L2 incidents and escalations (performance degradation, failed jobs, connectivity issues).
- Implement AI-based root cause correlation via Azure AI Operations (AIOps) or third-party AIOps platforms.
- Prepare health reports, uptime dashboards, and monthly client SLA summaries.