About Opkey
Opkey is the leading Cloud Application Lifecycle Management (CALM) platform for Oracle, Workday, Salesforce, Coupa, and more. It cuts the costs and risks that drag down implementations and ongoing change, helping you go live on time, get more from your cloud app investments, and reach AI readiness faster. Opkey's 20+ AI agents manage all five phases of the cloud application lifecycleDefine, Design, Configure, Test and Train.
Whether it's a new implementation, a platform update, or businessasusual change, Opkey handles it all: updates validated in hours, selfhealing tests, endtoend integrations assured, configurations synced, and training updated in real timeall delivered in a single unified platform instead of a patchwork of disconnected tools.
Powered by Argus, a domainspecific AI model trained on decades of expertise and terabytes of enterprise application data, Opkey automates configuration, testing, change impact analysis, and training across these applicationscutting manual effort by 80%, enabling 30% faster golives, and slashing downtime risk by 92%.
Role: Cloud Engineer
Experience: 36 Years
Role Purpose
Own cloud infrastructure modules end-to-end, independently debug production issues, and participate in customer calls to troubleshoot, stabilize, and improve cloud-hosted environments.
Core Skills- Cloud Production Operations (Primary)
- AWS (Primary)
- EC2 production ops: instance lifecycle, AMIs, EBS volumes/snapshots, troubleshooting CPU/memory/disk/network issues, patching coordination, basic cost-aware sizing
- VPC: subnets (public/private), route tables, Internet Gateway/NAT Gateway concepts, NACLs vs Security Groups, VPC endpoints basics, peering basics
- S3: bucket policies, lifecycle rules, encryption basics, access troubleshooting (403/permissions), secure public access controls
- IAM: users/roles/policies, least privilege design, trust relationships, role assumption, access key hygiene, MFA enforcement awareness
- Route53: hosted zones, record types, routing policies basics, DNS troubleshooting (TTL/propagation)
- ALB: listeners, rules, target groups, health checks, sticky sessions basics, TLS termination patterns
- WAF basics: managed rules overview, allow/deny, common false-positive handling approach
- Networking Basics (Required)
- DNS, ports, routing concepts, CIDR/subnetting basics
- TLS handshake basics, certificate chains, common connectivity failure patterns
- Load balancer traffic flow basics (client LB backend)
- SSL / Certificate Management (Required)
- SSL installation & renewal (public + internal/self-signed where applicable)
- PFX/PEM/JKS awareness, SAN vs CN, intermediate chain handling
- Troubleshooting common cert issues (hostname mismatch, chain incomplete, expired certs)
- Windows Web Hosting Basics (Required)
- IIS basics: sites/app pools, bindings, logs, common 502/503 troubleshooting
- ARR basics: reverse proxy fundamentals, routing rules, timeouts, headers, SSL offload basics
- Secondary Skills
- Azure (Secondary)
- VM operations: sizing, disks, troubleshooting performance/connectivity
- VNet: subnets, NSGs, routing basics, private/public access patterns
- Storage: blob basics, access policies/SAS awareness, lifecycle basics
- RBAC: role assignments, scope (subscription/resource group/resource), troubleshooting access issues
- Application Gateway: listeners, backend pools, rules, health probes, TLS basics
- Azure WAF basics: managed rules awareness, request blocking pattern.
- Infrastructure as Code (Required)
- Terraform (Mandatory multi-cloud modules):
- module usage/customization, variables/outputs, remote state basics
- safe plan/apply workflow, drift awareness, environment separation (dev/stage/prod)
- basics of state locking and rollback/recovery approach
- Identity Basics (Required)
- SSO basics: SAML/OAuth concepts, metadata/cert rotation awareness, common misconfig patterns
- Coordinate with app/security teams for troubleshooting sign-in failures and token/cert issues
- Good To Have
- OCI (Exposure)
- Compute production ops, VCN fundamentals, security lists, route tables
- IAM basics (policies/compartments), Load Balancer basics
- Vault basics (cert/secret storage) and operational awareness
Responsibilities
- Own cloud infrastructure components end-to-end (provisioning, hardening, operations, troubleshooting)
- Handle production escalations independently with structured triage and RCA inputs
- Participate in customer calls for incident debugging, environment reviews, and remediation planning
- Ensure secure configurations: least-privilege IAM/RBAC, restricted network exposure, SSL correctness, WAF baseline protections
- Maintain operational readiness: runbooks, standard checks, and repeatable troubleshooting steps across environments
Skills: azure,terraform,devops,oci,cloud,aws