
Search by job, company or skills
Principal Architect with AI Capabilities
Location: REMOTE
Roles and Responsibilities:
Triage production issues, resolve incidents and ensure system stability, availability
Improve the quality and performance of various applications and services in the ecosystem
Collaborate closely with the Engineers to develop effective technical design and strategies, solving complex problems in the tech stack and driving execution to closure.
Propose and implement architectural changes that address scaling and performance challenges.
Provide hands-on technical leadership to deliver viable solutions that meet both functional and non-functional requirements.
Own the engineering excellence and operational readiness of the service driving the SLO, SLI, SLA of the relevant services
Ensure high quality services through comprehensive integration test and unit testing.
Debug production issues effectively, leveraging APM tools to trace transactions across various systems, provide short term / long term fixes based on the Root Cause Analysis.
Oversee all microservices within your purview, ensuring quality, security, availability, scalability and other ilities
Teach, mentor, grow, and provide advice to other domain experts, individual contributors, across several teams in live markets.
Establishing various best practices for software development and encouraging their adoption with the team and organization
Help develop and improve the overall technical direction of the organization including driving/ building best -in-class engineering practices and automation.
Collaborate with non-technical stakeholders, including Product Managers, Designers, Marketing teams.
Provide L3 engineering support to Operational teams.
Must Have Skills:
Minimum 15 years of hands-on coding experience, with a strong emphasis on Java /Go for backend development, as well as React and native app development for frontend.
Proven experience in debugging production issues and a strong ability to read and interpret APM tools like New Relic.
Strong design and architectural experience in building various highly-scalable and highly-available products on cloud providers like AWS, Azure, GCP.
Should have experience in creating system design - logical and deployment architectures for applications and able to write the HLD and LLD documents.
Should have worked hands-on in coding, debugging application and deployed applications in production, debugged and fixed severity 1 in applications in production
Should have handled applications that meet uptime of 99.9% and applications running in multi-site/region
Should have built and deployed high volume (500+ tps), high performance (response times1000 ) system into production
Should have very strong experience in choosing database choices, RDBMS, NoSQL - document vs columnar vs graph
Hands on experience with different services - RESTful, grpc, graphql, and websockets
Very strong experience with caches like Redis, messaging system like Kafka
Should have experience in IaC tools like Terraform, CloudFormation, CI/ CD tools like Jenkins, ArgoCD
Strong understanding of the SDLC Activities which include Analysis, Design, Development, Testing, Deployment and Post-Production Support etc.
Strong people management skills to lead and build a high-performing team.
Experience in collaborating with Product teams and Managers.
Experience in a product-based / hyper-growth environment is highly advantageous.
Experience in Micro Serviced based development and good experience with Kubernetes, Docker.
Experience in Web applications.
Working knowledge in Domain driven development would be an advantage.
Prior experience in the telecom domain would be an advantage.
Experience with Data and AI will be an added advantage.
Qualification:
Degree / Masters in Computer Science, Software Engineering or equivalent
Job ID: 145080187