Sr Software Architect - Global Network Services

  • Posted 18 hours ago

Job Description

The Role: We are looking for a Data Engineer to design, build, and own the observability pipeline that serves as the central nervous system for our entire platform. Your mission is to engineer the systems that collect, process, and correlate massive streams of telemetry (logs, metrics, and traces) from our globally distributed infrastructure. You will transform raw, disparate data points into a unified, coherent, and actionable view of our network's health and performance, empowering our engineering teams to detect, diagnose, and resolve issues with speed and precision.

Responsibilities:
  • Build and Manage Data Ingestion Pipelines: Architect and implement robust, scalable data ingestion pipelines using tools like Fluentd or Logstash to collect telemetry from thousands of sources across our multi-cloud environment.
  • Own Observability Datastores: Deploy and manage the specialized databases that power our observability stack, including time-series databases (e.g., Prometheus, InfluxDB) for metrics and search indexes (e.g., Elasticsearch) for logs.
  • Develop Data Correlation Engines: Design and build systems that process and correlate disparate data streams in real time. Your work will enable engineers to connect a specific log error to a metric spike or a distributed trace, providing a holistic view of system behavior.
  • Ensure Data Quality and Reliability: Implement data validation, quality checks, and monitoring to ensure the accuracy, consistency, and timeliness of the data flowing through the observability pipeline.
  • Collaborate and Enable: Work closely with our control plane, data plane, and infrastructure teams to understand their monitoring needs and provide them with the data, dashboards, and tools required to maintain operational excellence.

Required Qualifications:
  • 3+ years of experience in data engineering, site reliability engineering (SRE), or a similar role with a focus on observability.
  • Hands-on experience building and managing data pipelines with ingestion tools such as Fluentd, Logstash, Fluent Bit, or Vector.
  • Strong experience with time-series databases like Prometheus or InfluxDB.
  • Proficiency with search and analytics engines, particularly Elasticsearch and the ELK Stack (Elasticsearch, Logstash, Kibana).
  • Solid programming and scripting skills in a language like Python or Go.
  • A strong understanding of the three pillars of observability: metrics, logs, and traces.

Preferred Qualifications:
  • Experience building and operating systems in a multi-cloud environment (AWS, Azure, GCP).
  • Familiarity with containerization and orchestration technologies (Docker, Kubernetes).
  • Experience with distributed tracing and telemetry standards like OpenTelemetry.
  • A conceptual understanding of networking principles.
  • Experience working in a fast-paced startup environment.

About Company

Tata Communications is a digital ecosystem enabler that powers today’s fast-growing digital economy. We enable the digital transformation of enterprises globally, including 300 of the Fortune 500. We carry around 30% of the world’s internet routes and connect businesses to 60% of the world’s cloud giants.
We have been a part of the rich heritage of the internet in India. Over the last 25 years, enterprise-enabled services have been essential to the adoption of digital services in the country. Connectivity is an essential fabric of sustenance for the economy. We are committed to enabling Industry leaders in this New World of Communications™, with our unique promise of delivering secure connected digital experiences.
In 2020, we announced the launch of ‘Secure Connected Digital Experience’ (SCDx), a proposition intended to meet this growing, worldwide demand for new ways of operating, which includes far higher levels of working from home, rising security risks, a shift to digital commerce, and more contactless experiences. It will help companies currently relying on short-term fixes by providing holistic, secure, enterprise-level digital solutions that address current challenges and are fit for the long term.

Job ID: 136510473