
Sr Data Engineer

5-10 Years
  • Posted a day ago

Job Description

Experience Summary

5-10 years of experience as a Data Engineer specializing in building document- and knowledge-oriented data pipelines for regulatory/compliance domains, with strong capabilities in structured transformations, knowledge graphs, and containerized platform integration.

Core Responsibilities / Focus

  • Build and operate data ingestion and transformation pipelines for legal/regulatory content

  • Normalize and transform heterogeneous source formats (e.g., XML/HTML/structured exports) using tools such as XSLT

  • Implement pipelines for embeddings generation, indexing, and enrichment for downstream AI/RAG systems

  • Design and manage RDF-based knowledge representations and SPARQL-accessible datasets

  • Integrate storage and processing components across containerized/cloud environments

  • Support event-driven or integration-heavy workflows (e.g., via Apache Camel, message brokers)

  • Ensure reproducibility, maintainability, and operational handover of data pipelines

Core Skills (Must-Have)

  • Python/Java

  • Docker / Docker Compose

  • Kubernetes

  • Knowledge Graphs (RDF)

  • SPARQL

  • XSLT

  • Embeddings pipelines / vector preparation

  • Azure Storage (or equivalent cloud storage services)

  • Apache Camel

  • Git

Preferred / Nice-to-Have

  • Docling (or a similar document conversion tool)

  • CloudEvents

  • Kafka (or other message brokers)

  • Event-based systems / event-driven architecture

  • Dev Containers

  • GitOps

  • Documentation practices

Domain Advantage

Experience processing legal/regulatory source documents and preserving semantic structure / provenance

Familiarity with content domains such as EU regulation, privacy, ESG, and compliance frameworks


Qualifications

Educational qualification:

BE/B.Tech or Equivalent Degree

Experience:

5-10 Years

Mandatory/Required Skills:
Strong hands-on expertise in Python/Java, Docker / Docker Compose, Kubernetes, Knowledge Graphs (RDF), SPARQL, XSLT, embeddings pipelines / vector preparation, Azure Storage (or equivalent cloud storage services), Apache Camel, and Git

Preferred Skills:


About Company

The Bosch Group is a leading global supplier of technology and services. It employs roughly 402,600 associates worldwide (as of December 31, 2021). The company generated sales of 78.7 billion euros in 2021. Its operations are divided into four business sectors: Mobility Solutions, Industrial Technology, Consumer Goods, and Energy and Building Technology.
As a leading IoT provider, Bosch offers innovative solutions for smart homes, Industry 4.0, and connected mobility. Bosch is pursuing a vision of mobility that is sustainable, safe, and exciting. It uses its expertise in sensor technology, software, and services, as well as its own IoT cloud, to offer its customers connected, cross-domain solutions from a single source. The Bosch Group's strategic objective is to facilitate connected living with products and solutions that either contain artificial intelligence (AI) or have been developed or manufactured with its help. Bosch improves quality of life worldwide with products and services that are innovative and spark enthusiasm. In short, Bosch creates technology that is "Invented for life."

Job ID: 143956863
