Role/ Job Title: Data Catalog Stewardship
Function/ Department: Data Governance/ Information Management /Data & Analytics
Location- Thane Maharashtra
Job Purpose:
The role will focus on building, maintaining, and enhancing the enterprise data catalog using Knowledge Graph, Neo4j, ontology design, and Graph RAG capabilities.
The candidate will work closely with Data Governance, Data Engineering, Data Quality, Business, Risk, Compliance, and Analytics teams to improve metadata quality, data discoverability, business glossary adoption, lineage traceability, and AI-ready cataloging across the bank's data ecosystem.
Roles & Responsibilities:
- Own and maintain data catalog assets including datasets, data elements, business terms, technical metadata, classifications, ownership, and glossary mappings.
- Design and enhance Knowledge Graph-based catalog structures using Neo4j.
- Develop and manage ontology models for business glossary, data dictionary, lineage, data quality, CDE/RDE classification, and data ownership.
- Support implementation of Graph RAG use cases for metadata search, semantic discovery, and AI-enabled data catalog queries.
- Create and maintain relationships between business terms, datasets, data elements, source systems, lineage, data quality rules, owners, stewards, and critical data elements.
- Work with business and technology teams to validate catalog definitions, metadata completeness, ownership, and data classification.
- Support CDE/RDE discovery and mapping for critical banking datasets.
- Contribute to data governance frameworks, metadata standards, stewardship workflows, and catalog quality controls.
- Perform metadata profiling and validation using SQL and Python where required.
- Collaborate with Data Engineering teams to ingest metadata from source systems, marts, data quality platforms, and lineage repositories.
- Ensure catalog content is accurate, consistent, audit-ready, and aligned with governance standards.
- Prepare catalog progress reports, stewardship dashboards, and governance status updates for stakeholders.
Required Skills
- Strong hands-on experience in Neo4j development.
- Strong understanding of Knowledge Graph-based data cataloging.
- Good knowledge of data catalog concepts, including business glossary, data dictionary, metadata, lineage, ownership, and classification.
- Experience in ontology design, entity-relationship modeling, graph schema design, and semantic data modeling.
- Understanding of Graph RAG concepts for metadata discovery, LLM-based search, and contextual catalog querying.
- Working knowledge of Data Governance frameworks, including stewardship, ownership, metadata management, data quality, lineage, CDE/RDE discovery, and policy alignment.
- Hands-on experience with SQL for metadata analysis, profiling, and validation.
- Working knowledge of Python for automation, data preparation, metadata processing, and catalog enrichment.
- Strong communication skills with the ability to interact with business users, technology teams, governance stakeholders, and leadership.
Preferred Skills
- Experience working in banking, financial services, risk, compliance, or analytics domains.
- Exposure to data governance tools such as Collibra, Alation, Informatica, or similar platforms.
- Understanding of data privacy, PII classification, regulatory reporting, and audit requirements.
- Knowledge of lineage concepts across source, staging, refined, mart, and consumption layers.
- Experience in building semantic search, metadata intelligence, or AI-assisted cataloging solutions.
- Familiarity with APIs, metadata ingestion pipelines, and graph-based integration patterns.
Technical Skill Set
- Graph Database: Neo4j
- Knowledge Graph: Graph catalog modeling, relationship design, entity mapping
- Data Catalog: Business glossary, data dictionary, metadata management, lineage, ownership
- AI / RAG: Graph RAG, semantic search, metadata-driven contextual retrieval
- Ontology: Ontology design, taxonomy, semantic relationships, graph schema
- Programming: Python
- Database: SQL
- Governance: Data Governance framework, stewardship, CDE/RDE, data quality, classification
Candidate Profile
The ideal candidate should be a hands-on technologist with strong understanding of Neo4j, Knowledge Graphs, and Data Cataloging, combined with the ability to work with business and governance teams. The candidate should be comfortable translating business metadata into graph-based catalog structures and enabling improved data discovery, lineage traceability, and AI-ready metadata consumption.
Education Qualification:
Graduation: Bachelor's degree in computer science, Information Technology, Data Engineering, Data Science, or related field/Post Graduation
Experience: 2 to 7 years of relevant experience.