About Us
Actualize is a trusted global partner in engineering and digital transformation, with 20+ years of domain expertise in the manufacturing, industrial, and automotive sectors. Our India-based team delivers smart, scalable solutions using AI/ML, IoT, cloud, and intelligent automation to modernize legacy systems and enable data-led innovation. Headquartered in Bangalore, we also operate from Pune, Ohio, and Munich.
Brief
We are looking for Data Engineer with hands-on experience in designing, building, and maintaining scalable data architectures capable of handling both structured data (numerical) and unstructured data (text, audio, images, video). This role spans cloud and onpremise environments, ensuring secure, reliable, and highquality data pipelines for AI/ML :
- Design and deploy scalable data architectures supporting structured (numerical) and unstructured (text, audio, image, video) data for advanced AI use cases.
- Apply strong technical expertise towards Build, maintain, and optimize ETL/ELT pipelines across enterprise systems such as SAP, Oracle, Data bricks, and other internal platforms.
- Prepare clean, highquality datasets for scientific AI, computer vision, predictive modelling, generative AI, and analytics/reporting solutions.
- Apply strong technical expertise in SAP data models, SQL, Data bricks, Spark, Python, and cloud/onprem data workflows.
- Implement data governance, quality checks, lineage tracking, and metadata management to ensure reliability and compliance.
- Collaborate with AI engineers, data scientists, and crossfunctional stakeholders to ensure data readiness for production AI solutions across cloud and onprem environments.
Requirements
Must-have Qualifications and Experience :
- Engineering Graduate/ Post Graduation (ME/MTech./MS/MBA) with a minimum of 6 years exp.
- Handson experience designing centralized storage and processing pipelines for computer vision image and video data, including largescale preprocessing workflows.
- Handson experience with image/video annotation platforms and using feature stores to organize and manage data
- Proficiency in Python, SQL, Spark/PySpark, Data bricks, and largescale data processing
- Handson experience working with enterprise systems such as Azure Data Lake, Data bricks, SAP, Oracle, or equivalent platforms.
- Handson experience with data modeling, pipeline optimization, data governance, and security best practices in cloud (Azure/AWS/GCP) and hybrid environments.
- Handson experience with building reliable ETL/ELT pipelines for AI/ML workloads.
- Experience with vector databases, embedding, and LLMbased data workflows
Benefits
- Challenging Work Environment : We provide a stimulating environment where you will have the opportunity to work on complex and meaningful projects.
- Global Impact : Be part of a multinational organization where your work and ideas contribute to both local and global success.
- Growth & Development : We are dedicated to helping you develop your career by providing opportunities to grow, take on new challenges, and lead initiatives that matter.
- Innovation Encouraged : Our culture supports challenging the Status Quo and fostering new ideas to build tools, frameworks, and applications that make a difference.
(ref:hirist.tech)