Search by job, company or skills

Zoetis

Data Engineer, ZTD R&D

5-8 Years
Save
  • Posted 5 days ago
  • Over 50 applicants
Quick Apply

Job Description

POSITION RESPONSIBILITIES

Business Analysis

• Autonomously perform Business Analysis as needed for data-based use cases, including open interaction with SMEs to answer questions and seek feedback, early analysis investigation spikes to analyze preliminary datasets, and phase-based design planning for implementation.

• Creates accurate entity mappings for data integrations and transformations, including business rules

• Creates entity relationship diagrams to document a variety of data development cases, including process flow, entity source to target, and database diagrams

• Define scope, approach, next steps, and direction with regard to assigned project objectives, and consult with technical lead for internal and business objective alignment

• Determine appropriate frequency and mode of needed data staging, from near-real-time streaming to daily wipe-reload

• Determine appropriate means of provisioning data, from direct file or analysis table consumption to visualization 15%

Data Analysis

• Analyze incoming data (from raw sets, database tables, APIs, and other sources) rapidly to understand its potential, needed manipulations, and issues.

• Perform data cleanup activities without direction

• Profile datasets, detecting anomalies, needed alignments, and patterns

• Proactively determine probable rules, and confirm unknown concepts with domain technical owner

• Understands AI tools and technologies; able to leverage pre-trained models for daily tasks.

• Provide L3/L4 support for production data issues including advanced query capabilities and ETL pattern knowledge/familiarity to troubleshoot datasets and ETLs, and direct contracted resources as needed to remediate issues. 25%

Data Development

• Design and develop data solutions from prototype to production, largely self-driven, with a particular focus on rapid ETL development, including integrations, data processing, and analyses, as well as limited data apps and visualizations

• Automate pipelines using scripting and ETL tools

• Manage and monitors pipeline execution

• Design and implement error handling capabilities

• Setup simple orchestration jobs

• Perform data migration analysis tasks, such as evaluation of current sets, mapping, and process flow diagramming

• Own and oversee as needed execution of data migration tasks as part of system deployment

• Diagnose, handle, and manage data migration issues

• Understands ACID principles for databases

• Understands and applies different approaches for data loading based on the scenario, such as wipe/reload, upsert, and CDC

• Recognizes and can approach varying latencies of data loading, from batch to near-realtime

• Oversee and work with contract resources to fulfill design and execution for data products.

• Assume responsibility for the overall quality of delivered products.

• Understand and apply key Agile concepts like failing fast and minimum viable product.

• Participate in project management activities like daily stand-up meetings, sprint reviews, etc.

• Assume responsibility for own assigned tasks and reaching out for clarity,

• Create tasks as needed and delegate where appropriate

• Document technical design specifications

• Interpret and diagnose existing legacy or inherited code for problems and proposed remediation

• Unit test workflows extensively to minimize rework

• Peer review with others to receive and provide advice and insight

• Regression test developed items to verify continuous coverage of existing functionality

• Design, execute, and oversee test script execution and automation

• Define deployment/installation documentation

• Execute or oversee deployment, including installation, installation verification, and hypercare

• Document products consistently 50%

Data Architecture

• Recognize key VMRD data entities and understand how they relate to others, to extend the value and linkability of data across use cases.

• Know and apply approaches and sources to acquire additional metadata for key VMRD entities

• Understands basic Master Data Management principles

• Assure adherence to basic data security approaches, including app, visualization, database, and file security

• Understand and apply role-based security approaches and inheritance

• Recognize and safely handle sensitive data such as Personal Information and Intellectual Property

• Good understanding of the technical impacts of GxP-related systems and processes

• Build and continually enhance knowledge of both the technical and business functional landscape for VMRD

• Actively interface with Systems Engineers on app-related projects

• Collaborate with colleagues to continually enhance process and knowledge 10%

ORGANIZATIONAL RELATIONSHIPS

• ZTD R&D Solution Partners

• ZTD R&D Systems Engineers

• ZTD Centers Of Excellence

• VMRD business SMEs from multiple product lines and departments

Supervision

0-4 contingent workers technical direction

EDUCATION AND EXPERIENCE

• Undergraduate degree related to information technology and/or computer science or equivalent education and work experience required.

• 5-8+ years experience with the design, building, and supporting of rapid data development (3-6+ with Master's degree)

• Experience with utilizing multiple vendors and/or departments for service and support activities.

• Excellent interpersonal and communication skills with the ability to build relationships.

• Experience in coordinating activities with multidisciplinary teams distributed in many physical locations with different time zones.

• Ability to prioritize issues and drive progress in ambiguous situations.

TECHNICAL SKILLS REQUIREMENTS

• Familiarity with structured and unstructured data approaches

• Familiarity with structuring data to serve analytic needs, both for visualizations and data science use cases

• Expert experience with rapid ETL tools, including Alteryx/KNIME and Python/R

• Strong experience deploying rapid ETLs from PoC to production

• Experience with scripted languages such as Powershell, Python, and R in an automation environment

• General experience with SQL Server and/or Oracle databases

• Strong experience with SQL query writing

• Working knowledge of T-SQL and/or PL/SQL

• Expert experience with data analysis and troubleshooting

• Basic statistics knowledge

• Some experience with Power BI and/or Tableau

• Strong experience with Microsoft Excel for basic ad hoc data purposes as well as as a data source

• Some experience using pretrained GenAI tools for code acceleration

• Experience with software testing including unit, integration, and regression testing

• Experience with technical writing for SDLC documentation

Project Management

• Experience working within a Solution Delivery Lifecycle Management framework

• Experience with Agile and familiarity with Waterfall development methodologies

• Ability to self-manage targeted projects, and create and delegate tasks

About Company

Job ID: 108519141

Similar Jobs

Hyderabad, India

Skills:

RDSPysparkAWS GlueDynamodbEmrSqlLambdaCloudwatchIamSqsSnsPythonAWSLake FormationCloudTrail

Hyderabad

Skills:

PowershellData EngineerEtl ToolsPythonData Analysis

Hyderabad, India

Skills:

Data QualityRDBMSSystem DesignData GovernanceSqlPythonEtlNo-SQLMaster Data Management

Hyderabad, India

Skills:

Oracle SqlKafkaPl SqlNlpNeo4jShell scriptingPythonAWSHadoopScalaApache SparkUnix CommandAutosysSqlHiveGcpSparkMongoDBAzureADKAgentic AI frameworksH2OCI CD pipelinesSparkling Watersinfrastructure as code

Hyderabad, India

Skills:

Data CleaningPowerbiData VisualizationTableauSqlPythonData TransformationRData Pipeline BuildingData Validation