Position Summary
A data scientist cum data engineer with 5+ years of experience with a deep understanding of data analysis, Big Data & Cloud technologies, AI & machine learning, GenAI and NLP techniques. You will be responsible for developing & building low level design as per approved Tech specs, extracting valuable insights from large datasets, building advanced AIML models and driving data-driven decision-making within the organization or for the enterprise customers. This role involves a combination of data analysis, model development, and strategic thinking to solve complex business problems. You will interact with key stakeholders and apply your technical proficiency in AIML, Python, R and algorithms. You will work across different stages of the development project using data science & technologies to provide solutions and interface directly with enterprise customers for the Adobe Experience Platform.
What you'll do
- Interface with Adobe customers to gather requirements, map solutions & make recommendations. Document projects with clear business objectives, provides data gathering & data preparation, final algorithm, detailed set of results
- Experience in Natural Language Processing, GenAI and Image processing
- Support the execution of Data Science solutions to business problems
- Innovate on new ideas to solve customer needs & assist to create GTM strategies for new solutions
- Experience in AIML modeling like propensity models, clusterings, regression, etc.
- Developing ETL pipelines involving big data.
- Developing data processinganalytics applications primarily using PySpark.
- Experience of developing applications on cloud(AWS/Azure/GCP) mostly using services related to storage, compute, ETL, DWH, Analytics and streaming.
- Clear understanding and ability to implement distributed storage, processing and scalable applications.
- Experience in working with SQL and NoSQL database.
- Ability to write and analyze SQL, HQL and other query languages for NoSQL databases.
- Proficiency in writing disitributed & scalable data processing code using PySpark, Python and related libraries.
- Experience in developing applications that consume the services exposed as ReST APIs.
- Understand and clean datasets, interpret outputs and make them comprehensible to understand for other teams.
- Strong collaboration with consultants onshore & offshore
- Create reusable statistical models & modify existing algorithms
- Report on customer trends and deployment performance and identify areas that we can use target using ML/Data science solutions.
Requirements
- 2+ years of experience & knowledge with Web Analytics or Digital Marketing.
- 5+ years of experience in AIML role, with a focus on building data pipelines for conducting data intensive analysis
- 5+ years of experience with Machine Learning and alogrithims for classification, clustering, prediction, recommendations and NLP
- 5+ years of experience with common Data Science Toolkits (i.e. R, Jupyter Notebooks, PySpark etc.)
- 5+ years of enterprise development using Python
- Strong understanding of GenAI and LLM
- Ability to enhance Standard Algorithms is required
- Knowledge & experience using Amazon Sagemaker, Microsoft Azure ML or Google AI technologies
- 5+ years of complex SQL experience
- 5+ years of Data Modeling experience
- Demonstrate exceptional organizational skills and ability to multi-task simultaneous different customer projects
- Strong verbal & written communication skills to lead customers to a successful outcome and explain complex technical concepts to non-technical stakeholders.
- Excellent problem-solving and critical-thinking skills.
- Experience with big data technologies and cloud platforms (e.g., AWS, Azure, Google Cloud)
- Must be self-managed, proactive and customer focused
- Degree in Computer Science, Information Systems or related field
- Should be able to work in teams
Special Consideration given for
- Experience & knowledge with Adobe Experience Cloud solutions
- Experience & knowledge with Web Analytics or Digital Marketing