Job Responsibility:
- We are looking for a data scientist who will help us discover the information hidden in vast amounts of data, and help us make smarter decisions to deliver AI/ML based Enterprise Software Products.
- Develop solutions related to machine learning, natural language processing and deep learning & Generative AI to address business needs.
- Your primary focus will be in applying Language/Vision techniques, developing llm based applications and building high quality prediction systems.
- Analyze Data: Collaborate with cross-functional teams to understand data requirements and identify relevant data sources. Analyze and preprocess data to extract valuable insights and ensure data quality.
- Evaluation and Optimization: Evaluate model performance using appropriate metrics and iterate on solutions to enhance performance and accuracy. Continuously optimize algorithms and models to adapt to evolving business requirements.
- Documentation and Reporting: Document methodologies, findings, and outcomes in clear and concise reports. Communicate results effectively to technical and non-technical stakeholders.
Work experience background required:
- Experience building software from the ground up in a corporate or startup environment.
Essential skillsets required:
- 3-6 years experience in software development
- Educational Background: Strong computer science and Math/Statistics
- Experience with Open Source LLM and Langchain Framework and and designing efficient prompt for LLMs.
- Proven ability with NLP and text-based extraction techniques.
- Experience in Generative AI technologies, such as diffusion and/or language models.
- Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, etc.
- Familiarity with cloud computing platforms such as GCP or AWS. Experience to deploy and monitor model in cloud environment.
- Experience with common data science toolkits, such as NumPy, Pandas etc
- Proficiency in using query languages such as SQL
- Good applied statistics skills, such as distributions, statistical testing, regression, etc.
- Experience working with large data sets along with data modeling, language development, and database technologies
- Knowledge in Machine Learning and Deep Learning frameworks (e.g., TensorFlow, Keras, Scikit-Learn, CNTK, or PyTorch), NLP, Recommender systems, personalization, Segmentation, microservices architecture and API development.
- Ability to adapt to a fast-paced, dynamic work environment and learn new technologies quickly.
- Excellent verbal and written communication skills