Overview: We are seeking a highly skilled and experienced Senior Generative AI Data Scientist to join our dynamic team. The ideal candidate will be passionate about pushing the boundaries of AI innovation, with a deep understanding of Python development and a proven track record in the Generative AI space.
Key Responsibilities
- Develop and maintain advanced Python-based applications in the Generative AI domain, ensuring high performance, reliability, and scalability
- Implement and optimize Generative AI models, including GPT, LLAMA, Mistral, FLAN T5 and other cutting-edge AI technologies, to create innovative solutions
- Development of advanced RAG pipelines with proper embeddings, indexing, chunking, reranking, prompts and evaluation
- Collaborate with cross-functional teams to integrate AI functionalities into broader systems and applications
- Utilize AWS/Azure/Databricks GPU machines to manage GPU memory effectively, maximizing performance and efficiency
- Stay updated on the latest advancements in Generative AI, Python development practices, and cloud services to continually enhance our AI capabilities
- Assist delivery leads in delivering Generative AI solutions to clients in a timely manner, ensuring client satisfaction and project success
Required Skills and Experience:
- Bachelor's or Master's degree in a quantitative field (CS, machine learning, mathematics, statistics) or equivalent experience.
- 6+ years of experience in data science.
- Experience with LLMs like Llama (1/2/3), Mistral, T5, Langchain or framework similar like Langchai)
- Candidate must be aware of entire evolution history of NLP (Traditional Language Models to Modern Large Language Models), training data creation, training set-up and finetuning
- Knowledge of advanced RAG pipelines with proper embeddings, indexing, chunking, reranking, prompts and evaluation
- Candidate must be comfortable interpreting research papers and architecture diagrams of Language Models
- Candidate must be comfortable with LORA, RAG, Instruct fine-tuning, Quantization, etc.
- Experience leading the end-to-end design, development, and deployment of predictive modeling solutions
- Excellent programming skills in Python. Strong working knowledge of Pythons numerical, data analysis, or AI frameworks such as NumPy, Pandas, Scikit-learn, Jupyter, etc.
- SQL skills with SQL Server and Spark experience is preferred but not necessary.
- Knowledge of predictive/prescriptive analytics including Machine Learning algorithms (Supervised and Unsupervised) and deep learning algorithms and Artificial Neural Networks.
- Experience with Natural Language Processing (NLTK) and text analytics for information extraction, parsing and topic modeling.
- Excellent verbal and written communication. Strong troubleshooting and problem-solving skills. Thrive in a fast-paced, innovative environment.
- Experience with cloud platforms such as Azure, AWS, Databricks is preferred