Job Description
Project Role : Large Language Model Architect
Project Role Description : Architect large language models (LLM) that can process and generate natural language. Design neural network parameters, trained on large quantities of unlabeled text data.
Must have skills : Large Language Models (LLMs)
Good to have skills : NA
Minimum 3 Year(s) Of Experience Is Required
Educational Qualification : 15 years full time education
Summary: As a Large Language Model Architect, you will engage in the innovative design and architecture of large language models that are capable of processing and generating natural language. Your typical day will involve collaborating with cross-functional teams to define model specifications, experimenting with various neural network architectures, and analyzing the performance of models trained on extensive datasets. You will also be responsible for troubleshooting and optimizing model performance, ensuring that the solutions you develop meet the highest standards of accuracy and efficiency. Your role will be pivotal in advancing the capabilities of natural language processing technologies within the organization. Roles & Responsibilities: - Expected to perform independently and become an SME. - Required active participation/contribution in team discussions. - Contribute in providing solutions to work related problems. - Collaborate with data scientists and engineers to refine model architecture and improve performance. - Conduct experiments to evaluate the effectiveness of different model configurations and training methodologies. Professional & Technical Skills: - Must To Have Skills: Proficiency in Large Language Models. - Strong understanding of neural network architectures and their applications in natural language processing. - Experience with data preprocessing techniques for large datasets. - Familiarity with programming languages such as Python and frameworks like TensorFlow or PyTorch. - Ability to analyze and interpret model performance metrics to drive improvements. Additional Information: - The candidate should have minimum 3 years of experience in Large Language Models. - This position is based at our Bengaluru office. - A 15 years full time education is required.