
Role: Senior Software Engineer, Python (LLM Evaluation & Repository Validation)
Location: Remote
Skills required: Python, Git, and Docker
About the Role:
We are looking for experienced software engineers (tech lead level) who are familiar with high-quality public GitHub repositories and can contribute to this project. You should have experience working with well-maintained, widely used repos with 5000+ stars. This role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality.
What the day-to-day looks like:
Analyze and triage GitHub issues across trending open-source libraries.
Set up and configure code repositories, including Dockerization and environment setup.
Evaluate unit test coverage and quality.
Modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
Collaborate with researchers to design and identify repositories and issues that are challenging for LLMs.
Take on opportunities to lead a team of junior engineers on collaborative projects.
Required Skills:
Strong experience with Python.
Proficiency with Git, Docker, and basic software pipeline setup.
Ability to understand and navigate complex codebases.
Comfortable running, modifying, and testing real-world projects locally.
Experience contributing to or evaluating open-source projects is a plus.
Nice to Have:
Previous participation in LLM research or evaluation projects.
Experience building or testing developer tools or automation agents.
Interested candidates can reach out directly at [Confidential Information]
Job ID: 144626615