Back to Jobs
XX
Senior Software Engineer for LLM EvaluationConfidentialUnited States

This job offer is no longer available

XX

Senior Software Engineer for LLM Evaluation

Confidential
  • US
    United States
  • US
    United States

About

Role Overview
As a Software Engineering evaluator, you will play a crucial role in creating advanced datasets for training, benchmarking, and enhancing large language models. This position involves collaborating closely with researchers to curate code examples, provide precise solutions, and refine AI-generated code across various programming languages, ensuring the development of reliable and efficient AI-driven coding solutions.

Key Responsibilities
  • Curate code examples, build solutions, and correct code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go for AI model training initiatives.
  • Evaluate and refine AI-generated code to ensure efficiency, scalability, and reliability.
  • Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
  • Develop agents to verify code quality and identify error patterns.
  • Hypothesize on software engineering cycle steps (such as prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and assess model capabilities.
  • Design verification mechanisms to automatically validate solutions to software engineering tasks.
Qualifications
  • Minimum of 3 years of software engineering experience.
  • Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools.
  • Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
  • Excellent oral and written communication skills for clear, structured evaluation rationales.
Work Terms
  • Commitment : Flexible engagement, minimum 10 hours/week, up to 40 hours/week.
  • Type : Contractor (no medical/paid leave).
  • Duration : 1 month (potential extensions based on performance and fit).
  • Location : Candidates must be based in the US, Canada, or WEU countries (Austria, Belgium, France, Germany, etc.).
Eligibility
  • The application process takes 15-30 minutes.
  • Completion of an AI video interview is required.
  • United States

Languages

  • English
Notice for Users

This job was posted by one of our partners. You can view the original job source here.