Dieses Stellenangebot ist nicht mehr verfügbar
Über
- Title:
Senior Software Engineer (Remote) - Engagement:
Hourly contract (independent contractor) - Location:
Remote – candidates based in the United States, United Kingdom, Canada, Europe, Singapore, Dubai, or Australia who are legally able to work as independent contractors in their jurisdiction.
We are hiring for one of our clients a skilled software engineer to support the evaluation, improvement, and benchmarking of large language models (LLMs). This role focuses on building high-quality code, designing rigorous technical evaluations, and creating reliable workflows that generate high-signal data for AI training.
Role Overview
You will work in a coding-focused, research-driven environment that partners closely with advanced AI research teams. The role involves writing and debugging production-quality code, designing structured evaluations, and analyzing AI model behavior to improve reliability, correctness, and reasoning performance.
This position is ideal for engineers who enjoy solving complex technical problems, investigating subtle system failures, and working in environments where precision, reproducibility, and collaboration are critical.
Key Responsibilities
- Write, review, and debug production-quality code across multiple programming languages
- Design coding, reasoning, and debugging tasks for AI model evaluation
- Analyze AI-generated outputs to identify hallucinations, regressions, and failure patterns
- Build reproducible development environments using Docker and automation tools
- Develop scripts, pipelines, and tooling for data generation, scoring, and validation
- Produce structured annotations, judgments, and high-quality datasets
- Run systematic evaluations to improve model reliability and reasoning
- Collaborate with engineers, researchers, and quality stakeholders to align on standards and continuously improve quality
Required Skills
- Strong hands-on software engineering experience (professional or research-based) in one or more of the following:
- Python
- JavaScript /
- TypeScript
- Experience with Linux, Bash scripting, and automation
- Proficiency with Docker, reproducible environments, and development containers
- Strong Git skills, including branching, diffs, patches, and conflict resolution
- Solid understanding of testing and quality assurance practices (unit, integration, edge-case testing)
- Ability to overlap a portion of working hours with Pacific Time
Preferred Background
- Bachelor's degree in a technical field with approximately 6+ years of relevant experience, or
- Master's degree in a technical field with approximately 4+ years of relevant experience, or
- PhD in a technical field with approximately 2+ years of relevant experience
Equivalent practical experience will also be considered.
Engagement Details
- Work type:
Remote - Engagement model:
Independent contractor - Time commitment:
20–40 hours per week (flexible options available) - Availability:
Ability to overlap at least 4 hours per day with Pacific Time - Contract duration:
Approximately 3 months, with potential extension based on performance and project needs
Compensation
- Compensation is competitive and provided on an hourly or project basis, commensurate with experience, skills, and scope of work. Details will be shared during the interview process.
Evaluation Process
- Two interview stages, including a technical interview and a discussion focused on collaboration, expectations, and role fit
Sprachkenntnisse
- English
Dieses Stellenangebot wurde von einem unserer Partner veröffentlicht. Sie können das Originalangebot einsehen hier.