[Remote] Remote Software Engineer – AI Research & Evaluation
Note: The job is a remote job and is open to candidates in USA. Turing is a leading research accelerator for frontier AI labs, based in San Francisco, California. They are seeking a Remote Software Engineer – AI Research & Evaluation to create datasets for training and evaluating large language models, collaborating closely with researchers to enhance AI-driven coding solutions.
Responsibilities
- Work on AI model training initiatives by curating code examples, building solutions, and correcting code — primarily in Python, with additional work in JavaScript (including ReactJS), C/C++, Java, Rust, and Go
- Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable
- Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks
- Build agents and automated verification tools in Python that can verify the quality of code and identify error patterns
- Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
- Design verification mechanisms that can automatically verify a solution to a software engineering task
Skills
- Several years of software engineering experience (3 years or more)
- Strong expertise in Python with deep knowledge of frameworks, tooling, and best practices for building production-grade software
- Experience building full-stack applications and deploying scalable software using modern languages and tools
- Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
- Excellent oral and written communication skills for clear, structured evaluation rationales
- Engineers who have worked at the frontier of AI — at companies like OpenAI, NVIDIA, Databricks, Palantir, Snowflake, or similar organizations pushing the boundaries of intelligent systems
- Graduates from programs with strong CS foundations such as University of Washington, University of Illinois Urbana-Champaign, UT Austin, University of Michigan, Purdue, and comparable institutions
Benefits
- Flexible engagement, minimum 10 hrs/week, up to 40 hrs/week
- Type: Contractor (no medical/paid leave)
Company Overview