[Remote] Remote Software Engineer – AI Research & Evaluation
Note: The job is a remote job and is open to candidates in USA. Turing is a leading research accelerator for frontier AI labs based in San Francisco, California, and is seeking a Remote Software Engineer specializing in AI Research & Evaluation. The role involves creating datasets for training large language models, collaborating with researchers, and evaluating AI-generated code for efficiency and reliability.
Responsibilities
- Work on AI model training initiatives by curating code examples, building solutions, and correcting code across both Python and JavaScript (React, Node.js), with additional work in C/C++, Java, Rust, and Go
- Evaluate and refine AI-generated code across backend and frontend contexts to ensure that it is efficient, scalable, and reliable
- Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks
- Build agents that can verify the quality of the code and identify error patterns across full-stack applications
- Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
- Design verification mechanisms that can automatically verify a solution to a software engineering task
Skills
- Several years of software engineering experience (3 years or more)
- Strong expertise in building full-stack applications using Python and JavaScript (React, Node.js), with the ability to work across backend and frontend codebases
- Experience deploying scalable, production-grade software using modern languages and tools
- Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
- Excellent oral and written communication skills for clear, structured evaluation rationales
- Engineers who have built production systems at companies like Google, Microsoft, Apple, Amazon, Meta, or similar high-scale engineering organizations
- Graduates from leading programs such as Harvard, Columbia, Princeton, Yale, University of Pennsylvania, and comparable institutions
Benefits
- Flexible engagement, minimum 10 hrs/week, up to 40 hrs/week
- Contractor (no medical/paid leave)
- 1 month (potential extensions based on performance and fit)
- Candidates must be based in the United States
- The application process takes 15–30 minutes.
- Completion of an AI video interview is required.
- Refer them at turing.com/referrals, and earn money from your network.
Company Overview