Agentic Coding Annotator (Online/Offline Tasks)

Department Icon IT / Software Development & Related
102+ Applicants
Posted: 2 weeks ago
5-7 years
Kolkata, West Bengal
work from office

Posted: 2 weeks ago
|
Applicants: 102+
Job Description
About Company
Similar Jobs
Please verify your account first! Send OTP

Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs

Job Description

About Turing:

Turing is one of the worlds fastest-growing AI companies, accelerating the advancement and deployment of powerful AI systems. Turing helps customers in two ways: working with the worlds leading AI labs to advance frontier model capabilities in thinking, reasoning, coding, agentic behavior, multimodality, multilinguality, STEM, and frontier knowledge; and leveraging that work to build real-world AI systems that solve mission-critical priorities for companies.

Role Overview:

  • We are looking for strong, detail-oriented software practitioners to help evaluate and improve datasets for agentic coding models.
  • This role involves working with realistic coding tasks in an agentic coding harness, reviewing model trajectories, verifying solutions, and producing high-quality annotations.

Depending on the assignment, the work may include:

  • Online evaluations: Manually interacting with blinded models on predefined tasks, then ranking and grading resulting trajectories
  • Offline evaluations: Designing realistic coding tasks, calibrating them through user simulation, writing task-specific rubrics, and grading generated trajectories

This is not a basic annotation role. Candidates are expected to read and debug code, validate behavior, follow detailed process rules, and make consistent judgment calls across model runs.

We are specifically looking for candidates with enough engineering maturity to independently work on realistic software tasks, not just toy problems or shallow code-review exercises.

What does day-to-day look like:

  • Execute realistic coding tasks within the assigned agentic coding harness while maintaining model blindness and session independence
  • Follow task instructions, milestones, planned interactions, and evaluation guardrails consistently across runs
  • Verify model outputs by reading code, running commands, checking logs, and inspecting generated artifacts
  • Perform targeted validation of outputs using tests, scripts, and manual checks
  • Write clear, specific, evidence-based rationales for trajectory rankings and assessments
  • Design multi-step, realistic coding tasks (offline work), including user intent and milestone structure
  • Create and refine task-specific rubrics and binary evaluation criteria
  • Review completed work for quality, completeness, consistency, and schema compliance
  • Identify and escalate broken environments, unclear instructions, or process gaps with clear supporting evidence

Requirements:

  • Software Engineering Fluency (Mandatory)
  • 5+ years of experience in software engineering, QA, developer tooling, data/ML engineering, or similar code-heavy roles
  • Strong hands-on experience in at least 1–2 programming languages or ecosystems
  • Representative languages include: Python, JavaScript/TypeScript, Rust, Java, C/C++, Bash/CLI environments, Haskell, Swift, SQL, or other production-relevant ecosystems

Ability to:

  • Read and understand unfamiliar codebases
  • Run and interpret tests, scripts, and CLI tools
  • Debug issues and reason about edge cases or partial fixes
  • Evaluate whether an implementation is functionally correct

Looking to get Placed? Try our Placement Guarantee Plan

Additional Preferred Qualifications (Offline / Senior Candidates):

  • Strong Docker skills and experience building/debugging reproducible environments
  • Experience working in large, complex repositories (not just small or greenfield projects)
  • Demonstrated originality and sound engineering judgment in defining technical problems
  • Ability to design realistic, non-trivial tasks that go beyond tutorials, README flows, or simple bug fixes

Perks of Freelancing With Turing:

  • Work on cutting-edge AI projects with leading foundation model companies
  • Collaborate on high-impact work at the frontier of LLM evaluation and reasoning
  • Remote, flexible opportunities with global teams
  • Competitive compensation based on experience and project scope

Offer Details:

  • Commitments Required: 8 hours per day with a 4-hour overlap with PST.
  • Employment Type: Contractor position (Note: this role does not include medical/paid leave).
  • Duration of Contract: 5 weeks; [expected start date is next week].

Skills

CPythonDebuggingJavaJavascriptDeveloperSql

If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.

About Company

Turing is a company that connects top remote developers with companies worldwide. They aim to empower developers and help companies find talent.

Important dates & deadlines?

Application Deadline

31 Jul 26, 03:50 PM IST

Similar Jobs

View All
Loading...
Bag Logo
Jobaaj
Don't Miss out any Updates

Subscribe now for the latest job alerts
and never miss an update

Job Alert
Google hiring for Specific Roles Apply Now!
1 min ago
New Opportunity
Amazon is hiring freshers Apply Now!
5 min ago
Featured Jobs
Microsoft opening 50+ positions Apply Now!
10 min ago

Agentic Coding Annotator (Online/Offline Tasks)

Share with