Careerflow.ai Logo

Careerflow.ai

Data Annotation Specialist - Computer Use Agents (CUA) Trajectory Evaluator

Posted 2 Hours Ago
Be an Early Applicant
Remote
Hiring Remotely in IN
Mid level
Remote
Hiring Remotely in IN
Mid level
Create, validate, and document step-by-step Computer-Use Agent (CUA) trajectories for technical developer workflows. Break down natural language instructions into reproducible actions, execute and test workflows in Linux using Python/Bash, interact with APIs and browser automation, and collaborate to improve annotation quality and guidelines.
The summary above was generated by AI

Role Overview:

We are looking for skilled professionals to contribute as S2 Annotators, responsible for producing and validating high-quality Computer-Use Agent (CUA) trajectories for developer-adjacent workflows. This includes tasks such as file operations, light scripting, API interactions, and browser automation. This role requires a strong understanding of technical workflows, attention to detail, and the ability to translate natural language instructions into precise, step-by-step executable actions that can be used to train advanced AI systems.

What does day-to-day look like

  • Create detailed, step-by-step positive CUA trajectories for technical tasks (e.g., file manipulation, scripting, API calls, browser-based workflows)

  • Break down natural language instructions into clear, verifiable actions

  • Validate and review trajectories for correctness, completeness, and reproducibility

  • Work within Linux desktop environments to execute and document workflows

  • Use scripting (Python/Bash) to simulate or validate task execution where required

  • Interact with tools and environments involving APIs, terminals, and browser automation

  • Collaborate with internal teams to refine task quality and annotation guidelines

  • Ensure consistency, accuracy, and high-quality standards across all annotations

Requirements

  • 2–5 years of experience in software development, technical support, or similar technical roles

  • Strong familiarity with Linux environments and command-line operations

  • Proficiency in at least one scripting language: Python or Bash

  • Ability to decompose complex instructions into structured, step-by-step workflows

  • Strong attention to detail in documenting technical processes

  • Exposure to LLM-based tools, AI systems, or agentic workflows

  • Basic understanding of APIs, file systems, and developer tooling

  • Familiarity with OpenClaw or similar environments/tools

Nice to have

  • Prior experience in data annotation, RLHF, or SFT labeling workflows

  • Exposure to CI/CD pipelines, REST APIs, or terminal-based automation

  • Experience working with browser automation tools or developer productivity tools

  • Background in evaluating or improving AI-generated outputs

Offer Details:

  • Engagement type: Contractor assignment/freelancer (no medical/paid leave)

  • Duration: 5 weeks

Evaluation Process:

  • Resume screening

  • Take home assessment (60 mins)

Similar Jobs

13 Days Ago
Easy Apply
Remote
Easy Apply
Mid level
Mid level
Big Data • Fintech • Mobile • Payments • Financial Services
As the CRA Compliance Lead, you will manage compliance strategies, enhance community engagement, analyze consumer complaints, and ensure alignment with regulatory expectations for Affirm Bank.
2 Hours Ago
Remote
Mid level
Mid level
Fintech • Payments • Financial Services
The Performance Marketing Executive will manage user acquisition campaigns, focusing on key markets, collaborate across teams, and optimize performance through data-driven strategies.
Top Skills: AdjustApple Search AdsAppsflyerBranchGoogle App CampaignsMetaTiktok
16 Hours Ago
Remote
Mid level
Mid level
Fintech • Payments • Financial Services
The Sales Executive will lead a team to meet sales targets, manage Direct Sales Representatives, and develop relationships while driving financial inclusion efforts.
Top Skills: Audit ManagementMarketingSales

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account