Talent.com
This job offer is not available in your country.
AI Agent Evaluation Analyst - AI Trainer

AI Agent Evaluation Analyst - AI Trainer

MindriftMY
10 hours ago
Job type
  • Quick Apply
Job description

At Mindrift , innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

The Mindrift platform, launched and powered by Toloka , connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for :

We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate.

Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for :

  • Analysts, researchers, or consultants with strong critical thinking skills.
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig.
  • People open to a part-time and non-permanent opportunity.

About the project :

We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.

What you’ll be doing :

  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
  • Identifying inconsistencies, missing assumptions, or unclear decision points.
  • Helping define clear expected behaviors (gold standards) for AI agents.
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly.
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage.
  • How to get started :

    Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

    Requirements

  • Excellent analytical thinking : Can reason about complex systems, scenarios, and logical implications.
  • Strong attention to detail : Can spot contradictions, ambiguities, and vague requirements.
  • Familiarity with structured data formats : Can read, not necessarily write JSON / YAML.
  • Can assess scenarios holistically : What's missing, what’s unrealistic, what might break?
  • Good communication and clear writing (in English) to document your findings.
  • We also value applicants who have :

  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.
  • Background in consulting, academia, olympiads (e.g. logic / math / informatics), or research.
  • Exposure to LLMs, prompt engineering, or AI-generated content.
  • Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).
  • Benefits

  • Get paid for your expertise, with  rates that can go up to $38 / hour  depending on your skills, experience, and project needs.
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.
  • Create a job alert for this search

    Ai • MY

    Related jobs
    • Promoted
    Product Development Engineer

    Product Development Engineer

    ZFMalaysia, Malaysia
    Data engineering roles in motorsport combine advanced analytics with race strategy, transforming raw telemetry data into decisive performance advantages. These positions are crucial in modern racing...Show moreLast updated: 1 day ago
    • Promoted
    DevOps AI Engineer (Python, Gen AI)

    DevOps AI Engineer (Python, Gen AI)

    DHL GermanyMalaysia, Malaysia
    With a global team of 6000+ IT professionals, DHL IT Services connects people and keeps the global economy running by continuously innovating and creating sustainable digital solutions.We work beyo...Show moreLast updated: 3 days ago
    • Promoted
    Product Specialist (Central +; Ketosteril)

    Product Specialist (Central +; Ketosteril)

    Agensi Pekerjaan Reeracoen Malaysia Sdn. Bhd.PahangMalaysia, Pahang, Malaysia
    Responsible for achievement of the assigned territory’s Sales Budget.Maintain, cultivate and expand existing customers in line with Company’s growth strategies. Organize and conduct CME (Continuing ...Show moreLast updated: 3 days ago
    AI Trainer - Agent Evaluation Infrastructure (MCP)

    AI Trainer - Agent Evaluation Infrastructure (MCP)

    MindriftMY
    Quick Apply
    We believe in using the power of collective human intelligence to ethically shape the future of AI.The Mindrift platform, launched and powered by. AI projects from innovative tech clients.Our missio...Show moreLast updated: 1 day ago
    • Promoted
    Sales Engineer

    Sales Engineer

    ALL GASES SDN BHDPahangMalaysia, Pahang, Malaysia
    All Gases Sdn Bhd is seeking a.You will be responsible for driving sales growth by delivering tailored technical solutions, building client relationships, and presenting value-driven proposals.Prep...Show moreLast updated: 1 day ago
    Freelance Mathematics - QA / AI Trainer

    Freelance Mathematics - QA / AI Trainer

    MindriftMY
    Quick Apply
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of En...Show moreLast updated: 1 day ago
    • Promoted
    Geophysicist

    Geophysicist

    OUS GroupPahangMalaysia, Pahang, Malaysia
    At OUS Group, we’re driven by discovery.As a growing force in Malaysia’s iron ore mining industry, we’re seeking a.This is a field-based role that will take you across Malaysia—from remote explorat...Show moreLast updated: 1 day ago
    • Promoted
    Research Engineer

    Research Engineer

    Monash University MalaysiaMalaysia, Malaysia
    Information Technology, Research (Admin Support).Duration : Fixed-term (1 year, renewable for another year).Remuneration : MYR 6133 / monthly. Amplify your impact at a world top 50 University.Be surro...Show moreLast updated: 30+ days ago
    • Promoted
    Accounts Receivable & Treasury Analyst (Japanese Speaker)

    Accounts Receivable & Treasury Analyst (Japanese Speaker)

    Roche Services (Asia Pacific) Sdn BhdMalaysia, Malaysia
    At Roche you can show up as yourself, embraced for the unique qualities you bring.Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted ...Show moreLast updated: 1 day ago
    • Promoted
    LILT | Cantonese Linguists for new AI project

    LILT | Cantonese Linguists for new AI project

    Lilt, Inc.Malaysia, Malaysia
    AI is changing how the world communicates — and LILT is leading that transformation.AI, machine translation, and human-in-the-loop. At LILT, we empower our teammates with leading tools, global colla...Show moreLast updated: 3 days ago
    Freelance SalesForce Expert - AI Trainer

    Freelance SalesForce Expert - AI Trainer

    MindriftMY
    Quick Apply
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of En...Show moreLast updated: 1 day ago
    Freelance Biology - Quality Assurance (AI Trainer)

    Freelance Biology - Quality Assurance (AI Trainer)

    MindriftMY
    Quick Apply
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. We believe in using the power of collective intelligence to ethica...Show moreLast updated: 1 day ago
    • Promoted
    Business Analyst

    Business Analyst

    MpowertsMalaysia, Malaysia
    Perform the requirements gathering with customer.Perform analysis on customer requirements.Design and document the functional details. Provide consultancy on the system product.Training will be prov...Show moreLast updated: 30+ days ago
    • Promoted
    Data Partner- Math- Chinese - Remote- Global

    Data Partner- Math- Chinese - Remote- Global

    TELUS Digital AI Data SolutionsMalaysia, Malaysia
    Data Partner- Math- Chinese - Remote- Global.TELUS Digital AI Data Solutions.We are seeking a Subject Matter Expert to design advanced, domain-specific questions and solutions and to create challen...Show moreLast updated: 30+ days ago
    • Promoted
    Cash and Trade Processing Analyst

    Cash and Trade Processing Analyst

    Citigroup Inc.PahangMalaysia, Pahang, Malaysia
    Whether you’re at the start of your career or looking to discover your next adventure, your story begins here.At Citi, you’ll have the opportunity to expand your skills and make a difference at one...Show moreLast updated: 3 days ago
    Freelance AI Solutions Engineer - Generative AI & Data Workflows

    Freelance AI Solutions Engineer - Generative AI & Data Workflows

    MindriftMY
    Quick Apply
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. Mindrift is looking for passionate freelance contributors to join ...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    AI Associate Engineer

    AI Associate Engineer

    LenovoMalaysia
    Assist in gathering, cleaning, and preprocessing datasets for AI model training.Perform exploratory data analysis (EDA) to uncover insights and patterns from data. Help integrate AI models into appl...Show moreLast updated: 11 hours ago
    • Promoted
    Transaction Monitoring (TM) Analyst (Petaling Jaya)

    Transaction Monitoring (TM) Analyst (Petaling Jaya)

    KPMG in MalaysiaMalaysia, Malaysia
    Conduct first level of review of the regenerated cases in TM System.Conduct independent investigations and assessments of regenerated TM alerts on a daily basis to identify potential risks related ...Show moreLast updated: 1 day ago