Talent.com
Evaluation Scenario Writer - AI Agent Testing Specialist
Evaluation Scenario Writer - AI Agent Testing SpecialistMindrift • Kuala Selangor, Kuala Selangor, Malaysia
Evaluation Scenario Writer - AI Agent Testing Specialist

Evaluation Scenario Writer - AI Agent Testing Specialist

Mindrift • Kuala Selangor, Kuala Selangor, Malaysia
1 hari lalu
Penerangan pekerjaan

Mindrift is looking for a freelance Agent Scenarios Designer based in the specified country. The role focuses on designing realistic and structured evaluation scenarios for LLM‑based agents, testing agent outputs, and refining tests. You will work on a flexible schedule and receive pay up to $38 / hr based on experience.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

About the Role

You will design realistic and structured evaluation scenarios, create test cases that simulate human‑performed tasks, and define gold‑standard behavior to compare agent actions against. Your work will ensure each scenario is clearly defined, well‑scored, and easy to execute and reuse. You need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Responsibilities

  • Design structured test scenarios based on real‑world tasks
  • Define the golden path and acceptable agent behavior
  • Annotate task steps, expected outputs, and edge cases
  • Work with developers to test scenarios and improve clarity
  • Review agent outputs and adapt tests accordingly

How to Get Started

Apply to this posting, qualify, and you’ll have the chance to contribute to projects aligned with your skills on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.

Requirements

  • Bachelor’s and / or Master’s degree in Computer Science, Software Engineering, Data Science / Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / NLP, Information Systems or related fields
  • Background in QA, software testing, data analysis, or NLP annotation
  • Good understanding of test design principles (e.g., reproducibility, coverage, edge cases)
  • Strong written communication skills in English
  • Comfortable with structured formats like JSON / YAML for scenario description
  • Can define expected agent behaviors (gold paths) and scoring logic
  • Basic experience with Python and JavaScript
  • Curious and open to working with AI‑generated content, agent logs, and prompt‑based behavior
  • Ready to learn new methods, able to switch between tasks and topics quickly, and sometimes work with challenging, complex guidelines
  • Fully remote freelance role – only requires a laptop, internet connection, available time, and enthusiasm to take on a challenge
  • Nice to Have

  • Experience in writing manual or automated test cases
  • Familiarity with LLM capabilities and typical failure modes
  • Understanding of scoring metrics (precision, recall, coverage, reward functions)
  • Benefits

  • Get paid for your expertise, with rates up to $38 / hr depending on your skills, experience, and project needs
  • Participate in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Gain valuable experience to enhance your portfolio through an advanced AI project
  • Influence how future AI models understand and communicate in your field of expertise
  • #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    Evaluation Writer Ai • Kuala Selangor, Kuala Selangor, Malaysia

    Pekerjaan berkaitan
    Search Engine Optimization (SEO) Specialist

    Search Engine Optimization (SEO) Specialist

    Digital • Klang City, Selangor, Malaysia
    Search Engine Optimization (SEO) Specialist.Hire Digital is looking for an experienced.Senior SEO Specialist (Freelance, Remote). This role requires a strategic thinker who can drive end-to-end SEO ...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Aymara Language Expert - AI Trainer

    Aymara Language Expert - AI Trainer

    Invisible Expert Marketplace • Port Klang, Port Klang, Malaysia
    Be among the first 25 applicants.Are you an Aymara language expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into powerful tools for communicati...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Senior AI Full-Stack Developer - Python / React

    Senior AI Full-Stack Developer - Python / React

    OneSeven Tech (OST) • Klang City, Selangor, Malaysia
    OneSeven Tech (OST) is seeking a Senior Full-Stack Engineer to join our team.We are looking for someone who focuses on building and deploying full-scale applications while leveraging cutting-edge A...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Junior Content Writer

    Junior Content Writer

    EPS Consultants • Petaling Jaya, Selangor, MY
    Quick Apply
    Position : Content Writer ( VLLM).Tenure : 1 year contract ( Renewable basis).Salary : RM 3200- RM 3700 ( Subject to Experience). Location : Kuala Lumpur / Petaling Jaya.You will analyze images and write...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    Instructor for AI-Assisted Programming Workshops (part-time)

    Instructor for AI-Assisted Programming Workshops (part-time)

    TripleTen • Shah Alam, Shah Alam, Malaysia
    TripleTen Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Instructor – AI Course Trainer.We’re building an online course for experienced software developers on the practical use of Cursor...Tunjukkan lagi
    Kemas kini terakhir: 10 hari yang lalu • Dinaikkan pangkat
    AI Developer (Malaysia)

    AI Developer (Malaysia)

    Azeus Systems Limited • Kuala Lumpur, Malaysia
    You will analyze data sets to identify patterns, create frameworks and develop predictive algorithmic models to deliver intelligent systems that are capable of performing tasks that usually require...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    AI Data Specialist - Chinese

    AI Data Specialist - Chinese

    RWS TrainAI • Kuala Lumpur, Kuala Lumpur, Malaysia
    Flexible, work whenever you want.Until the end of December 2025 (an extension is possible).Are you a student, recent graduate, stay-at-home parent, gig worker, or professional seeking flexible remo...Tunjukkan lagi
    Kemas kini terakhir: 21 hari yang lalu • Dinaikkan pangkat
    Senior Business Analyst - Insurance, Cantonese Speaker (Fully Remote)

    Senior Business Analyst - Insurance, Cantonese Speaker (Fully Remote)

    CoverGo | Insurtech • Klang City, Selangor, Malaysia
    Working on the latest tech for the Insurtech Market Leader.At CoverGo, our mission is to empower all insurance companies to make insurance 100% digital and accessible to everyone.We are a leading g...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Lead Technical Author

    Lead Technical Author

    Talent Switch • Kuala Lumpur, Malaysia
    Lead the investigation, evaluation and implementation of next generation technologies, especially generative AI, to help drive productivity and quality gains in technical authoring.Define and execu...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    Product Manager - Growth and AI

    Product Manager - Growth and AI

    Flintex Consulting Pte Ltd • Kuala Lumpur, 14, my
    Quick Apply
    This role is to lead initiatives focused on client acquisition and AI-enhanced user experiences in our global mobile trading platform. This role is pivotal in driving growth by enhancing user onboar...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    Senior AI Research Engineer, Model Inference (100% Remote)

    Senior AI Research Engineer, Model Inference (100% Remote)

    Tether Operations Limited • Kuala Lumpur, 14, MY
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    AI BUSINESS ANALYST

    AI BUSINESS ANALYST

    Talent Switch • Kuala Lumpur, Malaysia
    Work with business stakeholders to identify and define use cases for AI solutions (e.Analyze current workflows and systems to determine where AI can enhance productivity, accuracy, or decision-maki...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    Data Architect

    Data Architect

    Two95 International Inc. • Kuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    Execute Enterprise data initiatives / programs to establish a governed, curated, and agile data ecosystem that enables business to make data-driven decisions. Translate strategic requirements in...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    Content Specialist

    Content Specialist

    Cartrack • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    We are a world-leading smart mobility SaaS company with over 2,000,000 subscribers across 23 countries and we are looking for a Content Specialist to join our team. Our teams are collaborative, vibr...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Senior Business & Data Analyst for (AI Enablement)

    Senior Business & Data Analyst for (AI Enablement)

    Unison Group • Kuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    We are seeking a dynamic and experienced Business & Data Analyst to drive the successful enablement of an Artificial Intelligence Program. This hybrid role combines analytical rigor with project...Tunjukkan lagi
    Kemas kini terakhir: 1 hari yang lalu
    Italian Voice Actor - AI Trainer

    Italian Voice Actor - AI Trainer

    Invisible Expert Marketplace • Port Klang, Port Klang, Malaysia
    Italian Voice Acting Specialist – AI Trainer.We’re looking for a highly skilled Italian voice acting professional to help build AI voice models. You’ll use cutting‑edge tools, record and evaluate It...Tunjukkan lagi
    Kemas kini terakhir: 8 hari yang lalu • Dinaikkan pangkat
    Freelance Chemistry Expert with Python - AI Trainer

    Freelance Chemistry Expert with Python - AI Trainer

    Mindrift • Port Klang, Port Klang, Malaysia
    Freelance Chemistry Expert with Python - AI Trainer.Be among the first 25 applicants.This opportunity is only for candidates currently residing in the specified country. Your location may affect eli...Tunjukkan lagi
    Kemas kini terakhir: 12 hari yang lalu • Dinaikkan pangkat
    (WFH) SEO Content Writer

    (WFH) SEO Content Writer

    Alpha Iota BPO Sdn Bhd • Kuala Lumpur, Malaysia
    Join Our Alpha Iota Family, Where Everyone Wins!.Exciting Work-from-Home Opportunities.Learning & Development Programs to Upskill Yourself. Health and Wellness Perks & Benefits.Motivating and Suppor...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat