Talent.com
Evaluation Scenario Writer - AI Agent Testing Specialist
Evaluation Scenario Writer - AI Agent Testing SpecialistMindrift • Kota Bharu, Kelantan, Malaysia
Evaluation Scenario Writer - AI Agent Testing Specialist

Evaluation Scenario Writer - AI Agent Testing Specialist

Mindrift • Kota Bharu, Kelantan, Malaysia
8 hari lalu
Penerangan pekerjaan

Mindrift is looking for a freelance Agent Scenarios Designer based in the specified country. The role focuses on designing realistic and structured evaluation scenarios for LLM‑based agents, testing agent outputs, and refining tests. You will work on a flexible schedule and receive pay up to $38 / hr based on experience.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

About the Role

You will design realistic and structured evaluation scenarios, create test cases that simulate human‑performed tasks, and define gold‑standard behavior to compare agent actions against. Your work will ensure each scenario is clearly defined, well‑scored, and easy to execute and reuse. You need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Responsibilities

  • Design structured test scenarios based on real‑world tasks
  • Define the golden path and acceptable agent behavior
  • Annotate task steps, expected outputs, and edge cases
  • Work with developers to test scenarios and improve clarity
  • Review agent outputs and adapt tests accordingly

How to Get Started

Apply to this posting, qualify, and you’ll have the chance to contribute to projects aligned with your skills on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.

Requirements

  • Bachelor’s and / or Master’s degree in Computer Science, Software Engineering, Data Science / Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / NLP, Information Systems or related fields
  • Background in QA, software testing, data analysis, or NLP annotation
  • Good understanding of test design principles (e.g., reproducibility, coverage, edge cases)
  • Strong written communication skills in English
  • Comfortable with structured formats like JSON / YAML for scenario description
  • Can define expected agent behaviors (gold paths) and scoring logic
  • Basic experience with Python and JavaScript
  • Curious and open to working with AI‑generated content, agent logs, and prompt‑based behavior
  • Ready to learn new methods, able to switch between tasks and topics quickly, and sometimes work with challenging, complex guidelines
  • Fully remote freelance role – only requires a laptop, internet connection, available time, and enthusiasm to take on a challenge
  • Nice to Have

  • Experience in writing manual or automated test cases
  • Familiarity with LLM capabilities and typical failure modes
  • Understanding of scoring metrics (precision, recall, coverage, reward functions)
  • Benefits

  • Get paid for your expertise, with rates up to $38 / hr depending on your skills, experience, and project needs
  • Participate in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Gain valuable experience to enhance your portfolio through an advanced AI project
  • Influence how future AI models understand and communicate in your field of expertise
  • #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    Evaluation Writer Ai • Kota Bharu, Kelantan, Malaysia

    Pekerjaan berkaitan
    AI-Powered Full-Stack Engineer — Prototyping to Production

    AI-Powered Full-Stack Engineer — Prototyping to Production

    Mindvalley, Inc. • Kota Bharu, Kelantan, Malaysia
    A leading innovative technology company based in Malaysia is seeking a Senior Full Stack Engineer focused on AI product development. You will work closely with the Innovation Team to create impactfu...Tunjukkan lagi
    Kemas kini terakhir: 13 jam yang lalu • Dinaikkan pangkat • Baharu!
    Freelance Software Developer (Ruby) - Quality Assurance (AI Trainer)

    Freelance Software Developer (Ruby) - Quality Assurance (AI Trainer)

    Mindrift • Kota Bharu, Kelantan, Malaysia
    Freelance Software Developer (Ruby) - Quality Assurance (AI Trainer).Be among the first 25 applicants.This opportunity is only for candidates currently residing in the specified country.Your locati...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Project Manager - Remote, Cantonese Speaker ( Mobile Apps / Web / AI Solution)

    Project Manager - Remote, Cantonese Speaker ( Mobile Apps / Web / AI Solution)

    REDSO INNOVATION SDN. BHD. • Kota Bharu, Kelantan, Malaysia
    This is an exciting opportunity to join REDSO INNOVATION SDN.Project Manager (Mobile Apps / Web / AI Solution).In this full-time fully remote role, you will be responsible for leading the successfu...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Hybrid ML Engineer - Diffusion & Vision (Remote-Friendly)

    Hybrid ML Engineer - Diffusion & Vision (Remote-Friendly)

    Bjak • Kota Bharu, Kelantan, Malaysia
    A leading AI company in Malaysia seeks a Machine Learning Engineer to develop cutting-edge generative vision features.You'll customize diffusion models and build large-scale datasets while collabor...Tunjukkan lagi
    Kemas kini terakhir: 13 jam yang lalu • Dinaikkan pangkat • Baharu!
    B2B Content Strategist

    B2B Content Strategist

    Starfish • Kota Bharu, Kelantan, Malaysia
    A leading Singaporean Telecoms Client is seeking an experienced.Full-time, Remote) with a strong background in content management, strategy, and copywriting. You will lead projects that enhance the ...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Remote Ruby QA Engineer for AI Training (Part‑Time)

    Remote Ruby QA Engineer for AI Training (Part‑Time)

    Mindrift • Kota Bharu, Kelantan, Malaysia
    A technology company in Malaysia is seeking a part-time Freelance Software Developer (Ruby) to work on AI projects.Candidates should have a Bachelor's or Master's degree and at least 3 years of exp...Tunjukkan lagi
    Kemas kini terakhir: 13 jam yang lalu • Dinaikkan pangkat • Baharu!
    Remote Chinese-Speaking E-commerce Conversion Specialist

    Remote Chinese-Speaking E-commerce Conversion Specialist

    Airswift • Kota Bharu, Kelantan, Malaysia
    A leading customer service provider is seeking an Associate for a fully remote role within Malaysia, focusing on e-commerce support. The ideal candidate is fluent in Chinese and English, with at lea...Tunjukkan lagi
    Kemas kini terakhir: 13 jam yang lalu • Dinaikkan pangkat • Baharu!
    Freelance Luxury Brand Evaluator in Pahang, Malaysia

    Freelance Luxury Brand Evaluator in Pahang, Malaysia

    CXG group • Kota Bharu, Kelantan, Malaysia
    Turn your passion for luxury into a career opportunity! Explore the world of premium brands and make a lasting impact in fashion, beauty, jewelry, or automobiles. Join CXG, the global leader in cust...Tunjukkan lagi
    Kemas kini terakhir: 6 hari yang lalu • Dinaikkan pangkat
    Azure Architect (AI Adoption / Security)

    Azure Architect (AI Adoption / Security)

    Softenger (Malaysia) Sdn Bhd • Kota Bharu, Kelantan, Malaysia
    Job Title : AI Architect (Adoption / Security).We are hiring for key roles to support a major enterprise‑scale AI transformation program. Candidates will work closely with business and IT teams to driv...Tunjukkan lagi
    Kemas kini terakhir: 4 hari yang lalu • Dinaikkan pangkat
    Remote ServiceNow AI Trainer : Shape Next-Gen Automation

    Remote ServiceNow AI Trainer : Shape Next-Gen Automation

    Mindrift • Kota Bharu, Kelantan, Malaysia
    A technology solutions company is looking for a Freelance ServiceNow Consultant to join as an AI Trainer.This remote role involves transforming intents into agent steps, defining dialogue flows, an...Tunjukkan lagi
    Kemas kini terakhir: 5 hari yang lalu • Dinaikkan pangkat
    AI-Driven Software PM & BA - Remote

    AI-Driven Software PM & BA - Remote

    Bridzia Sdn Bhd • Kota Bharu, Kelantan, Malaysia
    A leading software house in Kuala Lumpur is seeking a Project Manager / Business Analyst to lead software development teams specializing in e-commerce solutions. The successful candidate will manage...Tunjukkan lagi
    Kemas kini terakhir: 2 hari yang lalu • Dinaikkan pangkat
    Senior Full Stack Engineer (AI-Native) - Contract Role

    Senior Full Stack Engineer (AI-Native) - Contract Role

    Mindvalley, Inc. • Kota Bharu, Kelantan, Malaysia
    We’re looking for a Senior Full Stack Engineer who is highly trained in AI — someone who can.This is not a typical engineering role. You will operate like a technical co‑founder, working directly wi...Tunjukkan lagi
    Kemas kini terakhir: 13 jam yang lalu • Dinaikkan pangkat • Baharu!
    AI Evaluator - Cantonese (Chinese) - Malaysia

    AI Evaluator - Cantonese (Chinese) - Malaysia

    CrowdGen by Appen • Kota Bharu, Kelantan, Malaysia
    Join CrowdGen as we launch an exciting new AI Voice Interaction Project designed to help improve the way voice assistants understand and respond to users! We’re looking for detail-oriented contribu...Tunjukkan lagi
    Kemas kini terakhir: 8 hari yang lalu • Dinaikkan pangkat
    AI Project Delivery Exec - Hybrid / Remote - Fast Growth

    AI Project Delivery Exec - Hybrid / Remote - Fast Growth

    Chemin AI • Kota Bharu, Kelantan, Malaysia
    A tech company specializing in AI solutions seeks a Project Delivery Executive in Kuala Lumpur to support AI data labeling projects. Responsibilities include project execution, documentation, and cl...Tunjukkan lagi
    Kemas kini terakhir: 1 hari yang lalu • Dinaikkan pangkat
    SaaS Enterprise Account Leader - AI-Driven Growth (Remote)

    SaaS Enterprise Account Leader - AI-Driven Growth (Remote)

    ServiceNow, Inc. • Kota Bharu, Kelantan, Malaysia
    An innovative tech company is seeking a seasoned salesperson to develop and maintain relationships with C-suite executives. This role requires over 7 years of experience in software sales, the abili...Tunjukkan lagi
    Kemas kini terakhir: 2 hari yang lalu • Dinaikkan pangkat
    Country Manager, Malaysia & Indonesia — AI & Enterprise Sales

    Country Manager, Malaysia & Indonesia — AI & Enterprise Sales

    Proto • Kota Bharu, Kelantan, Malaysia
    A leading AI solutions provider is seeking a Country Manager to oversee operations in Malaysia and Indonesia.The role involves engaging with prospects, managing a sales pipeline, and leading presen...Tunjukkan lagi
    Kemas kini terakhir: 13 jam yang lalu • Dinaikkan pangkat • Baharu!
    Remote Content Strategist for Social Ads & Creatives

    Remote Content Strategist for Social Ads & Creatives

    Alpha Iota BPO • Kota Bharu, Kelantan, Malaysia
    A leading BPO firm in Kuala Lumpur is seeking a Marketing Specialist to create and manage engaging ad campaigns.The role emphasizes creativity and collaboration with designers.Candidates should hav...Tunjukkan lagi
    Kemas kini terakhir: 1 hari yang lalu • Dinaikkan pangkat
    Bilingual Content Editor

    Bilingual Content Editor

    DataAnnotation • Kota Bharu, Kelantan, Malaysia
    We are looking for a bilingual Content Editor to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat