Talent.com
Evaluation Scenario Writer - AI Agent Testing Specialist

Evaluation Scenario Writer - AI Agent Testing Specialist

MindriftIskandar Puteri, Johor, Malaysia
5 hari lalu
Penerangan pekerjaan

Mindrift is looking for a freelance Agent Scenarios Designer based in the specified country. The role focuses on designing realistic and structured evaluation scenarios for LLM‑based agents, testing agent outputs, and refining tests. You will work on a flexible schedule and receive pay up to $38 / hr based on experience.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe.

About the Role

You will design realistic and structured evaluation scenarios, create test cases that simulate human‑performed tasks, and define gold‑standard behavior to compare agent actions against. Your work will ensure each scenario is clearly defined, well‑scored, and easy to execute and reuse. You need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Responsibilities

  • Design structured test scenarios based on real‑world tasks
  • Define the golden path and acceptable agent behavior
  • Annotate task steps, expected outputs, and edge cases
  • Work with developers to test scenarios and improve clarity
  • Review agent outputs and adapt tests accordingly

How to Get Started

Apply to this posting, qualify, and you’ll have the chance to contribute to projects aligned with your skills on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.

Requirements

  • Bachelor’s and / or Master’s degree in Computer Science, Software Engineering, Data Science / Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / NLP, Information Systems or related fields
  • Background in QA, software testing, data analysis, or NLP annotation
  • Good understanding of test design principles (e.g., reproducibility, coverage, edge cases)
  • Strong written communication skills in English
  • Comfortable with structured formats like JSON / YAML for scenario description
  • Can define expected agent behaviors (gold paths) and scoring logic
  • Basic experience with Python and JavaScript
  • Curious and open to working with AI‑generated content, agent logs, and prompt‑based behavior
  • Ready to learn new methods, able to switch between tasks and topics quickly, and sometimes work with challenging, complex guidelines
  • Fully remote freelance role – only requires a laptop, internet connection, available time, and enthusiasm to take on a challenge
  • Nice to Have

  • Experience in writing manual or automated test cases
  • Familiarity with LLM capabilities and typical failure modes
  • Understanding of scoring metrics (precision, recall, coverage, reward functions)
  • Benefits

  • Get paid for your expertise, with rates up to $38 / hr depending on your skills, experience, and project needs
  • Participate in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Gain valuable experience to enhance your portfolio through an advanced AI project
  • Influence how future AI models understand and communicate in your field of expertise
  • #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    Evaluation Writer Ai • Iskandar Puteri, Johor, Malaysia

    Pekerjaan berkaitan
    • Dinaikkan pangkat
    AI engineer

    AI engineer

    AIDX TECH PTE. LTD.D01 Cecil, Marina, People’s Park, Raffles Place, SG
    The AI Safety Algorithm Reinforcement project is a key initiative by AIDX aimed at strengthening and modernizing the AI safety testing algorithms integrated within our testing platform.It focuses o...Tunjukkan lagiKemas kini terakhir: 2 hari yang lalu
    • Dinaikkan pangkat
    AI Engineer

    AI Engineer

    NCS PTE. LTD.D20 Bishan, Ang Mo Kio, SG
    Asia Pacific region in over 20 cities, providing consulting, digital services, technology solutions, and more.We believe in harnessing the power of technology to achieve extraordinary things, creat...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    Remote AI Task Evaluation & Analytics Specialist

    Remote AI Task Evaluation & Analytics Specialist

    MercorWorkFromHome, Singapore, Singapore
    Jauh
    A leading AI research consulting firm is seeking a part-time AI Task Evaluation & Statistical Analysis Specialist to conduct statistical failure analysis and recommend design improvements based on ...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    AI Engineer

    AI Engineer

    CLOUD KINETICS CONSULTING PTE. LTD.Islandwide, SG
    We are seeking an experienced AI Engineer to join our Data & AI engineering team in Singapore.The ideal candidate will have a proven track record in hands-on data engineering and AI / ML model de...Tunjukkan lagiKemas kini terakhir: 2 hari yang lalu
    • Dinaikkan pangkat
    Research Assistant / Associate (AI for Cybersecurity - Automatic Agentic Penetration Testing)

    Research Assistant / Associate (AI for Cybersecurity - Automatic Agentic Penetration Testing)

    NATIONAL UNIVERSITY OF SINGAPORED05 Clementi New Town, Hong Leong Garden, Pasir Panjang, SG
    Interested applicants are invited to apply directly at the.Your application will be processed only if you apply via.We regret that only shortlisted candidates will be notified.We are looking to rec...Tunjukkan lagiKemas kini terakhir: 2 hari yang lalu
    • Dinaikkan pangkat
    AI ENGINEER

    AI ENGINEER

    REGTECH INSIGHT PTE. LTD.D16 Upper East Coast, Bedok, Eastwood, Kew Drive, SG
    Lead and facilitate workshops and cross-functional meetings to gather business requirements and define solution approaches. Analyze current business processes to identify AI integration opportunitie...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    Advertising AI Algorithm Engineer

    Advertising AI Algorithm Engineer

    PERSOL SINGAPORE PTE. LTD.Islandwide, SG
    Participate in the development of core capabilities for the advertising system, including AI infrastructure, machine learning and recommendation algorithms, to enhance ad recommendation effectivene...Tunjukkan lagiKemas kini terakhir: 2 hari yang lalu
    • Dinaikkan pangkat
    Multimodal Algo Researcher - AI Innovation Center

    Multimodal Algo Researcher - AI Innovation Center

    TIKTOK PTE. LTD.D01 Cecil, Marina, People’s Park, Raffles Place, SG
    TikTok is the leading destination for short-form mobile video.At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and its of...Tunjukkan lagiKemas kini terakhir: 2 hari yang lalu
    • Dinaikkan pangkat
    AI engineer

    AI engineer

    JEET ANALYTICS PTE. LTD.D01 Cecil, Marina, People’s Park, Raffles Place, SG
    We are hiring AI engineer with below requirements;.Lead and facilitate workshops and cross-functional meetings to gather business requirements and define solution approaches.Analyze current busines...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    Project Lead (Data Science / AI)

    Project Lead (Data Science / AI)

    FLINTEX CONSULTING PTE. LTD.D02 Anson, Tanjong Pagar, SG
    The Project Lead is responsible for the successful delivery of large scale, complex tech-enabled change projects, managing and coordinating the full project lifecycle across diverse industries.You ...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    AI Quality Engineer

    AI Quality Engineer

    NCS PTE. LTD.D20 Bishan, Ang Mo Kio, SG
    Asia Pacific region in over 20 cities, providing consulting, digital services, technology solutions, and more.We believe in harnessing the power of technology to achieve extraordinary things, creat...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    Research Associate / Fellow (AI for Cybersecurity - Automatic Agentic Penetration Testing)

    Research Associate / Fellow (AI for Cybersecurity - Automatic Agentic Penetration Testing)

    NATIONAL UNIVERSITY OF SINGAPORED05 Clementi New Town, Hong Leong Garden, Pasir Panjang, SG
    Interested applicants are invited to apply directly at the.Your application will be processed only if you apply via.We regret that only shortlisted candidates will be notified.We are looking to rec...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    AI Engineer

    AI Engineer

    FLINTEX CONSULTING PTE. LTD.D02 Anson, Tanjong Pagar, SG
    As an AI Engineer, you will leverage cutting-edge AI to solve complex, industry-specific problems, particularly within the maritime sector. You will be instrumental in a rapidly evolving, client-cen...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    LLM Code AI algorithm research scientist Graduate (AI Innovation Center)-2026(PhD)

    LLM Code AI algorithm research scientist Graduate (AI Innovation Center)-2026(PhD)

    TIKTOK PTE. LTD.D01 Cecil, Marina, People’s Park, Raffles Place, SG
    TikTok is the leading destination for short-form mobile video.At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we als...Tunjukkan lagiKemas kini terakhir: 2 hari yang lalu
    • Dinaikkan pangkat
    • Baharu!
    Senior / AI Engineer

    Senior / AI Engineer

    VIAVI SOLUTIONS SINGAPORE PTE. LTD.D20 Bishan, Ang Mo Kio, SG
    VIAVI (NASDAQ : VIAV) is a global provider of network test, monitoring and assurance solutions for telecommunications, cloud, enterprises, first responders, military, aerospace, and railway.VIAVI is...Tunjukkan lagiKemas kini terakhir: 2 jam yang lalu
    • Dinaikkan pangkat
    • Baharu!
    Lead AI Software Engineer with 7 year Experience (Contract

    Lead AI Software Engineer with 7 year Experience (Contract

    WEBSPARKS PTE. LTD.D14 Geylang, Eunos, SG
    This is a hands-on leadership role at the forefront of building.You will drive the architecture, development, and deployment of secure, scalable applications infused with AI — from.LLM-powered expe...Tunjukkan lagiKemas kini terakhir: 10 jam yang lalu
    • Dinaikkan pangkat
    Digital Technology - Specialist (AI)

    Digital Technology - Specialist (AI)

    ONG & ONG PTE. LTD.D11 Novena, Thomson, Watten Estate, SG
    Join us – Digital Technology Team.You will be part of the multi-award-winning team – ONG&ONG Group Digital Technology, an expert who will be focusing on digital transformation for ONG&ONG G...Tunjukkan lagiKemas kini terakhir: 2 hari yang lalu
    • Dinaikkan pangkat
    Gen AI Quality Engineer (Government Projects)

    Gen AI Quality Engineer (Government Projects)

    SCIENTEC CONSULTING PTE. LTD.D04 Harbourfront,Telok Blangah, Sentosa Island, SG
    Gain exposure to cutting-edge Gen AI applications, including RAG chatbots and AI-driven classification systems.Work on next-generation AI quality initiatives with cross-functional product and AI re...Tunjukkan lagiKemas kini terakhir: 4 hari yang lalu