Talent.com
AI QA Trainer – LLM Evaluation
AI QA Trainer – LLM EvaluationInvisible Expert Marketplace • Malaysia, Malaysia
AI QA Trainer – LLM Evaluation

AI QA Trainer – LLM Evaluation

Invisible Expert Marketplace • Malaysia, Malaysia
30+ days ago
Job description

Join to apply for the AI QA Trainer – LLM Evaluation role at Invisible Expert Marketplace

Get AI-powered advice on this job and more exclusive features.

Role Overview

Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you—we need your expertise to harden model reasoning and reliability.

Responsibilities

  • Converse with models on real-world scenarios and evaluation prompts
  • Verify factual accuracy and logical soundness of responses
  • Design and run test plans and regression suites to identify failure modes
  • Build clear rubrics and pass / fail criteria for evaluation tasks
  • Capture reproducible error traces with root‑cause hypotheses
  • Suggest improvements to prompt engineering, guardrails, and evaluation metrics (precision / recall, faithfulness, toxicity, latency SLOs)
  • Partner on adversarial red‑teaming, automation (Python / SQL), and dashboarding to track quality deltas over time
  • Document every failure mode to raise the bar for model performance
  • Challenge advanced language models on tasks such as hallucination detection, factual consistency, prompt‑injection and jailbreak resistance, bias / fairness audits, chain‑of‑reasoning reliability, tool‑use correctness, retrieval‑augmentation fidelity, and end‑to‑end workflow validation

Qualifications

  • Bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field
  • QA experience for ML / AI systems, safety / red‑team experience
  • Test automation frameworks (e.g., PyTest)
  • Hands‑on work with LLM evaluation tooling such as OpenAI Evals, RAG evaluators, W&B
  • Strong skills in evaluation rubric design, adversarial testing / red‑teaming, regression testing at scale, bias / fairness auditing, grounding verification, prompt and system‑prompt engineering
  • Test automation experience with Python / SQL and high‑signal bug reporting
  • Clear, metacognitive communication—"showing your work"—is essential
  • Compensation & Logistics

    We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply.

    Employment type : Contract

    Workplace type : Remote

    Seniority level : Mid‑Senior Level

    Call to Action

    Ready to turn your QA expertise into the quality backbone for tomorrow’s AI? Apply today and start teaching the model that will teach the world.

    #J-18808-Ljbffr

    Create a job alert for this search

    Trainer • Malaysia, Malaysia

    Related jobs
    Aymara Language Expert - AI Trainer

    Aymara Language Expert - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Be among the first 25 applicants.Are you an Aymara language expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into powerful tools for communicati...Show more
    Last updated: 30+ days ago • Promoted
    Analytical Chemistry Specialist - AI Trainer

    Analytical Chemistry Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Analytical Chemistry Specialist – AI Trainer.Analytical Chemistry Specialist – AI Trainer.Are you an analytical chemistry expert eager to shape the future of AI? Large‑scale language models are evo...Show more
    Last updated: 30+ days ago • Promoted
    Freelance AI Agent Trainer

    Freelance AI Agent Trainer

    Mindrift • MY
    Quick Apply
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of En...Show more
    Last updated: 30+ days ago
    Freelance ServiceNow Consultant - AI Trainer

    Freelance ServiceNow Consultant - AI Trainer

    Mindrift • Malaysia, Malaysia
    Freelance ServiceNow Consultant - AI Trainer.This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.Please submit your re...Show more
    Last updated: 30+ days ago • Promoted
    Mapudungun Language Expert - AI Trainer

    Mapudungun Language Expert - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Be among the first 25 applicants.Are you a Mapudungun language expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into powerful tools for communic...Show more
    Last updated: 30+ days ago • Promoted
    Engineer, AI / ML

    Engineer, AI / ML

    Fairview International School • Malaysia, Malaysia
    As an Engineer, AI / Machine Learning, your responsibility will include proposing and developing innovative solutions for our business problems using AI and ML frameworks. CompAsia is a digital and te...Show more
    Last updated: 30+ days ago • Promoted
    Oromo Language Specialist - AI Trainer

    Oromo Language Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Oromo Language Specialist – AI Trainer.Join to apply for the Oromo Language Specialist – AI Trainer role at Invisible Expert Marketplace. Large-scale language models are evolving rapidly.With high-q...Show more
    Last updated: 30+ days ago • Promoted
    Wolof Language Specialist - AI Trainer

    Wolof Language Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Wolof Language Specialist – AI Trainer.AI models for Wolof speakers worldwide.Review and annotate Wolof content for training datasets. Evaluate AI-generated outputs for accuracy, fluency, and cultur...Show more
    Last updated: 30+ days ago • Promoted
    Kabuverdianu Language Specialist - AI Trainer

    Kabuverdianu Language Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Kabuverdianu Language Specialist - AI Trainer.Are you an experienced Kabuverdianu language professional eager to shape the future of AI? Large‑scale language models are evolving rapidly, moving bey...Show more
    Last updated: 30+ days ago • Promoted
    Luo Language Specialist - AI Trainer

    Luo Language Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Join to apply for the Luo Language Specialist - AI Trainer role at Invisible Expert Marketplace.Are you an experienced Luo language professional eager to shape the future of AI? Large-scale languag...Show more
    Last updated: 25 days ago • Promoted
    Umbundu Language Specialist - AI Trainer

    Umbundu Language Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Umbundu Language Specialist - AI Trainer.Are you an experienced Umbundu language professional eager to shape the future of AI? Large‑scale language models are evolving rapidly, moving beyond simple...Show more
    Last updated: 30+ days ago • Promoted
    AI Agent Evaluation Analyst

    AI Agent Evaluation Analyst

    Mindrift • MY
    Quick Apply
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of En...Show more
    Last updated: 30+ days ago
    Occitan Language Specialist - AI Trainer

    Occitan Language Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Occitan Language Specialist – AI Trainer.Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features. We’re looking for a highly skilled Occitan language specialis...Show more
    Last updated: 28 days ago • Promoted
    Freelance AI / ML Penetration Tester

    Freelance AI / ML Penetration Tester

    Mindrift • Malaysia, Malaysia
    This opportunity is only for candidates currently residing in the specified country.Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of En...Show more
    Last updated: 10 days ago • Promoted
    Guaraní Language Expert - AI Trainer

    Guaraní Language Expert - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Guaraní Language Expert – AI Trainer.Consulting, Training, and Information Technology.Are you a Guaraní language expert eager to shape the future of AI? Large‑scale language models are evolving fro...Show more
    Last updated: 30+ days ago • Promoted
    Freelance Zendesk Consultant - AI Trainer

    Freelance Zendesk Consultant - AI Trainer

    Mindrift • Malaysia, Malaysia
    Freelance Zendesk Consultant – AI Trainer.Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing i...Show more
    Last updated: 10 days ago • Promoted
    Fulah Language Specialist - AI Trainer

    Fulah Language Specialist - AI Trainer

    Invisible Expert Marketplace • Malaysia, Malaysia
    Fulah Language Specialist - AI Trainer.Are you an experienced Fulah language professional eager to shape the future of AI? Large‑scale language models are evolving rapidly, moving beyond simple cha...Show more
    Last updated: 30+ days ago • Promoted
    Evaluation Scenario Writer - AI Agent Testing Specialist

    Evaluation Scenario Writer - AI Agent Testing Specialist

    Mindrift • Malaysia, Malaysia
    Mindrift is looking for a freelance.The role focuses on designing realistic and structured evaluation scenarios for LLM‑based agents, testing agent outputs, and refining tests.You will work on a fl...Show more
    Last updated: 9 days ago • Promoted