Talent.com
AI QA Trainer – LLM Evaluation

AI QA Trainer – LLM Evaluation

Invisible Expert MarketplaceMalacca City, Malacca, Malaysia
8 hari lalu
Penerangan pekerjaan

Join to apply for the AI QA Trainer – LLM Evaluation role at Invisible Expert Marketplace

Get AI-powered advice on this job and more exclusive features.

Role Overview

Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you—we need your expertise to harden model reasoning and reliability.

Responsibilities

  • Converse with models on real-world scenarios and evaluation prompts
  • Verify factual accuracy and logical soundness of responses
  • Design and run test plans and regression suites to identify failure modes
  • Build clear rubrics and pass / fail criteria for evaluation tasks
  • Capture reproducible error traces with root‑cause hypotheses
  • Suggest improvements to prompt engineering, guardrails, and evaluation metrics (precision / recall, faithfulness, toxicity, latency SLOs)
  • Partner on adversarial red‑teaming, automation (Python / SQL), and dashboarding to track quality deltas over time
  • Document every failure mode to raise the bar for model performance
  • Challenge advanced language models on tasks such as hallucination detection, factual consistency, prompt‑injection and jailbreak resistance, bias / fairness audits, chain‑of‑reasoning reliability, tool‑use correctness, retrieval‑augmentation fidelity, and end‑to‑end workflow validation

Qualifications

  • Bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field
  • QA experience for ML / AI systems, safety / red‑team experience
  • Test automation frameworks (e.g., PyTest)
  • Hands‑on work with LLM evaluation tooling such as OpenAI Evals, RAG evaluators, W&B
  • Strong skills in evaluation rubric design, adversarial testing / red‑teaming, regression testing at scale, bias / fairness auditing, grounding verification, prompt and system‑prompt engineering
  • Test automation experience with Python / SQL and high‑signal bug reporting
  • Clear, metacognitive communication—"showing your work"—is essential
  • Compensation & Logistics

    We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply.

    Employment type : Contract

    Workplace type : Remote

    Seniority level : Mid‑Senior Level

    Call to Action

    Ready to turn your QA expertise into the quality backbone for tomorrow’s AI? Apply today and start teaching the model that will teach the world.

    #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    Trainer • Malacca City, Malacca, Malaysia

    Pekerjaan yang berkaitan
    • Dinaikkan pangkat
    Freelance Mathematics Expert - AI Trainer

    Freelance Mathematics Expert - AI Trainer

    MindriftPasir Panjang, Negeri Sembilan, Malaysia
    Freelance Mathematics Expert - AI Trainer.Be among the first 25 applicants.This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility a...Tunjukkan lagiKemas kini terakhir: 19 hari yang lalu
    • Dinaikkan pangkat
    Test Application Development (Software)

    Test Application Development (Software)

    Rohde & SchwarzKebun Baharu, Johor, Malaysia
    Test Application Development (Software).Develop software solutions for high-precision test systems for software-defined radios, develop software solutions for PCBA testing.Keep a close eye on the e...Tunjukkan lagiKemas kini terakhir: 18 hari yang lalu
    • Dinaikkan pangkat
    ML Engineer Specialist – AI Trainer

    ML Engineer Specialist – AI Trainer

    Invisible Expert MarketplaceSeremban, Negeri Sembilan, Malaysia
    ML Engineer Specialist – AI Trainer.Be among the first 25 applicants.ML Engineer Specialist – AI Trainer.Get AI-powered advice on this job and more exclusive features. Do you enjoy Kaggle-style prob...Tunjukkan lagiKemas kini terakhir: 9 hari yang lalu
    • Dinaikkan pangkat
    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia

    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia

    SpeechifySeremban, Negeri Sembilan, Malaysia
    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia.Speechify Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia. Join or sign in to find your next job.Senior Software Enginee...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia

    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia

    Clutch CanadaMalacca City, Malacca, Malaysia
    PLEASE APPLY THROUGH THIS LINK : .The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever ...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Freelance Mathematics QA Reviewer - AI Trainer

    Freelance Mathematics QA Reviewer - AI Trainer

    MindriftPasir Panjang, Negeri Sembilan, Malaysia
    Freelance Mathematics QA Reviewer - AI Trainer.This opportunity is for candidates residing in the specified country.Your location may affect eligibility and rates. Please provide your resume in Engl...Tunjukkan lagiKemas kini terakhir: 13 hari yang lalu
    • Dinaikkan pangkat
    Fulah Language Specialist - AI Trainer

    Fulah Language Specialist - AI Trainer

    Invisible Expert MarketplacePasir Panjang, Negeri Sembilan, Malaysia
    Fulah Language Specialist - AI Trainer.Are you an experienced Fulah language professional eager to shape the future of AI? Large‑scale language models are evolving rapidly, moving beyond simple cha...Tunjukkan lagiKemas kini terakhir: 8 hari yang lalu
    • Dinaikkan pangkat
    Subject Matter Expert (STEM) - 47549

    Subject Matter Expert (STEM) - 47549

    TuringMalacca City, Malacca, Malaysia
    What does day-to-day look like.Develop structured evaluation problems across STEM subjects.Create datasets with clear, verifiable solutions. Evaluate AI performance for accuracy and rigor.Document r...Tunjukkan lagiKemas kini terakhir: 22 hari yang lalu
    • Dinaikkan pangkat
    QA / QC Manager - Data Center

    QA / QC Manager - Data Center

    Turner & TownsendKebun Baharu, Johor, Malaysia
    QA / QC Manager - Data Center at Turner & Townsend.Turner & Townsend is a global professional services company with over 22,000 people in more than 60 countries. Working with our clients across real e...Tunjukkan lagiKemas kini terakhir: 18 hari yang lalu
    • Dinaikkan pangkat
    Machine Learning Engineer (Contract)

    Machine Learning Engineer (Contract)

    career.ioPasir Panjang, Negeri Sembilan, Malaysia
    We’re rapidly growing and looking for driven and customer‑obsessed professionals to help our team revolutionize the career‑services industry. Through our community of career experts and data insight...Tunjukkan lagiKemas kini terakhir: 6 hari yang lalu
    • Dinaikkan pangkat
    Test Application Development (Software & Hardware)

    Test Application Development (Software & Hardware)

    Rohde & SchwarzKebun Baharu, Johor, Malaysia
    Test Application Development (Software & Hardware).We are looking for a dynamic, self-driven and motivated Test Application Development Engineer with experience in test technologies of analog / RF / Di...Tunjukkan lagiKemas kini terakhir: 18 hari yang lalu
    • Dinaikkan pangkat
    AI Agent Evaluation Analyst

    AI Agent Evaluation Analyst

    MindriftMalacca City, Malacca, Malaysia
    Get AI-powered advice on this job and more exclusive features.This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.Plea...Tunjukkan lagiKemas kini terakhir: 8 hari yang lalu
    • Dinaikkan pangkat
    Senior Software Engineer, AI Model serving - Asia

    Senior Software Engineer, AI Model serving - Asia

    SpeechifySeremban, Negeri Sembilan, Malaysia
    Senior Software Engineer, AI Model serving - Asia.Senior Software Engineer, AI Model serving - Asia.Speechify is a text-to-speech app that makes information accessible for 20+ million users across ...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Store Manager / Store Manager Trainee (Batu Pahat, Jb Southkey, Aeon Tebrau, Jpo, Penang, Ipoh,[...]

    Store Manager / Store Manager Trainee (Batu Pahat, Jb Southkey, Aeon Tebrau, Jpo, Penang, Ipoh,[...]

    HLAKebun Baharu, Johor, Malaysia
    Store Manager / Store Manager Trainee (Batu Pahat, Jb Southkey, Aeon Tebrau, Jpo, Penang, Ipoh, Melaka, KL, Selangor, Kedah). Store Manager / Store Manager Trainee (Batu Pahat, Jb Southkey, Aeon Teb...Tunjukkan lagiKemas kini terakhir: 26 hari yang lalu
    • Dinaikkan pangkat
    Freelance Automotive Engineer Consultant - AI Trainer

    Freelance Automotive Engineer Consultant - AI Trainer

    MindriftMuar, Johor, Malaysia
    Freelance Automotive Engineering - Quality Assurance (AI Trainer).At Mindrift, innovation meets opportunity.We believe in using the power of collective intelligence to ethically shape the future of...Tunjukkan lagiKemas kini terakhir: 6 hari yang lalu
    • Dinaikkan pangkat
    Data Partner- Math- Chinese - Remote- Global

    Data Partner- Math- Chinese - Remote- Global

    TELUS Digital AI Data SolutionsMuar, Johor, Malaysia
    Data Partner- Math- Chinese - Remote- Global.TELUS Digital AI Data Solutions.We are seeking a Subject Matter Expert to design advanced, domain-specific questions and solutions and to create challen...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer - Based in Johor Bahru

    Site Reliability Engineer - Based in Johor Bahru

    Arvion ServicesKebun Baharu, Johor, Malaysia
    Site Reliability Engineer – Johor Bahru.Site Reliability Engineer – Johor Bahru.As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of crit...Tunjukkan lagiKemas kini terakhir: 7 hari yang lalu
    • Dinaikkan pangkat
    Azure AI Engineer - Malaysia

    Azure AI Engineer - Malaysia

    eduCLaaSMuar, Johor, Malaysia
    The engineer will be responsible for designing, developing, and deploying AI-powered applications on Microsoft’s Azure ecosystem. This role involves working with Azure AI Services, Azure OpenAI, Mic...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu