Talent.com
AI QA Trainer – LLM Evaluation

Invisible Expert Marketplace • Muar, Johor, Malaysia
11 days ago
Job description

Role Overview

Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you—we need your expertise to harden model reasoning and reliability.

Responsibilities

  • Converse with models on real-world scenarios and evaluation prompts
  • Verify factual accuracy and logical soundness of responses
  • Design and run test plans and regression suites to identify failure modes
  • Build clear rubrics and pass / fail criteria for evaluation tasks
  • Capture reproducible error traces with root‑cause hypotheses
  • Suggest improvements to prompt engineering, guardrails, and evaluation metrics (precision / recall, faithfulness, toxicity, latency SLOs)
  • Partner on adversarial red‑teaming, automation (Python / SQL), and dashboarding to track quality deltas over time
  • Document every failure mode to raise the bar for model performance
  • Challenge advanced language models on tasks such as hallucination detection, factual consistency, prompt‑injection and jailbreak resistance, bias / fairness audits, chain‑of‑reasoning reliability, tool‑use correctness, retrieval‑augmentation fidelity, and end‑to‑end workflow validation

Qualifications

  • Bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field
  • QA experience for ML / AI systems, safety / red‑team experience
  • Experience with test automation frameworks (e.g., pytest)
  • Hands‑on work with LLM evaluation tooling such as OpenAI Evals, RAG evaluators, W&B
  • Strong skills in evaluation rubric design, adversarial testing / red‑teaming, regression testing at scale, bias / fairness auditing, grounding verification, prompt and system‑prompt engineering
  • Test automation experience with Python / SQL and high‑signal bug reporting
  • Clear, metacognitive communication—"showing your work"—is essential

Compensation & Logistics

We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply.

Employment type: Contract

Workplace type: Remote

Seniority level: Mid‑Senior Level

Call to Action

Ready to turn your QA expertise into the quality backbone for tomorrow’s AI? Apply today and start teaching the model that will teach the world.

