Talent.com
AI QA Trainer – LLM Evaluation

Invisible Expert Marketplace • Kuantan, Pahang, Malaysia
5 days ago
Job description

Role Overview

Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you—we need your expertise to harden model reasoning and reliability.

Responsibilities

  • Converse with models on real-world scenarios and evaluation prompts
  • Verify factual accuracy and logical soundness of responses
  • Design and run test plans and regression suites to identify failure modes
  • Build clear rubrics and pass / fail criteria for evaluation tasks
  • Capture reproducible error traces with root‑cause hypotheses
  • Suggest improvements to prompt engineering, guardrails, and evaluation metrics (precision / recall, faithfulness, toxicity, latency SLOs)
  • Partner on adversarial red‑teaming, automation (Python / SQL), and dashboarding to track quality deltas over time
  • Document every failure mode to raise the bar for model performance
  • Challenge advanced language models on tasks such as hallucination detection, factual consistency, prompt‑injection and jailbreak resistance, bias / fairness audits, chain‑of‑reasoning reliability, tool‑use correctness, retrieval‑augmentation fidelity, and end‑to‑end workflow validation

Qualifications

  • Bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field
  • QA experience with ML / AI systems, plus safety / red‑team experience
  • Experience with test automation frameworks (e.g., pytest)
  • Hands‑on work with LLM evaluation tooling such as OpenAI Evals, RAG evaluators, W&B
  • Strong skills in evaluation rubric design, adversarial testing / red‑teaming, regression testing at scale, bias / fairness auditing, grounding verification, and prompt and system‑prompt engineering
  • Test automation experience with Python / SQL and high‑signal bug reporting
  • Clear, metacognitive communication—"showing your work"—is essential

Compensation & Logistics

We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply.

Employment type: Contract

Workplace type: Remote

Seniority level: Mid‑Senior Level

Call to Action

Ready to turn your QA expertise into the quality backbone for tomorrow’s AI? Apply today and start teaching the model that will teach the world.
