Talent.com
AI QA Trainer – LLM Evaluation

Invisible Expert Marketplace, Batu Pahat, Johor, Malaysia
Posted 15 days ago
Job description

Role Overview

Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you—we need your expertise to harden model reasoning and reliability.

Responsibilities

  • Converse with models on real-world scenarios and evaluation prompts
  • Verify factual accuracy and logical soundness of responses
  • Design and run test plans and regression suites to identify failure modes
  • Build clear rubrics and pass / fail criteria for evaluation tasks
  • Capture reproducible error traces with root‑cause hypotheses
  • Suggest improvements to prompt engineering, guardrails, and evaluation metrics (precision / recall, faithfulness, toxicity, latency SLOs)
  • Partner on adversarial red‑teaming, automation (Python / SQL), and dashboarding to track quality deltas over time
  • Document every failure mode to raise the bar for model performance
  • Challenge advanced language models on tasks such as hallucination detection, factual consistency, prompt‑injection and jailbreak resistance, bias / fairness audits, chain‑of‑reasoning reliability, tool‑use correctness, retrieval‑augmentation fidelity, and end‑to‑end workflow validation
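
As an illustration of the pass / fail criteria and precision / recall metrics mentioned above, here is a minimal Python sketch. It is not taken from the employer's tooling: the `EvalCase` fields, the function names, and the 0.8 thresholds are all hypothetical choices made for the example.

```python
from dataclasses import dataclass


@dataclass
class EvalCase:
    """One labelled evaluation example (hypothetical structure)."""
    prompt: str
    predicted_flag: bool  # the automated check flagged the response as faulty
    actual_flag: bool     # a human reviewer judged the response as faulty


def precision_recall(cases):
    """Return (precision, recall) of the automated check against human labels."""
    tp = sum(c.predicted_flag and c.actual_flag for c in cases)
    fp = sum(c.predicted_flag and not c.actual_flag for c in cases)
    fn = sum((not c.predicted_flag) and c.actual_flag for c in cases)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def passes(cases, min_precision=0.8, min_recall=0.8):
    """Pass / fail rule: both metrics must meet their (assumed) thresholds."""
    precision, recall = precision_recall(cases)
    return precision >= min_precision and recall >= min_recall


# Tiny illustrative data set: 2 true positives, 1 false positive, 1 false negative.
sample_cases = [
    EvalCase("q1", True, True),
    EvalCase("q2", True, True),
    EvalCase("q3", True, False),
    EvalCase("q4", False, True),
    EvalCase("q5", False, False),
]
```

A check like `passes(sample_cases)` could sit inside a regression suite so that a drop in either metric fails the run; real thresholds would depend on the task.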

Qualifications

  • Bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field
  • QA experience for ML / AI systems, safety / red‑team experience
  • Experience with test automation frameworks (e.g., pytest)
  • Hands‑on work with LLM evaluation tooling such as OpenAI Evals, RAG evaluators, W&B
  • Strong skills in evaluation rubric design, adversarial testing / red‑teaming, regression testing at scale, bias / fairness auditing, grounding verification, prompt and system‑prompt engineering
  • Test automation experience with Python / SQL and high‑signal bug reporting
  • Clear, metacognitive communication—"showing your work"—is essential

Compensation & Logistics

We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply.

Employment type: Contract

Workplace type: Remote

Seniority level: Mid‑Senior Level

Call to Action

Ready to turn your QA expertise into the quality backbone for tomorrow’s AI? Apply today and start teaching the model that will teach the world.

