Talent.com
AI QA Trainer – LLM Evaluation

Invisible Expert Marketplace, Port Klang, Malaysia
Posted 23 days ago
Job description

Role Overview

Are you an AI QA expert eager to shape the future of AI? Large-scale language models are evolving from clever chatbots into enterprise-grade platforms. With rigorous evaluation data, tomorrow’s AI can democratize world-class education, keep pace with cutting-edge research, and streamline workflows for teams everywhere. That quality begins with you—we need your expertise to harden model reasoning and reliability.

Responsibilities

  • Converse with models on real-world scenarios and evaluation prompts
  • Verify factual accuracy and logical soundness of responses
  • Design and run test plans and regression suites to identify failure modes
  • Build clear rubrics and pass / fail criteria for evaluation tasks
  • Capture reproducible error traces with root‑cause hypotheses
  • Suggest improvements to prompt engineering, guardrails, and evaluation metrics (precision / recall, faithfulness, toxicity, latency SLOs)
  • Partner on adversarial red‑teaming, automation (Python / SQL), and dashboarding to track quality deltas over time
  • Document every failure mode to raise the bar for model performance
  • Challenge advanced language models on tasks such as hallucination detection, factual consistency, prompt‑injection and jailbreak resistance, bias / fairness audits, chain‑of‑reasoning reliability, tool‑use correctness, retrieval‑augmentation fidelity, and end‑to‑end workflow validation

Qualifications

  • Bachelor’s, master’s, or PhD in computer science, data science, computational linguistics, statistics, or a related field
  • QA experience for ML / AI systems, safety / red‑team experience
  • Experience with test automation frameworks (e.g., pytest)
  • Hands‑on work with LLM evaluation tooling such as OpenAI Evals, RAG evaluators, W&B
  • Strong skills in evaluation rubric design, adversarial testing / red‑teaming, regression testing at scale, bias / fairness auditing, grounding verification, prompt and system‑prompt engineering
  • Test automation experience with Python / SQL and high‑signal bug reporting
  • Clear, metacognitive communication—"showing your work"—is essential

Compensation & Logistics

We offer a pay range of $6 to $65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply.

Employment type: Contract

Workplace type: Remote

Seniority level: Mid‑Senior Level

Call to Action

Ready to turn your QA expertise into the quality backbone for tomorrow’s AI? Apply today and start teaching the model that will teach the world.
