Talent.com
HPC System Administrator
HPC System AdministratorUniversity of Malaya • Kuala Lumpur, Kuala Lumpur, Malaysia
HPC System Administrator

HPC System Administrator

University of Malaya • Kuala Lumpur, Kuala Lumpur, Malaysia
1 day ago
Job description

Direct message the job poster from University of Malaya

The ideal candidate will design, organize, and modify the company's computer systems. This individual will evaluate and assess systems to ensure they are operating effectively. Based on assessments, this individual will harness collected knowledge and make adjustments to existing systems.

Here is the detailed breakdown of responsibilities for the role :

1. System & Infrastructure Management

  • Cluster Operations : Install, configure, and maintain Linux-based HPC clusters, compute nodes, and associated server hardware.
  • Storage Management : Manage and optimize large-scale parallel file systems (e.g., Lustre, GPFS) and high-performance storage solutions.
  • Network Administration : Configure, manage, and monitor the high-speed HPC network infrastructure, including InfiniBand and Ethernet fabrics, to ensure optimal performance.
  • Security & Patching : Implement system security policies, perform regular security hardening, and apply OS patches and updates to ensure system integrity.
  • Backup & Recovery : Oversee and execute data backup and disaster recovery procedures for critical systems and user data.

2. Application & Software Support

  • Software Deployment : Install, compile, and manage a wide range of scientific applications, compilers (e.g., GNU, Intel), and parallel libraries (e.g., MPI, OpenMP, CUDA).
  • Scheduler Management : Manage and configure the HPC job scheduling system (e.g., Slurm, PBS) to ensure fair resource allocation, manage queues, and optimize cluster efficiency.
  • Application Troubleshooting : Assist researchers in debugging and optimizing their parallel codes and software workflows.
  • 3. Monitoring & Performance Tuning

  • System Monitoring : Implement and maintain robust monitoring tools (e.g., Ganglia, Nagios, Prometheus / Grafana) to track cluster health, resource utilization, and job performance.
  • Problem Resolution : Proactively identify, troubleshoot, and resolve system bottlenecks, hardware failures, and software issues to minimize downtime.
  • Performance Analysis : Analyze system logs and performance metrics to recommend and implement optimisations for the cluster and storage systems.
  • Technical Support : Serve as a primary point of contact for researchers and students, providing expert technical support for job submission, data management, and software issues.
  • Account Management : Manage user accounts, project allocations, and resource quotas.
  • Training & Documentation : Develop and deliver training workshops, user guides, and technical documentation to help users effectively utilize the HPC resources.
  • Liaison : Collaborate with researchers to understand their computational needs and provide guidance on HPC best practices.
  • Qualifications

  • Bachelor's degree in computer science, preferably in networking or computer systems
  • Experience as a System Administrator
  • Interested to learn about HPC
  • Strong analytical skills
  • Local candidate only (Malaysian)
  • Seniority level

  • Entry level
  • Employment type

  • Full-time
  • Industries

  • Education Management
  • #J-18808-Ljbffr

    Create a job alert for this search

    System Administrator • Kuala Lumpur, Kuala Lumpur, Malaysia

    Related jobs
    System Administrator - MT4 / MT5

    System Administrator - MT4 / MT5

    Hytech • Kuala Lumpur, Kuala Lumpur, Malaysia
    Hytech Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Hytech is a leading company specializing in cutting-edge financial technology solutions. Our innovative platforms and applications em...Show more
    Last updated: 30+ days ago • Promoted
    System Administrator

    System Administrator

    Ryt Bank • Kuala Lumpur, Kuala Lumpur, Malaysia
    Passionate about building an innovative workforce for The Right Bank.Ryt Bank is seeking a skilled and proactive System Administrator to manage and support our IT infrastructure and end‑user comput...Show more
    Last updated: 19 days ago • Promoted
    Linux System Administrator [Hybrid]

    Linux System Administrator [Hybrid]

    TDCX • Kuala Lumpur, Kuala Lumpur, Malaysia
    TDCX Federal Territory of Kuala Lumpur, Malaysia.Get AI-powered advice on this job and more exclusive features.Do you aspire to have a rewarding career where you can thrive, grow, and achieve your ...Show more
    Last updated: 26 days ago • Promoted
    System Administrator, Engineering

    System Administrator, Engineering

    Prometric Ireland Limited • Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Posted Wednesday, July 23, 2025 at 4 : 00 PM | Expires Friday, August 29, 2025 at 3 : 59 PM. About Us : Prometric is a leading provider of technol...Show more
    Last updated: 30+ days ago • Promoted
    System Administrator

    System Administrator

    RiDiK (a Subsidiary of CLPS. Nasdaq : CLPS) • Kuala Lumpur, Kuala Lumpur, Malaysia
    Direct message the job poster from RiDiK (a Subsidiary of CLPS.RiDiK is a global technology solutions provider and a subsidiary of CLPS Incorporation (NASDAQ : CLPS), delivering cutting‑edge end‑to‑...Show more
    Last updated: 15 days ago • Promoted
    DevOps and System Administrator

    DevOps and System Administrator

    Hiredly X • Subang Jaya, Selangor, Malaysia
    Our client is a leading technology solutions provider specializing in enterprise software development, data systems, and IT infrastructure services. The company partners with clients across diverse ...Show more
    Last updated: 8 days ago • Promoted
    Microsoft System Administrator

    Microsoft System Administrator

    Tata Consultancy Services • Kuala Lumpur, Kuala Lumpur, Malaysia
    Direct message the job poster from Tata Consultancy Services.A purpose-led organization that is building a meaningful future through innovation, technology, and collective knowledge.Tata Consultanc...Show more
    Last updated: 24 days ago • Promoted
    Microsoft System Administrator

    Microsoft System Administrator

    Ambition • Kuala Lumpur, Kuala Lumpur, Malaysia
    Senior Consultant - Technology | Banking & Financial Services | IT Infrastructure.Ambition Federal Territory of Kuala Lumpur, Malaysia. Administering M365 collaboration tools and Exchange (on-prem, ...Show more
    Last updated: 17 days ago • Promoted
    Utilities Engineer

    Utilities Engineer

    Lonza • Batang Berjuntai, Selangor, Malaysia
    Today, Lonza is a global leader in life sciences operating across five continents.While we work in science, there’s no magic formula to how we do it. Our greatest scientific solution is dedicated in...Show more
    Last updated: 20 days ago • Promoted
    Cloudera System Administrator

    Cloudera System Administrator

    RiDiK (a Subsidiary of CLPS. Nasdaq : CLPS) • Kuala Lumpur, Kuala Lumpur, Malaysia
    Primary Skills : Cloudera, CDP, Hadoop ecosystem, Kubernetes, Big Data technologies, CCA Administrator, CCA Spark and Hadoop Developer. We are looking for an experienced Cloudera System Administrator...Show more
    Last updated: 6 days ago • Promoted
    IT System Administrator

    IT System Administrator

    CGI • Kuala Lumpur, Malaysia
    CGI Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Join or sign in to find your next job Join to apply for the. The IT System Administrator supports IT infrastructure, ensuring systems ru...Show more
    Last updated: 30+ days ago • Promoted
    IAM System Engineer

    IAM System Engineer

    Prometric Ireland Limited • Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Posted Sunday 24 August 2025 at 16 : 00 | Expires Sunday 31 August 2025 at 15 : 59. The IAM System Engineer plays a critical role in safeguardin...Show more
    Last updated: 30+ days ago • Promoted
    Head of System & Operation (GPU System)

    Head of System & Operation (GPU System)

    EPS Consultants • Kuala Lumpur, Kuala Lumpur, Malaysia
    Oversee the design, implementation, and maintenance of IT systems that support operational activities, ensuring high availability and performance of GPU resources. Provide technical guidance across ...Show more
    Last updated: 30+ days ago • Promoted
    System Administrator Specialist (POS System)

    System Administrator Specialist (POS System)

    ZUS COFFEE • Subang Jaya, Selangor, Malaysia
    System Maintenance : Regularly perform system maintenance tasks for ERP and POS systems to ensure they operate efficiently without interruption. Upgrades and Enhancements : Plan and implement system u...Show more
    Last updated: 30+ days ago • Promoted
    Fintech System Administrator

    Fintech System Administrator

    Hytech • Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Hytech is a leading player in the fintech industry, committed to providing cutting-edge trading solutions and services to our clients.Our R...Show more
    Last updated: 30+ days ago • Promoted
    System Lead Administrator

    System Lead Administrator

    Accenture Southeast Asia • Kuala Lumpur, Kuala Lumpur, Malaysia
    The Senior System Administrator (SA) is responsible for effective provisioning, installation / configuration, operation, and maintenance of systems hardware and software and related infrastructure.Pa...Show more
    Last updated: 30+ days ago • Promoted
    Senior System Administrator

    Senior System Administrator

    Gamuda Group • Petaling Jaya, Selangor, Malaysia
    Press Tab to Move to Skip to Content Link.Select how often (in days) to receive an alert : .We are seeking a skilled and proactive L3 System Administrator to join our support team.This role provides ...Show more
    Last updated: 30+ days ago • Promoted
    Junior IBM Power Systems Administrator

    Junior IBM Power Systems Administrator

    Clarks group • Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Posted Sunday, June 22, 2025 at 4 : 00 PM.The Infrastructure Platforms team manages all server-related infrastructure and hardware within the...Show more
    Last updated: 30+ days ago • Promoted