Talent.com
HPC System Administrator
HPC System AdministratorUniversity of Malaya • Kuala Lumpur, Kuala Lumpur, Malaysia
HPC System Administrator

HPC System Administrator

University of Malaya • Kuala Lumpur, Kuala Lumpur, Malaysia
3 hari lalu
Penerangan pekerjaan

Direct message the job poster from University of Malaya

The ideal candidate will design, organize, and modify the company's computer systems. This individual will evaluate and assess systems to ensure they are operating effectively. Based on assessments, this individual will harness collected knowledge and make adjustments to existing systems.

Here is the detailed breakdown of responsibilities for the role :

1. System & Infrastructure Management

  • Cluster Operations : Install, configure, and maintain Linux-based HPC clusters, compute nodes, and associated server hardware.
  • Storage Management : Manage and optimize large-scale parallel file systems (e.g., Lustre, GPFS) and high-performance storage solutions.
  • Network Administration : Configure, manage, and monitor the high-speed HPC network infrastructure, including InfiniBand and Ethernet fabrics, to ensure optimal performance.
  • Security & Patching : Implement system security policies, perform regular security hardening, and apply OS patches and updates to ensure system integrity.
  • Backup & Recovery : Oversee and execute data backup and disaster recovery procedures for critical systems and user data.

2. Application & Software Support

  • Software Deployment : Install, compile, and manage a wide range of scientific applications, compilers (e.g., GNU, Intel), and parallel libraries (e.g., MPI, OpenMP, CUDA).
  • Scheduler Management : Manage and configure the HPC job scheduling system (e.g., Slurm, PBS) to ensure fair resource allocation, manage queues, and optimize cluster efficiency.
  • Application Troubleshooting : Assist researchers in debugging and optimizing their parallel codes and software workflows.
  • 3. Monitoring & Performance Tuning

  • System Monitoring : Implement and maintain robust monitoring tools (e.g., Ganglia, Nagios, Prometheus / Grafana) to track cluster health, resource utilization, and job performance.
  • Problem Resolution : Proactively identify, troubleshoot, and resolve system bottlenecks, hardware failures, and software issues to minimize downtime.
  • Performance Analysis : Analyze system logs and performance metrics to recommend and implement optimisations for the cluster and storage systems.
  • Technical Support : Serve as a primary point of contact for researchers and students, providing expert technical support for job submission, data management, and software issues.
  • Account Management : Manage user accounts, project allocations, and resource quotas.
  • Training & Documentation : Develop and deliver training workshops, user guides, and technical documentation to help users effectively utilize the HPC resources.
  • Liaison : Collaborate with researchers to understand their computational needs and provide guidance on HPC best practices.
  • Qualifications

  • Bachelor's degree in computer science, preferably in networking or computer systems
  • Experience as a System Administrator
  • Interested to learn about HPC
  • Strong analytical skills
  • Local candidate only (Malaysian)
  • Seniority level

  • Entry level
  • Employment type

  • Full-time
  • Industries

  • Education Management
  • #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    System Administrator • Kuala Lumpur, Kuala Lumpur, Malaysia

    Pekerjaan berkaitan
    HPC / AI Solution Architect

    HPC / AI Solution Architect

    Hewlett Packard Enterprise Development LP • Kuala Lumpur, Malaysia
    HPC / AI Presales tasks as a team member of the APAC / India HPC / AI Presales team • Solution architecting, system configuration, technical consulting, presentation delivery, and sales support for genera...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    System Analyst Lead, Cards

    System Analyst Lead, Cards

    Sperton Global AS • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    Lead, build and manage a team of IT System analysts supporting cards platform and strategic projects across the franchise. Build, manage and implement system capabilities based on understanding of t...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    VP2 System Analyst (SAP ECC6), Far Tech

    VP2 System Analyst (SAP ECC6), Far Tech

    Sperton Global AS • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    Lead and facilitate IT projects for the bank and APAC branches.Gather and analyze business requirements, define project scope, and build consensus with users. Develop and manage project plans, sched...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Hiring L2 Support Expert

    Hiring L2 Support Expert

    Two95 International Inc. • Bangsar South, Kuala Lumpur, MY
    Quick Apply
    L2 Support Expert and L1 Support.Application Support | Trouble Shooting | L2 Technical Support | No-SQL | Linux / UNIX | HTTP | JSON / XML | Java Script | Bash. L2 Support Expert is software focused, te...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    VP1 Senior Application Maintenance Support

    VP1 Senior Application Maintenance Support

    Sperton Global AS • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    We Are Hiring – VP1 Senior Application Maintenance Support (Project) | Kuala Lumpur, Malaysia.We are looking for an experienced Senior Application Maintenance Support professional (VP1 level) to jo...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Sr. Systems Engineer

    Sr. Systems Engineer

    Two95 International Inc. • Kuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    To ensure successful implementation of projects within schedule.To ensure SLAs are met and achieved the highest customer satisfaction. Oversee the design, development and implementation of clients s...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    Technical Support Administrator

    Technical Support Administrator

    RIMINI_MALAYSIA RIMINI STREET MALAYSIA Sdn Bhd • Nilai, Negeri Sembilan, Malaysia
    Technical Support Administrator page is loaded.Technical Support Administrator.Apply locations Remote Malaysia time type Full time posted on Posted 12 Days Ago job requisition id R-.Nasdaq : RMNI), ...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    23. 116588 Lead Application / System Sales Engineer

    23. 116588 Lead Application / System Sales Engineer

    half the sky • Kuala Lumpur, Malaysia
    Start your career by making an impact and real connections with some of the most meaningful challenges around.When you join Honeywell, you become a member of our performance culture comprised of di...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu
    System Analyst

    System Analyst

    Sperton Global AS • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    You will be involved in Compliance domain and be responsible for understanding business requirements and translating them to functional specifications and technical design specifications.You will b...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    System Analyst (Wealth), Core Banking for a leading bank in Malaysia.

    System Analyst (Wealth), Core Banking for a leading bank in Malaysia.

    Sperton Global AS • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    You will be responsible for the end-to-end software development and support for all work transitioned from Group (which could be projects, quarterly change requests, L3 production fixes).This inclu...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    IT ADMINISTRATOR

    IT ADMINISTRATOR

    Iswanah Resources Sdn Bhd • Batu Caves, Selangor, Malaysia
    Diploma in Computer Science, Information Technology or a related field.IT Admininstrator, Network Administrator or similar role. Strong Understanding of TCP / IP, DNS, DHCP, VPN, Firewalls, and routin...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    System Analyst, BWise

    System Analyst, BWise

    Sperton Global AS • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    As the System Analyst, you will be reporting to the Delivery Lead and be in the Company Group IT Governance, Risk and Compliance (GRC) Team, part of Group Risk Management.Being a Technical Analyst ...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    HPC / AI Solution Architect

    HPC / AI Solution Architect

    Hewlett Packard Enterprise • Kajang Municipal Council, Selangor, Malaysia
    Remote / teleworker role; you will primarily work from home.Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We focus on connecting, protecting, a...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    System Analyst (Wealth), Core Banking for a leading bank inMalaysia.

    System Analyst (Wealth), Core Banking for a leading bank inMalaysia.

    Sperton Global AS • Subang Jaya, Selangor, Malaysia
    You will be responsible for the end-to-end software development and support for all work transitioned from Group (which could be projects, quarterly change requests, L3 production fixes).This inclu...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    IT Support & Systems Administrator

    IT Support & Systems Administrator

    TechBiz Global GmbH • Kuala Lumpur, 14, MY
    At TechBiz Global, we are providing recruitment service to our TOP clients from our portfolio.IT Support & Systems Administrator. If you're looking for an exciting opportunity to grow in a innov...Tunjukkan lagi
    Kemas kini terakhir: 17 hari yang lalu
    Senior Linux Administrator &Application Middleware Infrastructure SME, GIPS for Well Known Bank at Kuala Lumpur, Malaysia

    Senior Linux Administrator &Application Middleware Infrastructure SME, GIPS for Well Known Bank at Kuala Lumpur, Malaysia

    Sperton Global AS • Subang Jaya, Selangor, Malaysia
    Analyze, design, plan, co-ordinate, create infrastructure required to run the application environments.Install & deploy the application software and provide BAU support that includes midd...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat
    Airbus - Protege RTW - Business Support Operations

    Airbus - Protege RTW - Business Support Operations

    Airbus Customer Services Sdn Bhd • Sepang, Malaysia
    Job Description : • • Opening Application for PROTEGE Program within Airbus in Malaysia.This program is for Malaysian fresh graduates only, as mandated by Malaysian Government.Kindly note this appli...Tunjukkan lagi
    Kemas kini terakhir: 4 jam yang lalu • Dinaikkan pangkat • Baharu!
    VP1 System Analyst (CardLink), Cards

    VP1 System Analyst (CardLink), Cards

    Sperton Global AS • Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    Gather and document business requirements, translating them into detailed user stories and functional specifications.Conduct cross-functional analysis and collaborate with technology teams for solu...Tunjukkan lagi
    Kemas kini terakhir: 30+ hari yang lalu • Dinaikkan pangkat