Talent.com
Platform Reliability Engineer
Platform Reliability EngineerAbhidi Solution • Kuala Lumpur, Kuala Lumpur, Malaysia
Platform Reliability Engineer

Platform Reliability Engineer

Abhidi Solution • Kuala Lumpur, Kuala Lumpur, Malaysia
6 hours ago
Job description

Direct message the job poster from Abhidi Solution

Assistant Manager - Talent Acquisition @ Abhidi Solution | MBA | IT and Non-IT Recruitment | APAC

Job Purpose

Platform Reliability Engineer (PRE) is responsible for engineering, operating, and maintaining GEL’s internal container platform and its supporting infrastructure, with a strong focus on reliability, resiliency, and security. As a Senior PRE within GEL’s Infrastructure team, you will play a pivotal role in designing, building, and operating distributed container hosting solutions using Broadcom’s Tanzu product.

Responsibilities

  • As a Senior Platform Reliability Engineer, you will play a key role in maintaining the stability, reliability, and efficiency of GEL’s internal container platform and its supporting infrastructure. Your responsibilities will include core operational tasks such as resource provisioning and management, responding to platform and application outages, capacity planning, monitoring, and driving reliability enhancements.
  • You will continuously evaluate platform’s technical architecture to ensure it scales effectively with evolving application demands.
  • This includes proactively identifying and resolving reliability issues, analyzing product dependencies, pinpointing performance bottlenecks, and implementing optimization strategies to enhance platform availability and cost efficiency.
  • In this role, you will participate in a 24 / 7 on-call rotation, promptly addressing alerts from the global monitoring team and resolving production incidents to maintain platform and application uptime. Additionally, you will regularly review team workflows to identify manual processes and implement automation solutions that reduce effort and minimize human error.
  • Regularly review the security advisory issued by Broadcom related to Tanzu suite of products and deploy product updates as required to keep platform vulnerable free.
  • Work with open-source technologies, CI / CD, SCM tools as necessary, and source control such as Bitbucket, implement organization containers (eg, Docker and Kubernetes). Stay current with industry trends and propose new ways for our business to improve.
  • Takes accountability in considering business and regulatory compliance risks and takes appropriate steps to mitigate the risks.
  • Maintains awareness of industry trends on regulatory compliance, emerging threats and technologies in order to understand the risk and better safeguard the company.
  • Highlights any potential concerns / risks and proactively shares best risk management practices.

Requirements

  • Working experience as a Platform Reliability Engineer or strong working experience as a Site Reliability Engineer in a cloud operating environment. Candidates with excellent DevOps experience will be considered.
  • Good working knowledge of DevOps pipeline and automation tools (E.g. Selenium, SOAPUI, Bamboo, Jenkins, Ansible, Maven, Github, Bitbucket, Nexus, Jira, Confluence etc).
  • Strong technical and business acumen with the ability to lead a small technical team.
  • Experience with infrastructure-as-code, server templating, orchestration, configuration management and provisioning tools is advantageous e.g. Terraform, Chef, Docker, Packer, Kubernetes.
  • Must code, debug and optimize code and automate repetitive tasks.
  • Systematic problem-solving approach, coupled with effective communication skills and a sense of ownership and drive.
  • Experienced in one or more of the following : C, C++, Java, Python, Go, Perl or Ruby.
  • Strong experience in a Continuous Integration / Continuous Delivery (CI / CD) environment with strong appreciation of change / version control process and methodologies.
  • Strong experience in dealing with platform upgrades, patching and buildpack management.
  • Strong experience in troubleshooting network related issues.
  • Good working knowledge of NSX‑T solution and its integration with various Tanzu suite of products.
  • Candidate should be open to take up on call support on rotation basis.
  • Candidate should be willing to work in shifts.
  • Seniority level

    Mid-Senior level

    Employment type

    Full-time

    Job function

    Information Technology

    Industries

    IT Services and IT Consulting

    Referrals increase your chances of interviewing at Abhidi Solution by 2x

    Get notified about new Platform Engineer jobs in Greater Kuala Lumpur.

    #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

    Related jobs
    Site Reliability Engineer | PJ

    Site Reliability Engineer | PJ

    Hunters International Sdn Bhd • Petaling Jaya, Selangor, Malaysia
    About the job Site Reliability Engineer | PJ.Primarily responsible for day to day support of Ecommerce Platform Application, FX clients and Stock Brokers using FX. Perform monitoring using available...Show more
    Last updated: 3 days ago • Promoted
    Senior Digital Platform Reliability Lead

    Senior Digital Platform Reliability Lead

    CelcomDigi • SelangorMalaysia, Selangor, Malaysia
    A leading digital services provider in Malaysia is seeking a Senior Digital Platform Ops Specialist to enhance the stability and reliability of digital platforms. Responsibilities include applicatio...Show more
    Last updated: 1 day ago • Promoted
    Cloud Operations Engineer (Platform Reliability / NOC)

    Cloud Operations Engineer (Platform Reliability / NOC)

    Agensi Pekerjaan Genie Hunt Talent • Petaling Jaya, Selangor, Malaysia
    Quick Apply
    You'll play a key role in keeping our client-facing applications, APIs and cloud infrastructure running smoothly ensuring uptime, performance and reliability across multiple environments.This role ...Show more
    Last updated: 25 days ago
    System Reliability Engineer, Consultant

    System Reliability Engineer, Consultant

    AIA Malaysia • Kuala Lumpur, Kuala Lumpur, Malaysia
    AIA Malaysia Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.System Reliability Engineer, Consultant.At AIA we’ve started an exciting movement to create a healthier, more sustainable futu...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineers (Middle & Senior)

    Site Reliability Engineers (Middle & Senior)

    PEOPLE PROFILERS • Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Site Reliability Engineers (Middle & Senior).People Profilers is hiring on behalf of. Site Reliability Engineers (Consultant & Senior Consul...Show more
    Last updated: 30+ days ago • Promoted
    Equipment Reliability Engineer — Hybrid / Remote

    Equipment Reliability Engineer — Hybrid / Remote

    Renesas Electronics • Shah Alam, Shah Alam, Malaysia
    A leading semiconductor solution provider in Malaysia is seeking an enthusiastic individual for equipment maintenance role. Responsibilities include resolving operational problems, managing maintena...Show more
    Last updated: 2 days ago • Promoted
    DevOps / Site Reliability Engineer (Malaysia)

    DevOps / Site Reliability Engineer (Malaysia)

    InsiderSecurity • Kuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    Build automation for DevOps and be its advocate in the product teams.Build automation for high availability and robustness of our infrastructure. Monitor our infrastructure health to ensure high ava...Show more
    Last updated: 30+ days ago
    Senior Platform Reliability Engineer - Container & Cloud

    Senior Platform Reliability Engineer - Container & Cloud

    Abhidi Solution • Kuala Lumpur, Kuala Lumpur, Malaysia
    A leading IT consulting firm is seeking an experienced Assistant Manager - Talent Acquisition to oversee platform reliability and lead a technical team. The role requires strong experience in DevOps...Show more
    Last updated: 1 day ago • Promoted
    System Reliability Engineer, Consultant

    System Reliability Engineer, Consultant

    AIA Hong Kong and Macau • Kuala Lumpur, Kuala Lumpur, Malaysia
    At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.As pioneering innovators for over 100 years, we’re now transforming our organisation to be fast...Show more
    Last updated: 5 days ago • Promoted
    Senior Site Reliability Engineer (Windows)

    Senior Site Reliability Engineer (Windows)

    Hytech Consulting Management Sdn Bhd • Kuala Lumpur, Kuala Lumpur, Malaysia
    Senior Site Reliability Engineer (Windows).Ensure high availability and performance of Windows infrastructure, applications, and services. Windows AD Policies, optimization & hardening in Windows en...Show more
    Last updated: 5 days ago • Promoted
    Site Reliability Engineer (Windows)

    Site Reliability Engineer (Windows)

    Hytech • Kuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer (Windows).Hytech Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Ensure high availability and performance of Windows infrastructure, applications, and services.D...Show more
    Last updated: 2 days ago • Promoted
    Platform Reliability & Cloud Operations Lead

    Platform Reliability & Cloud Operations Lead

    CelcomDigi • Petaling Jaya, Selangor, Malaysia
    A leading telecommunications company in Malaysia is seeking a Senior Digital Platform Ops Specialist to manage and optimize digital platforms supporting consumer and enterprise services.The ideal c...Show more
    Last updated: 4 days ago • Promoted
    Platform Engineer (Azure)

    Platform Engineer (Azure)

    Nintex • Kuala Lumpur, Kuala Lumpur, Malaysia
    At Nintex, we are transforming the way people work, everywhere.As the global standard for process intelligence and automation, we're trusted by over 10,000 public and private sector organizations a...Show more
    Last updated: 1 day ago • Promoted
    Senior Site Reliability Engineer (L3)

    Senior Site Reliability Engineer (L3)

    Coingecko • Kuala Lumpur, Kuala Lumpur, Malaysia
    CoinGecko is a global leader in tracking cryptocurrency data.Operating since 2014, CoinGecko has built the world's largest cryptocurrency data platform, tracking over 10,000 tokens across more than...Show more
    Last updated: 5 days ago • Promoted
    System Reliability Engineer, Consultant

    System Reliability Engineer, Consultant

    AIA Hong Kong • Kuala Lumpur, Kuala Lumpur, Malaysia
    At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.As pioneering innovators for over 100 years, we’re now transforming our organisation to be fast...Show more
    Last updated: 5 days ago • Promoted
    Platform Engineer

    Platform Engineer

    Hyred • Kuala Lumpur, Malaysia
    Quick Apply
    Our client specialises in building Agentic AI systems.Our client is looking to onboard a Platform Engineer to serve as one of the principal architects of SupplyOS, their agentic AI platform that po...Show more
    Last updated: 1 day ago
    Senior Site Reliability Engineer - Global Consulting, KL

    Senior Site Reliability Engineer - Global Consulting, KL

    PEOPLE PROFILERS • Kuala Lumpur, Kuala Lumpur, Malaysia
    A leading global consulting firm in Kuala Lumpur seeks Site Reliability Engineers at Consultant and Senior Consultant levels. The role includes leveraging DevOps practices, analytical skills, and ef...Show more
    Last updated: 3 days ago • Promoted
    Platform Engineer (AWS)

    Platform Engineer (AWS)

    Promapp • Kuala Lumpur, Kuala Lumpur, Malaysia
    At Nintex, we are transforming the way people work, everywhere.As the global standard for process intelligence and automation, we’re trusted by over 10,000 public and private sector organizations a...Show more
    Last updated: 3 days ago • Promoted