Talent.com
Platform Reliability Engineer

Platform Reliability Engineer

POWER IT SERVICESKuala Lumpur, Kuala Lumpur, Malaysia
1 day ago
Job description

Job Purpose

Reliability Engineer (PRE) is responsible for engineering, operating, and maintaining internal container platform and its supporting infrastructure, with a strong focus on reliability, resiliency, and security. As a Senior PRE within the Infrastructure team you will play a pivotal role in designing, building, and operating distributed container hosting solutions.

The Job

  • As a Senior Platform Reliability Engineer, you will play a key role in maintaining the stability, reliability, and efficiency of the internal container platform and its supporting infrastructure.
  • Your responsibilities will include core operational tasks such as resource provisioning and management, responding to platform and application outages, capacity planning, monitoring, and driving reliability enhancements.
  • You will continuously evaluate the platform’s technical architecture to ensure it scales effectively with evolving application demands.
  • This includes proactively identifying and resolving reliability issues, analyzing product dependencies, pinpointing performance bottlenecks, and implementing optimization strategies to enhance platform availability and cost efficiency.
  • In this role you will participate in a 24 / 7 on‑call rotation, promptly addressing alerts from the global monitoring team and resolving production incidents to maintain platform and application uptime.
  • You will regularly review team workflows to identify manual processes and implement automation solutions that reduce effort and minimize human error.
  • Regularly review the security advisory issued by Broadcom related to the Tanzu suite of products and deploy product updates as required to keep the platform vulnerable‑free.
  • Work with open‑source technologies, CI / CD, and SCM tools such as Bitbucket, implementing organization containers (e.g., Docker and Kubernetes). Stay current with industry trends and propose new ways for our business to improve.
  • Take accountability for business and regulatory compliance risks and take appropriate steps to mitigate them.
  • Maintain awareness of industry trends on regulatory compliance, emerging threats, and technologies to safeguard the company.
  • Highlight any potential concerns or risks and proactively share best practices.

Seniority Level

Mid‑Senior level

Employment Type

Full‑time

Job Function

Information Technology

Industries

IT Services and IT Consulting

#J-18808-Ljbffr

Create a job alert for this search

Reliability Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

Related jobs
  • Promoted
Site Reliability Engineer III

Site Reliability Engineer III

Guidewire SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
At Guidewire, we make software that offers Property and Casualty (P&C) Insurance companies the tools to take care of their customers when they need it the most, whether that’s a time of crisis, a n...Show moreLast updated: 30+ days ago
  • Promoted
System Reliability Engineer, Principal

System Reliability Engineer, Principal

AIA MalaysiaKuala Lumpur, Kuala Lumpur, Malaysia
AIA Malaysia Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.System Reliability Engineer, Principal.AIA Malaysia Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Direct message t...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

HCL Singapore Pte LtdCyberjaya, Selangor, Malaysia
Administer and support VMware environments including VCF, VCD, NSX, ESXi, vCenter, vSAN, vRA / vRO, and Tanzu.Design, implement, and maintain automation scripts and tools to improve system reliabilit...Show moreLast updated: 4 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Swift SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
Site Reliability Engineer page is loaded## Site Reliability Engineerlocations : Kuala Lumpur, Malaysiaposted on : Posted Todayjob requisition id : We’re the world’s leading provider of secure f...Show moreLast updated: 2 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

SWIFTKuala Lumpur, Kuala Lumpur, Malaysia
We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Show moreLast updated: 1 day ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

CanonicalSelayang Municipal Council, Selayang Municipal Council, Malaysia
Site Reliability Engineer role at Canonical.We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices. To succeed in this role, you need to ...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Russell TobinKuala Lumpur, Kuala Lumpur, Malaysia
Job Opportunity : Site Reliability Engineer (SRE) in Cyberjaya.Note : Only Malaysian locals or PR holders can apply.We are looking for a Site Reliability Engineer (SRE) to join our forward-thinking C...Show moreLast updated: 22 days ago
  • Promoted
Specialist, Site Reliability Engineer (SRE)

Specialist, Site Reliability Engineer (SRE)

TNG DigitalKuala Lumpur, Kuala Lumpur, Malaysia
Specialist, Site Reliability Engineer (SRE).We are hiring for a Specialist, Site Reliability Engineer (SRE) to join our team. Role focuses on network administration, cloud infrastructure management,...Show moreLast updated: 18 days ago
  • Promoted
System Reliability Engineer

System Reliability Engineer

BusinesslistKuala Lumpur, Kuala Lumpur, Malaysia
Monitor and maintain system performance, ensuring uptime and reliability across all infrastructure.Develop and implement automation tools to improve system efficiency and reduce manual intervention...Show moreLast updated: 14 days ago
  • Promoted
System Reliability Engineer

System Reliability Engineer

Michael PageKuala Lumpur, Kuala Lumpur, Malaysia
Opportunity to work on cutting-edge projects.A dynamic fintech organization known for its commitment to simplifying online payment solutions through secure, user-friendly technology.The company fos...Show moreLast updated: 6 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

RazerKuala Lumpur, Kuala Lumpur, Malaysia
Joining Razer will place you on a global mission to revolutionize the way the world games.LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.We ar...Show moreLast updated: 30+ days ago
  • Promoted
Azure Site Reliability Engineer

Azure Site Reliability Engineer

Experian Asia PacificCyberjaya, Selangor, Malaysia
We are seeking a skilled Azure Cloud DevOps Engineer to join our team.The ideal candidate will have a strong background in DevOps practices, cloud solutions, and network engineering in Microsoft Az...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Tata Consultancy ServicesKuala Lumpur, Kuala Lumpur, Malaysia
Talent Acquisition | Human Resource Executive | Tata Consultancy Service.Join Tata Consultancy Services, Asia Pacific and be part of an organization committed to sustainable development for our fut...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

FPT SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
Design and maintain scalable failover systems, backup strategies, and redundancy mechanisms across cloud and on-prem environments. Develop and update DR documentation, runbooks, and recovery playboo...Show moreLast updated: 4 days ago
  • Promoted
Site Reliability Engineer - Kuala Lumpur, Malaysia

Site Reliability Engineer - Kuala Lumpur, Malaysia

Kneat SolutionsKuala Lumpur, Kuala Lumpur, Malaysia
Site Reliability Engineer – Kuala Lumpur, Malaysia.Kneat enables regulated organizations to move from paper-based validation to intelligent, digitized, paperless solutions.And we do it through the ...Show moreLast updated: 2 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Jobstreet MalaysiaKuala Lumpur, Kuala Lumpur, Malaysia
Infrastructure & Server Design.Plan, configure, and optimize on-premise and cloud servers, including DNS, IP routing, and load balancing. .Manage network devices, bandwidth, and traffic; implement Q...Show moreLast updated: 6 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

HCLTechSepang, Selangor, Malaysia
Administer and support VMware environments including VCF, VCD, NSX, ESXi, vCenter, vSAN, vRA / vRO, and Tanzu.Design, implement, and maintain automation scripts and tools to improve system reliabilit...Show moreLast updated: 4 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Razer Inc.Kuala Lumpur, Kuala Lumpur, Malaysia
Bangsar South, Federal Territory of Kuala Lumpur, Malaysia.Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you ...Show moreLast updated: 4 days ago