Talent.com
Site Reliability Engineer

Site Reliability Engineer

Unison GroupKuala Lumpur, Federal Territory of Kuala Lumpur, MY
30+ days ago
Job type
  • Quick Apply
Job description

As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on key SRE practices such as Service Level Objectives (SLOs), Service Level Indicators (SLIs), and the reduction of operational toil. You will collaborate closely with diverse teams to drive reliability improvements and foster a culture of continuous learning and accountability.

Requirements

Key Responsibilities :

Design and implement resilient system architectures that support high availability and scalability.

Develop automation tools and scripts to enhance operational efficiency and reduce manual effort.

Define, track, and analyze SLOs and SLIs to ensure reliability and performance meet business needs.

Conduct thorough post-mortem analyses following incidents, driving continuous improvement through root cause identification and solution implementation.

Collaborate with development and operations teams to establish best practices in system reliability and incident management.

Troubleshoot and resolve issues related to database performance, network connectivity, and deployment failures, including diagnosing problems at the underlying platform level (e.g., Kubernetes, virtual machines).

Ensure that issues are resolved within the stipulated Service Level Agreements (SLAs), maintaining high standards of service delivery.

Identify and troubleshoot performance bottlenecks across systems, providing actionable recommendations for enhancements.

Maintain detailed documentation of processes and incident responses to support knowledge sharing and compliance.

Qualifications :

Proficiency in programming languages such as Python, Golang, Java, or similar, focusing on operational efficiency.

Demonstrated experience in system architecture and design, prioritizing reliability, and scalability.

Strong understanding of SRE principles, including SLOs, SLIs, toil reduction, and incident post-mortems.

Experience with cloud environments (e.g., AWS, Azure, Google Cloud) and their operational management.

Strong expertise in Linux system administration.

Proven experience in troubleshooting application support issues with a focus on performance and connectivity.

Familiarity with networking concepts and effective troubleshooting techniques.

Excellent problem-solving abilities and a proactive approach to operational challenges.

Ability to work independently while effectively collaborating within a team environment.

Preferred Skills :

Familiarity with monitoring tools and performance optimization techniques.

Experience in scripting or automation for system administration tasks.

Knowledge of networking concepts and troubleshooting methodologies.

Hands-on knowledge of cloud platforms (e.g., AWS, Azure, Google Cloud) and their services.

Familiarity with DevOps practices and frameworks, including CI / CD, infrastructure as code, and containerization.

Create a job alert for this search

Site Engineer • Kuala Lumpur, Federal Territory of Kuala Lumpur, MY

Related jobs
  • Promoted
Platform / Site Reliability Engineer

Platform / Site Reliability Engineer

POWER IT SERVICESKuala Lumpur, Kuala Lumpur, Malaysia
Platform Reliability Engineer (PRE) is responsible for engineering, operating, and maintaining internal container platform and its supporting infrastructure, with a strong focus on reliability, res...Show moreLast updated: 1 day ago
  • Promoted
R&D Senior Engineer - Control Software Design / Embedded

R&D Senior Engineer - Control Software Design / Embedded

Daikin Malaysia Sdn BhdSungai Buloh, Selangor, Malaysia
Power the future of HVAC & IoT.We’re looking for a Senior Engineer, Control Software to join our dynamic R&D team.If you love turning complex requirements into clean, reliable code and want to work...Show moreLast updated: 30+ days ago
  • Promoted
Senior Sales Engineer

Senior Sales Engineer

SophosNilai, Negeri Sembilan, Malaysia
Sophos is a global leader and innovator of advanced security solutions designed to defeat cyberattacks.The company acquired Secureworks in February 2025, creating the largest pure‑play Managed Dete...Show moreLast updated: 1 day ago
  • Promoted
Senior Software Engineer – Full Time - Remote

Senior Software Engineer – Full Time - Remote

The FlexKuala Selangor, Kuala Selangor, Malaysia
Senior Software Engineer – Full Time - Remote.Location : The Flex Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia. Join the team reinventing how the world rents.At The Flex, we believe rent...Show moreLast updated: 7 days ago
  • Promoted
Senior Engineer, Supplier Quality (Based in Malaysia, 1 year WFH contract)

Senior Engineer, Supplier Quality (Based in Malaysia, 1 year WFH contract)

PCIPort Klang, Port Klang, Malaysia
Senior Engineer, Supplier Quality (Based in Malaysia, 1 year WFH contract).PCI Private Limited is looking for an experienced Senior Engineer, Supplier Quality to join our Quality Assurance departme...Show moreLast updated: 8 days ago
Site Engineer

Site Engineer

SolarvestPetaling Jaya, Selangor, MY
Quick Apply
The Site Engineer is the technical lead at project sites, responsible for conducting energy audits, managing technical data collection, and supporting the execution of energy efficiency (EE) soluti...Show moreLast updated: 29 days ago
  • Promoted
PROTEGE - Facilities / Site Engineer

PROTEGE - Facilities / Site Engineer

Airbus Helicopters Malaysia SDN. BHD.Subang, Malaysia
Job Description : • • Airbus is seeking motivated, fresh graduate talent to join a specific department for a •10-month Protégé Program •. This program is for •Malaysian fresh graduates only •, as manda...Show moreLast updated: 1 day ago
  • Promoted
BIM Modeller

BIM Modeller

Redstack (M) Sdn BhdSetia Alam, Selangor, Malaysia
Diploma / Degree in Mechanical, Electrical, or Building Services Engineering, Architecture, or related field.BIM modelling (fresh graduates with strong Revit / AutoCAD MEP skills may be considered).Pro...Show moreLast updated: 19 days ago
  • Promoted
Platform Reliability Engineer (Tanzu)

Platform Reliability Engineer (Tanzu)

HCL Technologies Malaysia SDN BHDKuala Lumpur, Kuala Lumpur, Malaysia
Platform Reliability Engineer (PRE) is responsible for engineering, operating, and maintaining GEL’s internal container platform and its supporting infrastructure, with a strong focus on reliabilit...Show moreLast updated: 1 day ago
  • Promoted
Machine Learning Engineer

Machine Learning Engineer

Second TalentSepang, Selangor, Malaysia
Member of Technical Staff - Environments (ML).As an Environment Engineer (ML), you will build on top of our core platform to create the simulation environments in which frontier coding agents learn...Show moreLast updated: 1 day ago
  • Promoted
QA Engineer / Senior Engineer / Asst Manager QA

QA Engineer / Senior Engineer / Asst Manager QA

Wistron Technology (Malaysia) Sdn BhdKlang City, Selangor, Malaysia
Add expected salary to your profile for insights.Candidate must willing to work in Port Klang Selangor.Compilation of quality control information and drives for quality improvement.Prepare and perf...Show moreLast updated: 1 day ago
  • Promoted
Senior Web Developer (Remote)

Senior Web Developer (Remote)

RemotelySelayang Municipal Council, Selayang Municipal Council, Malaysia
We are currently searching for a Senior Web Engineer to join us and work as part of an enthusiastic, motivated, and delivery focused agile team. You will have the opportunity to work on all aspects ...Show moreLast updated: 1 day ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

SwiftKuala Lumpur, Kuala Lumpur, Malaysia
Site Reliability Engineer (DevOps) – Swift.Location : Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Senior Level : Mid‑senior level. Job Function : Engineering and Information Technology.Sw...Show moreLast updated: 1 day ago
Site Engineer - Central

Site Engineer - Central

SolarvestPetaling Jaya, Selangor, MY
Quick Apply
Supervise and manage daily solar project activities on site.Ensure work quality, safety compliance, and timely project progress. Coordinate effectively with clients, engineers, and subcontractors to...Show moreLast updated: 13 days ago
  • Promoted
Platform Reliability Engineer (Tanzhu)

Platform Reliability Engineer (Tanzhu)

HCLTechKuala Lumpur, Kuala Lumpur, Malaysia
Selected candidates need to relocate Kuala Lumpur, Malaysia.Visa & flight tickets will be sponsored by the company.Platform Reliability Engineer (PRE) is responsible for engineering, operating, and...Show moreLast updated: 1 day ago
Site Reliability Engineer (SRE) / Devops Engineer

Site Reliability Engineer (SRE) / Devops Engineer

Unison GroupKuala Lumpur, Federal Territory of Kuala Lumpur, MY
Quick Apply
As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Show moreLast updated: 30+ days ago
  • Promoted
Airbus - PROTEGE - Facilities / Site Engineer

Airbus - PROTEGE - Facilities / Site Engineer

Airbus Helicopters Malaysia SDN. BHD.Subang, Malaysia
Job Description : • • Airbus is seeking motivated, fresh graduate talent to join a specific department for a •10-month Protégé Program •. This program is for •Malaysian fresh graduates only •, as manda...Show moreLast updated: 1 day ago
  • Promoted
Utilities Engineer

Utilities Engineer

LonzaBatu Arang, Selangor, Malaysia
Today, Lonza is a global leader in life sciences operating across five continents.While we work in science, there’s no magic formula to how we do it. Our greatest scientific solution is dedicated in...Show moreLast updated: 15 days ago