Site Reliability Engineer

MVC Resources
Kuala Lumpur, Malaysia

5 hours ago

Job description

Site: MVC Resources Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia

Overview

We are seeking a Site Reliability Engineer to join our team in Kuala Lumpur. The role focuses on building and maintaining robust, scalable, and resilient systems to ensure reliability and performance of critical services.

Responsibilities

Monitor and maintain system performance to ensure high availability and reliability.
Design and implement resilient system architectures.
Develop automation tools and scripts to enhance operational efficiency.
Define, track, and analyze SLOs and SLIs to meet business needs.
Conduct post-mortem analyses after incidents and drive continuous improvement.
Collaborate with development teams to establish and promote best practices.
Troubleshoot and resolve issues related to database performance, network connectivity, and deployment failures.
Identify and resolve performance bottlenecks in applications and infrastructure.
Participate in on-call rotations and respond to critical incidents.
Analyze system logs and metrics to identify trends and opportunities for improvement.

Qualifications

Strong experience with Linux systems and distributed computing fundamentals.
Proven experience troubleshooting application issues, with a focus on performance and connectivity.
Familiarity with networking concepts and effective troubleshooting techniques.
Experience in Bash/Shell scripting or automation for system administration tasks.
Experience in programming languages such as Python, Golang, or Java.
Demonstrated experience in system architecture and design, prioritizing reliability and scalability.
Understanding of SRE principles, including SLOs, SLIs, toil reduction, and incident post-mortems.
Hands-on experience with cloud environments (AWS, Azure, Google Cloud) and their operational management.
Excellent problem-solving abilities and a proactive approach to operational challenges.
Ability to work independently while collaborating effectively within a team.
Open to a rotational shift schedule across different time slots, with schedules shared in advance.
Ability to communicate effectively in Mandarin is an added advantage.

Preferred Skills

Observability & Monitoring: Prometheus, Grafana, Alertmanager, Loki, Jaeger/Tempo, OpenTelemetry
Containerization & Orchestration: Kubernetes, Helm, service mesh (Istio/Linkerd)
Big Data & Streaming: Apache Flink, Kafka, Spark
Infrastructure as Code & Automation: Terraform, Ansible, CI/CD pipelines
Cloud Platforms: AWS, Azure, GCP
Programming & Scripting: Python, Go, Bash
Resiliency & Reliability Engineering: Incident response, RCA, chaos engineering, disaster recovery

Shift

Morning (Hybrid): 7am - 4pm
Afternoon (Flexi): 3pm - 12am
Night (Flexi): 11pm - 8am

Contact

Reach out to Cheyenne with your updated resume for a private conversation.

Employment details

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Information Technology
Industries: Online Audio and Video Media
