Talent.com
This job offer is not available in your country.
Lead Site Reliability Engineer

Lead Site Reliability Engineer

Hexa BusinessPetaling Jaya, Selangor, Malaysia
17 hours ago
Job description

Job Description Summary :

You own the stability, scalability, and performance of critical infrastructure running complex data pipelines at enterprise scale. This is a hands-on leadership role for a seasoned engineer who drives engineering rigor, automation, and reliability across global teams. You bring deep technical mastery in big data systems, automation, and coding, combined with proven success leading projects or teams. Your work prevents outages, accelerates deployments, and raises operational standards.

Responsibilities :

  • Lead end-to-end reliability engineering for large-scale data ingestion and processing platforms.
  • Architect, build, and automate infrastructure using Ansible, Terraform, Kubernetes, and OpenShift.
  • Develop and enforce coding and testing standards, ensuring clean, maintainable, production-grade code in Java / Python.
  • Drive CI / CD pipelines with Jenkins, Maven, Git, Docker — delivering frequent, stable releases.
  • Mentor engineers, enforce accountability, and lift team maturity on reliability best practices.
  • Use telemetry, monitoring, and metrics to proactively identify risk and prevent incidents.
  • Collaborate closely with internal customers and global teams to solve complex problems quickly and effectively.

Requirements :

  • 12+ years of hands-on systems, DevOps, or SRE experience in large, international enterprises.
  • 2+ years leading teams or complex projects with clear ownership of outcomes.
  • Expert-level experience with Elastic Stack, Kafka, Logstash, and Kibana for data ingestion.
  • Mastery of Infrastructure as Code tools : Ansible, Terraform, Kubernetes, OpenShift.
  • Strong programming skills in Java or Python, with deep understanding of OOP and software engineering principles.
  • Experience with microservices, REST / SOAP APIs, and NoSQL databases.
  • Familiarity with ITIL processes and Agile delivery models.
  • The ability to operate effectively across cultures and time zones, driving alignment and delivery.
  • Create a job alert for this search

    Site Reliability Engineer • Petaling Jaya, Selangor, Malaysia

    Related jobs
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    HCL Singapore Pte LtdAmpang, Kuala Lumpur, Malaysia
    We are seeking a highly skilled and customer-focused Site Reliability Engineer (SRE) with deep expertise in VMware technologies and a strong background in system administration and automation.Admin...Show moreLast updated: 17 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Avensys ConsultingKuala Lumpur, Kuala Lumpur, Malaysia
    Our client’s project is a well-established brand in the IT industry who is now looking for a passionate and driven Site Reliability Engineer. This is an exciting opportunity to expand your skill set...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Horizontal TalentsTaman Tun Dr Ismail, Kuala Lumpur, Malaysia
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Show moreLast updated: 17 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    GlintsKuala Lumpur, Kuala Lumpur, Malaysia
    Recruitment Consultant at Glints Singapore.Monitor and maintain system performance to ensure the stability and reliability of applications and infrastructure. Design and implement resilient system a...Show moreLast updated: 25 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Unison GroupKuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Show moreLast updated: 20 days ago
    DevOps / Site Reliability Engineer (Malaysia)

    DevOps / Site Reliability Engineer (Malaysia)

    InsiderSecurityKuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    Build automation for DevOps and be its advocate in the product teams.Build automation for high availability and robustness of our infrastructure. Monitor our infrastructure health to ensure high ava...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    UnisonTech Consulting Sdn BhdKlang, Selangor, Malaysia
    We are looking for a detail-oriented and proactive.Proficiency in programming languages such as Python, Golang, Java, or similar, focusing on operational efficiency. Fluency in Mandarin, both writte...Show moreLast updated: 17 hours ago
    • Promoted
    Site Reliability Engineer (L2 Support)

    Site Reliability Engineer (L2 Support)

    CareCone GroupKuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer (L2 Support) role at CareCone Group in Kuala Lumpur, Malaysia.Responsible for end-to-end application support, production incident handling, platform monitoring, and coordi...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Russell TobinCyberjaya, Selangor, Malaysia
    Senior Associate - Talent Acquisition - Corporate Strategy Hiring | Specialized in APAC.Strong hands-on experience with using and designing VMware solution such as NSX-T, vRealize Suite, vSphere / vC...Show moreLast updated: 1 day ago
    • Promoted
    Reliability Engineer (R&D)

    Reliability Engineer (R&D)

    Daikin Malaysia Sdn BhdSungai Buloh, Selangor, Malaysia
    Oversee daily test operations, including equipment setup, maintenance, and facility upgrades.Collaborate with designers to conduct tests and ensure compliance with specifications and standards.Mana...Show moreLast updated: 17 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Swift SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
    Senior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations : Kuala Lumpur, Malaysiaposted on : Posted Todayjob requisition id : We’re the world’s leading provid...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    U3 InfoTech Pte LtdKuala Lumpur, Kuala Lumpur, Malaysia
    Position : Site Reliability Engineer (SRE).Duration : 2 years (direct contract & convertible to permanent).Experience : 3-8 years (Multiple headcounts). As a Site Reliability Engineer (SRE), you will p...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    MOL ACCESSPORTAL SDN. BHD.Bangsar, Kuala Lumpur, Malaysia
    Design, implement, and maintain Infrastructure as Code (IaC) Collaborate with development and operations teams to ensure IaC best practices are followed. Participate in architecture reviews to provi...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Senior SRE Engineer

    Senior SRE Engineer

    RHB Banking GroupKuala Lumpur, Kuala Lumpur, Malaysia
    We are seeking a highly motivated Senior Site Reliability Engineer (SRE) to join our Technology team at RHB Banking Group. In this role, you will engineer self-service, reliable systems to support h...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    BoostKuala Lumpur, Kuala Lumpur, Malaysia
    The 'tech-tonic' shift has reshaped our day-to-day aspects and we, at Boost, aspire to shake things up further in the financial services scene. In the last 5 years, some of our highlights include : m...Show moreLast updated: 17 hours ago
    Site Reliability Engineer (SRE) / Devops Engineer

    Site Reliability Engineer (SRE) / Devops Engineer

    Unison GroupKuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Show moreLast updated: 20 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    RHB Banking GroupBandar Baru Bangi, Selangor, Malaysia
    Drive SRE practice and deliver the highest level of system and infrastructure resiliency that meets business and regulatory requirements. Drive consistent SRE practice across all application, infras...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    OM CONNECT SDN BHD (OpenMinds®)Klang, Selangor, Malaysia
    The Site Reliability Engineer (SRE) ensures the reliability and performance of critical services, bridging development and operations. The role focuses on scalable infrastructure, SRE practices such...Show moreLast updated: 17 hours ago