Talent.com
Site Reliability Engineer

Site Reliability Engineer

FINEXUS GroupKuala Lumpur, Kuala Lumpur, Malaysia
12 hari lalu
Penerangan pekerjaan

Site Reliability Engineer

Location : FINEXUS Group, Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.

Responsibilities

  • Ensure high availability and reliability of IT systems, applications, and PCI DSS‑certified data centers, supporting both internal operations and client‑facing platforms.
  • Perform system administration of Linux and Windows servers, including installation, configuration, patching, monitoring, and performance tuning.
  • Manage data storage, backup, and disaster recovery (DRP) to ensure data integrity, resilience, and compliance with industry standards.
  • Conduct capacity planning and lifecycle management of infrastructure resources, ensuring optimal performance and scalability.
  • Define and monitor Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets to measure and improve reliability.
  • Implement chaos testing and fault‑injection practices to proactively identify weaknesses.
  • Optimize observability and alerting systems (e.g., Prometheus, Grafana, ELK, Nagios or equivalent) to ensure actionable insights and minimal alert fatigue.
  • Implement and maintain system and network security controls, including firewall management, VPN, identity / access management, and endpoint security.
  • Support compliance with BNM RMiT, PCI DSS, ISO 27001 standards and external audits.
  • Manage logs and integrate SIEM platforms to strengthen monitoring and incident response.
  • Support vulnerability management and coordinate with Security Operations teams for patching.
  • Deploy, configure, and maintain Kubernetes clusters (SUSE Rancher Prime) and containerized workloads.
  • Build and maintain CI / CD pipelines for automated deployment, testing, and operational efficiency.
  • Automate configuration and patch management using tools such as Ansible, Puppet, or equivalent.
  • Implement IaC using Terraform or equivalent for consistent environment provisioning.
  • Automate auto‑healing and self‑recovery scripts to reduce MTTR.
  • Optimize cost and performance for cloud and container workloads.
  • Administer and troubleshoot DNS, DHCP, VPN, load balancers, and core network services.
  • Support virtualization platforms and physical server infrastructure within data centers.
  • Collaborate on zero‑trust segmentation and service mesh integration.
  • Provide on‑call support, collaborate on incident resolution, and maintain runbooks.
  • Lead post‑incident reviews (PIRs) and blameless retrospectives.
  • Leverage AIOps or event‑correlation tools for proactive incident detection.

Requirements

  • Bachelor’s or Master’s Degree in Computer Science, IT, Engineering or related field.
  • 4+ years of experience in Site Reliability Engineering, System Administration or IT Infrastructure.
  • Proven experience in Linux and Windows system administration.
  • Hands‑on experience with cloud operations (AWS, Azure, GCP) and container orchestration (Kubernetes, Rancher).
  • Strong knowledge of networking, firewalls, DNS, DHCP, VPN, and enterprise security best practices.
  • Experience in database management (MySQL, PostgreSQL, or equivalent) including backup, tuning, and recovery.
  • Knowledge of compliance frameworks (PCI DSS, ISO 27001, BNM RMiT) is highly desirable.
  • Strong problem‑solving and troubleshooting skills in mission‑critical environments.
  • Excellent communication skills in English and Malay (spoken and written).
  • Ability to work independently and collaboratively in a fast‑paced, regulated technology environment.
  • Experience with SRE toolchains : Prometheus, Grafana, ELK, Terraform, Ansible, Jenkins, GitLab CI / CD or equivalent.
  • Relevant certifications such as AWS Certified SysOps Administrator, RHCE, Kubernetes Administrator (CKA), or ISO 27001 Implementer are an advantage.
  • #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    Site Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

    Pekerjaan berkaitan
    • Dinaikkan pangkat
    Site Reliability Engineer III

    Site Reliability Engineer III

    Guidewire SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
    At Guidewire, we make software that offers Property and Casualty (P&C) Insurance companies the tools to take care of their customers when they need it the most, whether that’s a time of crisis, a n...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Platform / Site Reliability Engineer

    Platform / Site Reliability Engineer

    POWER IT SERVICESKuala Lumpur, Kuala Lumpur, Malaysia
    Platform Reliability Engineer (PRE) is responsible for engineering, operating, and maintaining internal container platform and its supporting infrastructure, with a strong focus on reliability, res...Tunjukkan lagiKemas kini terakhir: 1 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Swift SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer page is loaded## Site Reliability Engineerlocations : Kuala Lumpur, Malaysiaposted on : Posted Todayjob requisition id : We’re the world’s leading provider of secure f...Tunjukkan lagiKemas kini terakhir: 10 hari yang lalu
    • Dinaikkan pangkat
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SWIFTKuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    SWIFTKuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Tunjukkan lagiKemas kini terakhir: 9 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    HCLTechCyberjaya, Selangor, Malaysia
    Administer and support VMware environments including VCF, VCD, NSX, ESXi, vCenter, vSAN, vRA / vRO, and Tanzu.Design, implement, and maintain automation scripts and tools to improve system reliabilit...Tunjukkan lagiKemas kini terakhir: 9 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer (SRE) / Devops Engineer

    Site Reliability Engineer (SRE) / Devops Engineer

    Unison Consulting Pte LtdKuala Lumpur, Kuala Lumpur, Malaysia
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    SwiftKuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer (DevOps) – Swift.Location : Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Senior Level : Mid‑senior level. Job Function : Engineering and Information Technology.Sw...Tunjukkan lagiKemas kini terakhir: 1 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    RiDiK (a Subsidiary of CLPS. Nasdaq : CLPS)Kuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer (SRE).We are seeking a skilled Site Reliability Engineer (SRE) to join our Cloud Engineering team in Cyberjaya. You will be responsible for ensuring the availability, perfo...Tunjukkan lagiKemas kini terakhir: 3 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Encora Inc.Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Encora is a global digital engineering company specializing in AI, Cloud, and Data solutions to help enterprises become agile and adaptable...Tunjukkan lagiKemas kini terakhir: 4 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesKuala Lumpur, Kuala Lumpur, Malaysia
    Talent Acquisition | Human Resource Executive | Tata Consultancy Service.Join Tata Consultancy Services, Asia Pacific and be part of an organization committed to sustainable development for our fut...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Unison Consulting Pte LtdKuala Lumpur, Kuala Lumpur, Malaysia
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Smart Teq Solution Sdn BhdKuala Lumpur, Kuala Lumpur, Malaysia
    Ensure all our infrastructure are running at optimal condition.Provide deployment, patches and update on all services that running on public cloud and on premise. Identify and resolve support ticket...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SwiftKuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Tunjukkan lagiKemas kini terakhir: 18 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    CanonicalSubang Jaya, Selangor, Malaysia
    Site Reliability Engineer role at Canonical.We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices. To succeed in this role, you need to ...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    iSoftStoneKuala Lumpur, Kuala Lumpur, Malaysia
    SoftStone – Federal Territory of Kuala Lumpur, Malaysia.A leading global technology group, renowned for its extensive ecosystem of digital services and platforms. With a strong presence in cloud com...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Swift SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
    Lead Site Reliability Engineer page is loaded## Lead Site Reliability Engineerlocations : Kuala Lumpur, Malaysiatime type : Full timeposted on : Posted Todayjob requisition id : We’re the worl...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Razer Inc.Kuala Lumpur, Kuala Lumpur, Malaysia
    Bangsar South, Federal Territory of Kuala Lumpur, Malaysia.Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you ...Tunjukkan lagiKemas kini terakhir: 12 hari yang lalu