Site Reliability Engineer

Russell Tobin
Kuala Lumpur, Kuala Lumpur, Malaysia
Posted 14 hours ago
Job description

Overview

Job Opportunity: Site Reliability Engineer (SRE) in Cyberjaya. Experience: 5 to 7 years. Note: Only Malaysian locals or PR holders can apply.

About the Role

We are looking for a Site Reliability Engineer (SRE) to join our forward-thinking Cloud Engineering team in Cyberjaya. This role is perfect for someone who thrives at the intersection of software development and systems engineering, with a strong passion for automation, cloud operations, and continuous improvement. As an SRE, you will play a key role in ensuring the availability, performance, scalability, and security of our cloud platform solutions while building tools to improve efficiency and reliability.

Key Responsibilities

  • Monitor and maintain the availability, latency, and performance of cloud systems.
  • Automate operational tasks to improve system reliability and reduce manual effort.
  • Participate in the full lifecycle of cloud solutions – from architecture and deployment to ongoing operations and optimization.
  • Develop and implement infrastructure as code (IaC), orchestration, and automation for sustainable scaling.
  • Support pre-launch planning, including system design consultations, and maintain post-launch system health with robust monitoring and alerting.
  • Lead patch management and version control for systems and middleware (e.g., RedHat, WebSphere, WebLogic, MSSQL).
  • Create and maintain VMware server templates (e.g., RedHat, Windows Server) and other cloud-native resources.
  • Leverage technologies like VMware NSX-T, vRealize Suite, and vSphere/vCenter in a secure, multi-tiered environment.
  • Work with DevOps and CI/CD tools (e.g., Jenkins, Bamboo, GitHub, Bitbucket, Ansible) to ensure streamlined development and deployment processes.
  • Write and maintain scripts and configuration files in Bash, PowerShell, YAML, and similar languages for automation and system maintenance.
  • Collaborate with cross-functional teams to drive platform enhancements and address reliability issues proactively.
  • Stay ahead of industry trends and security requirements to mitigate risks and ensure compliance.

Must-Have Skills & Experience

  • 5–7 years of experience in SRE, DevOps, or Systems Engineering roles.
  • Strong hands-on expertise with VMware solutions, including NSX-T, vRealize Suite, and vSphere/vCenter.
  • Proven experience with patch management for operating systems (Windows, RedHat) and middleware (WebSphere, WebLogic, MSSQL).
  • Deep understanding of automation, orchestration, and infrastructure as code (IaC) tools.
  • Proficiency in scripting and configuration languages (Bash, PowerShell, YAML) and software development best practices.
  • Familiarity with CI/CD tools and version control systems (Bitbucket, GitHub, Jenkins, etc.).
  • Experience in implementing and maintaining secure, distributed, multi-tier environments.
  • Solid problem-solving skills, a strong sense of accountability, and the ability to work well in a team.
  • Strong communication skills with a proactive and adaptable mindset.
  • Exposure to container technologies (e.g., Docker, Kubernetes).
  • Experience with test automation tools (e.g., Selenium, SOAPUI).
  • Knowledge of regulatory compliance and cloud security practices.

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: Staffing and Recruiting