Overview
Job Opportunity: Site Reliability Engineer (SRE) in Cyberjaya. Experience: 5 to 7 years. Note: Only Malaysian citizens or PR holders may apply.
About the Role
We are looking for a Site Reliability Engineer (SRE) to join our forward-thinking Cloud Engineering team in Cyberjaya. This role is perfect for someone who thrives at the intersection of software development and systems engineering, with a strong passion for automation, cloud operations, and continuous improvement. As an SRE, you will play a key role in ensuring the availability, performance, scalability, and security of our cloud platform solutions while building tools to improve efficiency and reliability.
Key Responsibilities
- Monitor and maintain the availability, latency, and performance of cloud systems.
- Automate operational tasks to improve system reliability and reduce manual effort.
- Participate in the full lifecycle of cloud solutions – from architecture and deployment to ongoing operations and optimization.
- Develop and implement infrastructure as code (IaC), orchestration, and automation for sustainable scaling.
- Support pre-launch planning, including system design consultations, and maintain post-launch system health with robust monitoring and alerting.
- Lead patch management and version control for systems and middleware (e.g., RedHat, WebSphere, WebLogic, MSSQL).
- Create and maintain VMware server templates (e.g., RedHat, Windows Server) and other cloud-native resources.
- Leverage technologies like VMware NSX-T, vRealize Suite, and vSphere/vCenter in a secure, multi-tiered environment.
- Work with DevOps and CI/CD tools (e.g., Jenkins, Bamboo, GitHub, Bitbucket, Ansible) to ensure streamlined development and deployment processes.
- Write and maintain scripts and configuration files (Bash, PowerShell, YAML, and similar) for automation and system maintenance.
- Collaborate with cross-functional teams to drive platform enhancements and address reliability issues proactively.
- Stay ahead of industry trends and security requirements to mitigate risks and ensure compliance.
Must-Have Skills & Experience
- 5–7 years of experience in SRE, DevOps, or Systems Engineering roles.
- Strong hands-on expertise with VMware solutions including NSX-T, vRealize Suite, vSphere/vCenter.
- Proven experience with patch management for operating systems (Windows, RedHat) and middleware (WebSphere, WebLogic, MSSQL).
- Deep understanding of automation, orchestration, and infrastructure as code (IaC) tools.
- Proficiency in scripting (Bash, PowerShell, YAML) and software development best practices.
- Familiar with CI/CD tools and version control systems (Bitbucket, GitHub, Jenkins, etc.).
- Experience in implementing and maintaining secure, distributed, multi-tier environments.
- Solid problem-solving skills, a high sense of accountability, and a strong team player.
- Strong communication skills with a proactive and adaptable mindset.
- Exposure to container technologies (e.g., Docker, Kubernetes).
- Experience with test automation tools (e.g., Selenium, SOAPUI).
- Knowledge of regulatory compliance and cloud security practices.
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Staffing and Recruiting