Drive SRE practice and deliver the highest and industry leading level of system and infrastructure resiliency that meets business and regulatory requirements
Key Responsibilities
Drive consistent SRE practice across all application, infrastructure and IT security teams
Set up and operationalize SRE teams identified for specific application, infrastructure and IT security areas
Provide coaching for SRE related functions to SRE engineers and other team (application and infrastructure support teams) practicing SRE within Group Technology to ensure consistent practice of SRE across teams.
Contribute to the development and documentation of SRE best practices and procedures across the Group.
Take ownership of Application Monitoring tools such as Dynatrace and work with vendors to design and drive consistent use of the monitoring tools across all teams
Design, develop, and deploy automation scripts and tools to monitor, manage, and optimize systems.
Analyze system metrics and logs from Dynatrace or other monitoring tools to identify potential problems and areas for improvement.
Build internal expertise in Application Monitoring tools in order to continuously support and enhance observability across all relevant areas as technology and business environment changes
Train and enable active use of Application Monitoring tools across all application and infrastructure support teams
Provide support in deep analysis and trouble-shooting of technical issues encountered in the Critical and Required High applications and the underlying supporting infrastructure and IT security components. This applies during normal times and during incident / system downtime.
Advocate and develop a strong culture of system resiliency and delivery of non-functional requirements
Support, validate and sign off delivery of SRE-related non-functional requirements during project implementation
Continue to fine-tune and enhance SRE practice as business and technology environment evolves.
Keep abreast with issues and challenges encountered in system reliability and identify strategic / structural changes that need to be made to improve
Build strong teamwork and collaboration between SRE, Application, IT Infrastructure and all other relevant stakeholders within Group Technology
Promote continuous learning and culture of innovation within the team
Build strategic and mutually-beneficial relationship with technology solution partners and service providers to further strengthen the Group's capabilities
Requirements
Master's Degree - Master / Degree in Computer Science, IT or a related discipline.
8 - 10 years in IT system development & implementation experience in Financial Services Industry (FSI)
3 - 5 years in system architecture and design related experience
Knowledge of mainframe architecture, operations, and management is crucial. This includes understanding z / OS, CICS ,CICS Transaction Gateway, and other mainframe-specific technologies.
Programming Languages : Proficiency in COBOL is highly beneficial since many mainframe applications are written in COBOL. Additionally, knowledge of other languages like Java, C#, or scripting languages (e.g., Bash, Python, PowerShell) can be helpful.
Systems Reliability : Familiarity with principles of systems reliability, including monitoring, automation, and incident management.
Networking : Understanding of networking concepts, protocols, and troubleshooting.
Strong experience in designing and delivering non-functional requirements including High Availability (at hardware and software levels), Disaster Recovery, Archiving, Housekeeping, Backup and Recovery etc.
Experience and strong appreciation in SRE practice including Service Level Objectives, Service Level Indicators, System Observability, Elimination of Toils, Automation etc.
Excellent interpersonal and communication skills and highly influential in driving strong SRE culture
Strong analytical and problem solving skills
Strong R&D mindset
Create a job alert for this search
Engineer • Klang, Selangor, Malaysia
Related jobs
SSHE Engineer
Petron MalaysiaPort Dickson, Negeri Sembilan, MY
Quick Apply
At Petron, we are not just in the business of oil, we are also in the business of fueling lives.Petron Malaysia is an emerging and rapidly evolving Asian oil company.
It is part of Petron Corporatio...Show moreLast updated: 13 days ago
Promoted
Technical Marketing Engineer
Infotree Global SolutionsSelayang Municipal Council, Selayang Municipal Council, Malaysia
Job title : Developer : Technical Marketing - III.Max salary budget : RM8,000 / month.Experience : 5+ years in technical roles, 2+ years with developers, coding skills, and hands-on AI / edge / IoT experienc...Show moreLast updated: 18 days ago
Promoted
Senior Engineer I, Structural
Vantris Energy BerhadKuala Lumpur, Kuala Lumpur, Malaysia
Perform and / or review standard analyses such as, in minimum : .Lifting analysis and rigging design for jacket, topside, pile, module, boat landing, riser guard, flare boom, spool, riser, concrete sle...Show moreLast updated: 3 days ago
Promoted
New!
Site Reliability Engineer
Horizontal TalentsTaman Tun Dr Ismail, Kuala Lumpur, Malaysia
As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services.
Your expertise will help bridge the gap between development and op...Show moreLast updated: 17 hours ago
Site Reliability Engineer
Unison GroupKuala Lumpur, Federal Territory of Kuala Lumpur, MY
Quick Apply
As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services.
Your expertise will help bridge the gap between development and op...Show moreLast updated: 20 days ago
Promoted
New!
RSE Technical Support Engineer
Linde Malaysia Sdn BhdPetaling Jaya, Selangor, Malaysia
The position holder is responsible for the coordination and management of the following : -.Ensure that all Operations personnel in PGP Country have a training profile that is setup and tracked to e...Show moreLast updated: 17 hours ago
Sr. Systems Engineer
Two95 International Inc.Kuala Lumpur, Federal Territory of Kuala Lumpur, MY
Quick Apply
To ensure successful implementation of projects within schedule.To ensure SLAs are met and achieved the highest customer satisfaction.
Oversee the design, development and implementation of clients s...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
GlintsKuala Lumpur, Kuala Lumpur, Malaysia
Recruitment Consultant at Glints Singapore.Monitor and maintain system performance to ensure the stability and reliability of applications and infrastructure.
Design and implement resilient system a...Show moreLast updated: 25 days ago
Promoted
Senior Process Engineer
Petron Malaysia Refining & Marketing BhdPort Dickson, Negeri Sembilan, Malaysia
At Petron, we are not just in the business of oil, we are also in the business of fueling lives.Petron Malaysia is an emerging and rapidly evolving Asian oil company.
It is part of Petron Corporatio...Show moreLast updated: 30+ days ago
Promoted
Reliability Engineer (R&D)
Daikin Malaysia Sdn BhdSungai Buloh, Selangor, Malaysia
Oversee daily test operations, including equipment setup, maintenance, and facility upgrades.Collaborate with designers to conduct tests and ensure compliance with specifications and standards.Mana...Show moreLast updated: 17 days ago
Promoted
Sales Director - Industrial and Smart Energy
Celestica Inc.Selayang Municipal Council, Selayang Municipal Council, Malaysia
Press Tab to Move to Skip to Content Link.Select how often (in days) to receive an alert : .Sales Director - Industrial and Smart Energy.
Remote Employee Europe, SHR, GB.Celestica is dedicated to deli...Show moreLast updated: 1 day ago
Promoted
Reservoir Engineer
ExxonMobilKuala Lumpur, Kuala Lumpur, Malaysia
At ExxonMobil, our vision is to lead in energy innovations that advance modern living and a net-zero future.As one of the world’s largest publicly traded energy and chemical companies, we are power...Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer
Swift SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
Senior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations : Kuala Lumpur, Malaysiaposted on : Posted Todayjob requisition id : We’re the world’s leading provid...Show moreLast updated: 3 days ago
Promoted
New!
Senior SRE Engineer
RHB Banking GroupKuala Lumpur, Kuala Lumpur, Malaysia
We are seeking a highly motivated Senior Site Reliability Engineer (SRE) to join our Technology team at RHB Banking Group.
In this role, you will engineer self-service, reliable systems to support h...Show moreLast updated: 17 hours ago
Promoted
New!
Site Reliability Engineer
U3 InfoTech Pte LtdKuala Lumpur, Kuala Lumpur, Malaysia
Position : Site Reliability Engineer (SRE).Duration : 2 years (direct contract & convertible to permanent).Experience : 3-8 years (Multiple headcounts).
As a Site Reliability Engineer (SRE), you will p...Show moreLast updated: 17 hours ago
Promoted
Division CFO, Trilogy (Remote) - $400,000 / year USD
TrilogyKlang Municipal Council, Klang Municipal Council, Malaysia
Division CFO, Trilogy (Remote) - $400,000 / year USD.Trilogy Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Get AI-powered advice on this job and more exclusive features.This range is prov...Show moreLast updated: 1 day ago
Promoted
MANAGER - ENGINEERING (CNI AND P&P)
HartalegaSepang, Selangor, Malaysia
Lead and manage all engineering, maintenance, and technical operations across the plant.Drive equipment reliability, process efficiency, and continuous improvement.
Ensure team development, cross-fu...Show moreLast updated: 5 days ago
Promoted
New!
Site Reliability Engineer
OM CONNECT SDN BHD (OpenMinds®)Klang, Selangor, Malaysia
The Site Reliability Engineer (SRE) ensures the reliability and performance of critical services, bridging development and operations.
The role focuses on scalable infrastructure, SRE practices such...Show moreLast updated: 17 hours ago