Drive SRE practice and deliver the highest, industry-leading level of system and infrastructure resiliency that meets business and regulatory requirements
Key Responsibilities
Drive consistent SRE practice across all application, infrastructure and IT security teams
Set up and operationalize SRE teams identified for specific application, infrastructure and IT security areas
Provide coaching on SRE-related functions to SRE engineers and other teams (application and infrastructure support teams) practicing SRE within Group Technology, to ensure consistent practice of SRE across teams.
Contribute to the development and documentation of SRE best practices and procedures across the Group.
Take ownership of Application Monitoring tools such as Dynatrace and work with vendors to design and drive consistent use of the monitoring tools across all teams
Design, develop, and deploy automation scripts and tools to monitor, manage, and optimize systems.
Analyze system metrics and logs from Dynatrace or other monitoring tools to identify potential problems and areas for improvement.
Build internal expertise in Application Monitoring tools in order to continuously support and enhance observability across all relevant areas as the technology and business environment changes
Train and enable active use of Application Monitoring tools across all application and infrastructure support teams
Provide support in deep analysis and troubleshooting of technical issues encountered in the Critical and Required High applications and the underlying supporting infrastructure and IT security components. This applies both during normal operations and during incidents or system downtime.
Advocate and develop a strong culture of system resiliency and delivery of non-functional requirements
Support, validate and sign off delivery of SRE-related non-functional requirements during project implementation
Continue to fine-tune and enhance SRE practice as the business and technology environment evolves.
Keep abreast of issues and challenges encountered in system reliability and identify the strategic or structural changes needed to improve it
Build strong teamwork and collaboration between SRE, Application, IT Infrastructure and all other relevant stakeholders within Group Technology
Promote continuous learning and a culture of innovation within the team
Build strategic and mutually beneficial relationships with technology solution partners and service providers to further strengthen the Group's capabilities
Requirements
Master's Degree or Degree in Computer Science, IT or a related discipline.
8 - 10 years of IT system development and implementation experience in the Financial Services Industry (FSI)
3 - 5 years of experience in system architecture and design
Knowledge of mainframe architecture, operations, and management is crucial. This includes understanding z/OS, CICS, CICS Transaction Gateway, and other mainframe-specific technologies.
Programming Languages: Proficiency in COBOL is highly beneficial since many mainframe applications are written in COBOL. Additionally, knowledge of other languages like Java, C#, or scripting languages (e.g., Bash, Python, PowerShell) can be helpful.
Systems Reliability: Familiarity with principles of systems reliability, including monitoring, automation, and incident management.
Networking: Understanding of networking concepts, protocols, and troubleshooting.
Strong experience in designing and delivering non-functional requirements including High Availability (at hardware and software levels), Disaster Recovery, Archiving, Housekeeping, Backup and Recovery etc.
Experience in and strong appreciation of SRE practice, including Service Level Objectives, Service Level Indicators, System Observability, Elimination of Toil, Automation etc.
Excellent interpersonal and communication skills, and highly influential in driving a strong SRE culture
Strong analytical and problem-solving skills
Strong R&D mindset