Talent.com
System Reliability Engineer, Consultant
AIA Hong Kong and Macau • Kuala Lumpur, Kuala Lumpur, Malaysia
10 days ago
Job description
  • At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.
  • As pioneering innovators for over 100 years, we’re now transforming our organisation to be faster, simpler and more connected, because we want to be even better equipped to develop digital solutions and experiences that help more people live Healthier, Longer, Better Lives.
  • To get there, we need people with tech / digital / analytics expertise and passion to help develop positive, sustainable change through digitally enhanced experiences that will impact the lives of millions of people and create a healthier future for everyone.
  • If you believe in developing a better tomorrow, read on.
  • About the Role
  • This role ensures the reliability, scalability, and performance of enterprise systems and services by applying software engineering principles to operations. The System / Site Reliability Engineer will collaborate with development and operations teams to build robust automation, monitor system health, respond to incidents, and continuously improve service availability and efficiency. This role is critical in bridging the gap between software development and IT operations, fostering a culture of resilience, observability, and proactive problem-solving.
  • Job Description
  • Ensure System Reliability and Availability
  • Oversee application performance and report any deviations and issues
  • Collaborate with application engineers and developers on root cause identification
  • Incident Management and Root Cause Analysis
  • Participate in incident response efforts for production outages as Subject Matter Advisor
  • Provide insights from monitoring and in-depth code / database review
  • Assist with Application Operation post-mortem reviews
  • Automation and Tooling
  • Automate operational tasks such as monitoring and recovery.
  • Develop scripts and tools to reduce manual toil and improve efficiency.
  • Monitoring and Observability
  • Implement robust telemetry systems to monitor application health, latency, and error rates.
  • Manage the Dynatrace platform and its integration with all application services
  • Assist the Application team with dashboard design and setup
  • Security and Compliance
  • Collaborate with Security teams to ensure systems meet regulatory and security standards (e.g., PCI-DSS, GDPR).
  • Implement access controls, encryption, and audit mechanisms as required within the scope of the SRE team
  • Capacity Planning and Performance Optimization
  • Assist in analyzing usage trends to forecast demand and scale infrastructure accordingly.
  • Participate in optimizing resource utilization to balance cost and performance.
  • Work closely with development, QA, and infrastructure teams to embed reliability into the SDLC.
  • Promote SRE principles across teams to foster a culture of resilience and accountability.
  • Maintain clear operational documentation, runbooks, and architecture diagrams.
  • Job Requirements
  • Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
  • 3–5 years of experience in Site Reliability Engineering, DevOps, or Software Engineering roles.
  • Prior experience supporting front-end applications in production environments, preferably in financial services or regulated industries.
  • Front-end performance monitoring: ability to instrument front-end code for custom metrics and traces.
  • Experience with Real User Monitoring (RUM), Synthetic Monitoring, and Application Performance Monitoring (APM) tools (e.g., New Relic, Dynatrace, Datadog).
  • Proficiency in setting up dashboards and alerts using tools like Dynatrace, Grafana, Prometheus, Elastic Stack, or Splunk.
  • Familiarity with OpenTelemetry standards for distributed tracing.
  • Scripting skills in Python, Bash, or JavaScript for automation and tooling.
  • Experience with CI/CD pipelines (e.g., GitHub Flow).
  • Hands-on experience with cloud platforms (AWS, Azure).
  • Familiarity with containerization (Docker) and orchestration (Kubernetes).
  • Understanding of secure coding practices for front-end applications.
  • Awareness of financial compliance standards (e.g., PCI-DSS).
  • Build a career with us as we help our customers and the community live Healthier, Longer, Better Lives.
  • You must provide all requested information, including Personal Data, to be considered for this career opportunity. Failure to provide such information may influence the processing and outcome of your application. You are responsible for ensuring that the information you submit is accurate and up-to-date.

