Talent.com
System Reliability Engineer, Consultant

System Reliability Engineer, Consultant

AIA Hong Kong and MacauKuala Lumpur, Kuala Lumpur, Malaysia
17 hours ago
Job description
  • At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.
  • As pioneering innovators for over 100 years, we’re now transforming our organisation to be faster, simpler and more connected. Because we want to be even better equipped to develop digital solutions and experiences that help more people live Healthier, Longer, Better Lives.
  • To get there, we need people with tech / digital / analytics expertise and passion to help develop positive, sustainable change through digitally enhanced experiences that will impact the lives of millions of people and create a healthier future for everyone.
  • If you believe in developing a better tomorrow, read on.
  • About the Role
  • To ensure the reliability, scalability, and performance of enterprise systems and services by applying software engineering principles to operations. The System / Site Reliability Engineer will collaborate with development and operations teams to build robust automation, monitor system health, respond to incidents, and continuously improve service availability and efficiency. This role is critical in bridging the gap between software development and IT operations, fostering a culture of resilience, observability, and proactive problem-solving.
  • Job Description
  • Ensure System Reliability and Availability
  • Oversee application performance, report any deviation and issue
  • Collaborate with application engineers and developers in root cause identificationIncident Management and Root Cause Analysis
  • Participate in incident response efforts for production outages as Subject Matter Advisor
  • Provide insights from monitoring and in-depth code / database review
  • Assist Application Operation post-mortems reviewAutomation and Tooling
  • Automate operational tasks such as monitoring, and recovery.
  • Develop scripts and tools to reduce manual toil and improve efficiency.Monitoring and Observability
  • Implement robust telemetry systems to monitor application health, latency, and error rates.
  • Manage Dynatrace platform and integration with all application services
  • Assist Application team in dashboarding design and setupSecurity and Compliance
  • Collaborate with Security teams to ensure systems meet regulatory and security standards (e.g., PCI-DSS, GDPR).
  • Implement access controls, encryption, and audit mechanisms as and where required by the scope of SRE teamCapacity Planning and Performance Optimization
  • Assist in analyzing usage trends to forecast demand and scale infrastructure accordingly.
  • Participate in resource optimization utilization to balance cost and performance.Work closely with development, QA, and infrastructure teams to embed reliability into the SDLC. Promote SRE principles across teams to foster a culture of resilience and accountability.Maintain clear operational documentation, runbooks, and architecture diagrams.
  • Job Requirements
  • Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
  • 3–5 years of experience in Site Reliability Engineering, DevOps, or Software Engineering roles.
  • Prior experience supporting front-end applications in production environments, preferably in financial services or regulated industries.
  • Frontend Performance Monitoring; Ability to instrument front-end code for custom metrics and traces.
  • Experience with Real User Monitoring (RUM), Synthetic Monitoring, and Application Performance Monitoring (APM) tools (e.g., New Relic, Dynatrace, Datadog).
  • Proficiency in setting up dashboards and alerts using tools like Dynatrace, Grafana, Prometheus, Elastic Stack, or Splunk.
  • Familiarity with OpenTelemetry standards for distributed tracing.
  • Scripting skills in Python, Bash, or JavaScript for automation and tooling.
  • Experience with CI / CD pipelines (e.g., GitHub Flow).
  • Hands-on experience with cloud platforms (AWS, Azure).
  • Familiarity with containerization (Docker) and orchestration (Kubernetes).
  • Understanding of secure coding practices for front-end applications.
  • Awareness of financial compliance standards (e.g., PCI-DSS).
  • Build a career with us as we help our customers and the community live Healthier, Longer, Better Lives.
  • You must provide all requested information, including Personal Data, to be considered for this career opportunity. Failure to provide such information may influence the processing and outcome of your application. You are responsible for ensuring that the information you submit is accurate and up-to-date.
  • #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

    Related jobs
    • Promoted
    Lead Engineer

    Lead Engineer

    Soft Space Sdn BhdSeremban, Negeri Sembilan, Malaysia
    We are seeking a technically strong leader based in Malaysia to head our North America region projects.The Lead Engineer will take ownership of regional delivery, technical solutioning, and team le...Show moreLast updated: 20 days ago
    • Promoted
    • New!
    Technical Solutions Architect II

    Technical Solutions Architect II

    Akamai Technologies GmbHSepang, Sepang, Malaysia
    Join the Technical Solutions Architect team.Akamai is working to simplify the way people work in the cloud.The team's mission is to accelerate innovation by making computing simple, scalable, acces...Show moreLast updated: 19 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    HCL Singapore Pte LtdCyberjaya, Selangor, Malaysia
    Administer and support VMware environments including VCF, VCD, NSX, ESXi, vCenter, vSAN, vRA / vRO, and Tanzu.Design, implement, and maintain automation scripts and tools to improve system reliabilit...Show moreLast updated: 20 days ago
    • Promoted
    Senior / Staff / Principal Engineer

    Senior / Staff / Principal Engineer

    CanonicalSepang, Selangor, Malaysia
    Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Join or sign in to find your next job.Senior / Staff / Principal Engineer. Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Sales Engineer

    Senior Sales Engineer

    SophosSeremban, Negeri Sembilan, Malaysia
    Sophos is a global leader and innovator of advanced security solutions designed to defeat cyberattacks.The company acquired Secureworks in February 2025, creating the largest pure‑play Managed Dete...Show moreLast updated: 9 days ago
    • Promoted
    System Analyst Lead, Cards

    System Analyst Lead, Cards

    Sperton Global ASKuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    Lead, build and manage a team of IT System analysts supporting cards platform and strategic projects across the franchise. Build, manage and implement system capabilities based on understanding of t...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Web Engineer

    Senior Web Engineer

    CanonicalNilai, Negeri Sembilan, Malaysia
    Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Join or sign in to find your next job.Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Be among the first 25 a...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Vice President of Sales

    Vice President of Sales

    NextstarsSelayang Municipal Council, Selayang Municipal Council, Malaysia
    NextStars is a Canadian accelerator with a global vision committed to empowering local and newcomer entrepreneurs to build innovative and impactful businesses. By leveraging Canada’s position as a l...Show moreLast updated: 19 hours ago
    • Promoted
    Senior System Engineer (Lead Implementation)

    Senior System Engineer (Lead Implementation)

    PT. Wide Technologies IndonesiaKuala Lumpur, Kuala Lumpur, Malaysia
    We are an established IT Consulting firm, partnering with top regional and global clients to deliver robust and scalable technology solutions. To support our rapid growth, we are seeking an experien...Show moreLast updated: 16 days ago
    • Promoted
    Protege RTW - Business Support Operations

    Protege RTW - Business Support Operations

    Airbus Customer Services Sdn BhdSepang, Malaysia
    Job Description : • • Opening Application for PROTEGE Program within Airbus in Malaysia.This program is for Malaysian fresh graduates only, as mandated by Malaysian Government.Kindly note this appli...Show moreLast updated: 1 day ago
    • Promoted
    SENIOR QA ENGINEER

    SENIOR QA ENGINEER

    Teknion Furniture Systems (M) Sdn BhdKlang City, Selangor, Malaysia
    Assist in developing, leading and executing Quality Management strategies.Participate in ensuring consistent product quality by developing and enforcing good manufacturing practices systems, valida...Show moreLast updated: 17 days ago
    • Promoted
    • New!
    Associate Director, Organizational Change Management (OCM)

    Associate Director, Organizational Change Management (OCM)

    AvasantKuala Selangor, Kuala Selangor, Malaysia
    Organizational Change Manager (OCM).Avasant is a Los Angeles, California based top management consulting and research firm providing Strategic Sourcing, IT and Business Transformation, and Global S...Show moreLast updated: 19 hours ago
    • Promoted
    Principal Architect - Systems & Solutions

    Principal Architect - Systems & Solutions

    Axiata Digital LabsKuala Lumpur, Malaysia
    QUALIFICATIONS / SKILLS / KNOWLEDGE.Bachelor's degree in Computer Science, Software Engineering, or related field or BSc equivalent qualification with 11+ year(s) experience.Broad knowledge on differ...Show moreLast updated: 30+ days ago
    • Promoted
    Airbus - Protege RTW - Business Support Operations

    Airbus - Protege RTW - Business Support Operations

    Airbus Customer Services Sdn BhdSepang, Malaysia
    Job Description : • • Opening Application for PROTEGE Program within Airbus in Malaysia.This program is for Malaysian fresh graduates only, as mandated by Malaysian Government.Kindly note this appli...Show moreLast updated: 1 day ago
    • Promoted
    Reliability Engineer (Machinery)

    Reliability Engineer (Machinery)

    Petron Malaysia Refining & Marketing BhdPort Dickson, Negeri Sembilan, Malaysia
    Petron Malaysia is an emerging and rapidly evolving Asian oil company.It is part of Petron Corporation which is the leading oil company in the Philippines. Our integrated refining, distribution, and...Show moreLast updated: 30+ days ago
    • Promoted
    System Analyst

    System Analyst

    Sperton Global ASKuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    You will be involved in Compliance domain and be responsible for understanding business requirements and translating them to functional specifications and technical design specifications.You will b...Show moreLast updated: 30+ days ago
    • Promoted
    System Analyst, BWise

    System Analyst, BWise

    Sperton Global ASKuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
    As the System Analyst, you will be reporting to the Delivery Lead and be in the Company Group IT Governance, Risk and Compliance (GRC) Team, part of Group Risk Management.Being a Technical Analyst ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    System Reliability Engineer, Consultant

    System Reliability Engineer, Consultant

    AIA Hong KongKuala Lumpur, Kuala Lumpur, Malaysia
    At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.As pioneering innovators for over 100 years, we’re now transforming our organisation to be fast...Show moreLast updated: 17 hours ago