Talent.com
Site Reliability Engineer III

Site Reliability Engineer III

Guidewire SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
30+ hari lalu
Penerangan pekerjaan

Summary

At Guidewire, we make software that offers Property and Casualty (P&C) Insurance companies the tools to take care of their customers when they need it the most, whether that’s a time of crisis, a natural disaster, an accident, or exposure to cyber risks. We build the core applications that insurance companies use to sell and underwrite policies, settle claims, and bill their customers. We also have a portfolio of innovative products serving the needs of P&C insurance companies in areas such as data management, digital online portals, and predictive analytics. We run these products on the Guidewire Cloud Platform, and we help hundreds of insurance providers all over the world to handle billions of dollars of business.

About The Role : As a Site Reliability Engineer at Guidewire, you’ll join a passionate team dedicated to automating every process to ensure our systems run efficiently. Our Platform team is fully committed to developing and managing software that enhances the reliability of production systems—systems that serve hundreds of customers and support millions of transactions every day. You will play a key role in ensuring the stability of our flagship cloud platform products while building the tooling necessary for efficient operations and optimal availability of our SaaS multi-tenant, customer-focused systems. In close collaboration with our core product developers, you’ll help ensure our cloud products meet both functional and non-functional requirements including availability, performance, observability, and maintainability.

If you thrive on teamwork, embrace responsibility, and have a passion for solving problems at scale with technologies like AWS, Kubernetes, and Aurora, then we’d love to hear from you. We’re looking for someone who lives by the mantra, "If you have to do something more than once, automate it," and who is eager to learn and master new tools and concepts. Bonus points if you have experience in production support for a SaaS platform and are comfortable working with cutting-edge, highly containerized, cloud-native environments in AWS.

What You’ll Do

  • Drive Reliability & Automation :

Take a dedicated SRE approach to managing shared multi-tenant infrastructure for resilient SaaS microservice-based systems and customer-centric applications.

  • Oversee and continuously enhance our team’s presence in AWS by automating deployment and operational tasks.
  • Innovate and Improve Core Systems :
  • Contribute to the development of our core infrastructure systems—adding features, fixing bugs, and implementing reliability enhancements.

  • Engineer and maintain a complex single sign-on (SSO) authentication platform based on SAML / OAuth to ensure secure, seamless access for our users.
  • Enhance Observability & Incident Management :
  • Build and maintain comprehensive observability tooling, metrics, and dashboards to support our global platform infrastructure.

  • Improve our incident management lifecycle by identifying, mitigating, and learning from reliability risks, while helping to create a self-healing environment.
  • Foster a culture of curiosity, innovation, and responsible use of AI—empowering our teams to continuously leverage emerging technologies and data-driven insights to enhance productivity and outcomes.
  • Empower the Team :
  • Develop system documentation and training materials to educate and empower your teammates.

  • Collaborate with various engineering teams, providing feedback and contributing code when needed to enhance our products.
  • Who You Are

  • Technically Skilled :
  • Bachelor’s Degree in Computer Science or a related field.

  • Proven software engineering and automation skills using Bash, Python, and / or Go.
  • Deep background in Linux systems and Agile development methodologies (Scrum, Kanban, etc.).
  • Cloud & DevOps Savvy :
  • Significant experience automating and managing systems on AWS and supporting live production environments (Java / Apache / Tomcat).

  • Proficient with Infrastructure as Code (IaC) tools such as Terraform, Terragrunt, or Terraspace; experience with devops / gitops tools for code promotions.
  • Hands-on experience with containerization (Docker, Helm, Kubernetes / EKS) and a strong understanding of SSO, SAML, and OAuth (bonus if Okta).
  • Observability & Database Knowledge :
  • Experience with observability tools (Datadog, CloudWatch, PagerDuty) and event store / stream-processing technologies (Kafka, AWS SQS).

  • Relational databases such as Aurora Postgres or Oracle RDS; strong application development, web UI design, JSON, and overall architecture experience.
  • Open Application Model exposure (KubeVela or Crossplane) is a plus.
  • Demonstrated ability to embrace emerging technologies—especially AI—and apply data-driven insights to drive innovation and continuous improvement.
  • A Collaborative Problem Solver :
  • Prefer writing robust code over GUI-based work; enjoy mentoring others.

  • Strong troubleshooting skills, analytical mindset, and process-driven approach.
  • Proactive team player with excellent communication and the ability to explain complex concepts clearly.
  • Champion reliability by promoting blameless postmortems, SLO tracking, and learning from incidents.
  • About Guidewire

    Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. We combine digital, core, analytics, and AI to deliver our platform as a cloud service. More than 540+ insurers in 40 countries, from new ventures to the largest and most complex in the world, run on Guidewire.

    As a partner to our customers, we continually evolve to enable their success. We are proud of our unparalleled implementation track record with 1600+ successful projects, supported by the largest R&D team and partner ecosystem in the industry. Our Marketplace provides hundreds of applications that accelerate integration, localization, and innovation.

    For more information, please visit and follow us on Twitter : @Guidewire_PandC. Guidewire Software, Inc. is proud to be an equal opportunity and affirmative action employer. We are committed to an inclusive workplace, and believe that a diversity of perspectives, abilities, and cultures is a key to our success. Qualified applicants will receive consideration without regard to race, color, ancestry, religion, sex, national origin, citizenship, marital status, age, sexual orientation, gender identity, gender expression, veteran status, or disability. All offers are contingent upon passing a criminal history and other background checks where applicable.

    #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    Site Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

    Pekerjaan yang berkaitan
    • Dinaikkan pangkat
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SWIFTKuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Tunjukkan lagiKemas kini terakhir: 16 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    AmpstekKuala Lumpur, Kuala Lumpur, Malaysia
    Ampstek Federal Territory of Kuala Lumpur, Malaysia.We are looking for a skilled Site Reliability Engineer (SRE) to join our technology operations team. The ideal candidate will be responsible for b...Tunjukkan lagiKemas kini terakhir: 8 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    GX Bank BerhadPetaling Jaya, Selangor, Malaysia
    Site Reliability Engineer page is loaded.Apply locations Petaling Jaya (First Avenue) time type Full time posted on Posted 9 Days Ago job requisition id R-. GX Bank Berhad - the Grab-led Digital Ban...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Azure Site Reliability Engineer

    Azure Site Reliability Engineer

    ExperianCyberjaya, Selangor, Malaysia
    We are seeking a skilled Azure Cloud DevOps Engineer to join our team.The ideal candidate will have a strong background in DevOps practices, cloud solutions, and network engineering in Microsoft Az...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Razer Inc.Kuala LumpurMalaysia, Kuala Lumpur, Malaysia
    Bangsar South, Federal Territory of Kuala Lumpur, Malaysia.Join or sign in to find your next job.Bangsar South, Federal Territory of Kuala Lumpur, Malaysia. Get AI-powered advice on this job and mor...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    CanonicalKlang Municipal Council, Klang Municipal Council, Malaysia
    Site Reliability Engineer role at Canonical.We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices. To succeed in this role, you need to ...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Russell TobinKuala Lumpur, Kuala Lumpur, Malaysia
    Job Opportunity : Site Reliability Engineer (SRE) in Cyberjaya.Note : Only Malaysian locals or PR holders can apply.We are looking for a Site Reliability Engineer (SRE) to join our forward-thinking C...Tunjukkan lagiKemas kini terakhir: 13 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer (DevOps)

    Site Reliability Engineer (DevOps)

    Ant InternationalKuala Lumpur, Kuala Lumpur, Malaysia
    Direct message the job poster from Ant International.Recruiter @ Ant International | Talent Acquisition Specialist.With headquarters in Singapore and main operations across Asia, Europe, the Middle...Tunjukkan lagiKemas kini terakhir: 21 hari yang lalu
    • Dinaikkan pangkat
    Specialist, Site Reliability Engineer (SRE)

    Specialist, Site Reliability Engineer (SRE)

    TNG DigitalKuala Lumpur, Kuala Lumpur, Malaysia
    Specialist, Site Reliability Engineer (SRE).We are hiring for a Specialist, Site Reliability Engineer (SRE) to join our team. Role focuses on network administration, cloud infrastructure management,...Tunjukkan lagiKemas kini terakhir: 10 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    GXBankPetaling Jaya, Selangor, Malaysia
    Be among the first 25 applicants.GX Bank Berhad - the Grab-led Digital Bank - is the FIRST digital bank in Malaysia, approved by BNM to commence operations. We aim to leverage technology and innovat...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesKuala Lumpur, Kuala Lumpur, Malaysia
    Talent Acquisition | Human Resource Executive | Tata Consultancy Service.Join Tata Consultancy Services, Asia Pacific and be part of an organization committed to sustainable development for our fut...Tunjukkan lagiKemas kini terakhir: 21 hari yang lalu
    • Dinaikkan pangkat
    Azure Site Reliability Engineer

    Azure Site Reliability Engineer

    Experian Asia PacificCyberjaya, Selangor, Malaysia
    We are seeking a skilled Azure Cloud DevOps Engineer to join our team.The ideal candidate will have a strong background in DevOps practices, cloud solutions, and network engineering in Microsoft Az...Tunjukkan lagiKemas kini terakhir: 24 hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    RazerKuala Lumpur, Kuala Lumpur, Malaysia
    Joining Razer will place you on a global mission to revolutionize the way the world games.LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.We ar...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Unison Consulting Pte LtdKuala Lumpur, Kuala Lumpur, Malaysia
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    Smart Teq Solution Sdn BhdKuala Lumpur, Kuala Lumpur, Malaysia
    Ensure all our infrastructure are running at optimal condition.Provide deployment, patches and update on all services that running on public cloud and on premise. Identify and resolve support ticket...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SwiftKuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Tunjukkan lagiKemas kini terakhir: 1 hari yang lalu
    • Dinaikkan pangkat
    Hunyuan LLM Site Reliability Engineer

    Hunyuan LLM Site Reliability Engineer

    TencentKuala Lumpur, Kuala Lumpur, Malaysia
    Tencent Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Join or sign in to find your next job.Hunyuan LLM Site Reliability Engineer. Tencent Kuala Lumpur, Federal Territory of Kuala Lumpur...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu
    • Dinaikkan pangkat
    Site Reliability Engineer

    Site Reliability Engineer

    iSoftStoneKuala Lumpur, Kuala Lumpur, Malaysia
    SoftStone Federal Territory of Kuala Lumpur, Malaysia.SoftStone Federal Territory of Kuala Lumpur, Malaysia.Be among the first 25 applicants. Get AI-powered advice on this job and more exclusive fea...Tunjukkan lagiKemas kini terakhir: 30+ hari yang lalu