Talent.com
Tawaran kerja ini tidak tersedia di negara anda.
Site Reliability Engineer III

Site Reliability Engineer III

Guidewire SoftwareKuala Lumpur, Kuala Lumpur, Malaysia
30+ hari lalu
Penerangan pekerjaan

Summary

At Guidewire, we make software that offers Property and Casualty (P&C) Insurance companies the tools to take care of their customers when they need it the most, whether that’s a time of crisis, a natural disaster, an accident, or exposure to cyber risks. We build the core applications that insurance companies use to sell and underwrite policies, settle claims, and bill their customers. We also have a portfolio of innovative products serving the needs of P&C insurance companies in areas such as data management, digital online portals, and predictive analytics. We run these products on the Guidewire Cloud Platform, and we help hundreds of insurance providers all over the world to handle billions of dollars of business.

About The Role : As a Site Reliability Engineer at Guidewire, you’ll join a passionate team dedicated to automating every process to ensure our systems run efficiently. Our Platform team is fully committed to developing and managing software that enhances the reliability of production systems—systems that serve hundreds of customers and support millions of transactions every day. You will play a key role in ensuring the stability of our flagship cloud platform products while building the tooling necessary for efficient operations and optimal availability of our SaaS multi-tenant, customer-focused systems. In close collaboration with our core product developers, you’ll help ensure our cloud products meet both functional and non-functional requirements including availability, performance, observability, and maintainability.

If you thrive on teamwork, embrace responsibility, and have a passion for solving problems at scale with technologies like AWS, Kubernetes, and Aurora, then we’d love to hear from you. We’re looking for someone who lives by the mantra, "If you have to do something more than once, automate it," and who is eager to learn and master new tools and concepts. Bonus points if you have experience in production support for a SaaS platform and are comfortable working with cutting-edge, highly containerized, cloud-native environments in AWS.

What You’ll Do

  • Drive Reliability & Automation :

Take a dedicated SRE approach to managing shared multi-tenant infrastructure for resilient SaaS microservice-based systems and customer-centric applications.

  • Oversee and continuously enhance our team’s presence in AWS by automating deployment and operational tasks.
  • Innovate and Improve Core Systems :
  • Contribute to the development of our core infrastructure systems—adding features, fixing bugs, and implementing reliability enhancements.

  • Engineer and maintain a complex single sign-on (SSO) authentication platform based on SAML / OAuth to ensure secure, seamless access for our users.
  • Enhance Observability & Incident Management :
  • Build and maintain comprehensive observability tooling, metrics, and dashboards to support our global platform infrastructure.

  • Improve our incident management lifecycle by identifying, mitigating, and learning from reliability risks, while helping to create a self-healing environment.
  • Foster a culture of curiosity, innovation, and responsible use of AI—empowering our teams to continuously leverage emerging technologies and data-driven insights to enhance productivity and outcomes.
  • Empower the Team :
  • Develop system documentation and training materials to educate and empower your teammates.

  • Collaborate with various engineering teams, providing feedback and contributing code when needed to enhance our products.
  • Who You Are

  • Technically Skilled :
  • Bachelor’s Degree in Computer Science or a related field.

  • Proven software engineering and automation skills using Bash, Python, and / or Go.
  • Deep background in Linux systems and Agile development methodologies (Scrum, Kanban, etc.).
  • Cloud & DevOps Savvy :
  • Significant experience automating and managing systems on AWS and supporting live production environments (Java / Apache / Tomcat).

  • Proficient with Infrastructure as Code (IaC) tools such as Terraform, Terragrunt, or Terraspace; experience with devops / gitops tools for code promotions.
  • Hands-on experience with containerization (Docker, Helm, Kubernetes / EKS) and a strong understanding of SSO, SAML, and OAuth (bonus if Okta).
  • Observability & Database Knowledge :
  • Experience with observability tools (Datadog, CloudWatch, PagerDuty) and event store / stream-processing technologies (Kafka, AWS SQS).

  • Relational databases such as Aurora Postgres or Oracle RDS; strong application development, web UI design, JSON, and overall architecture experience.
  • Open Application Model exposure (KubeVela or Crossplane) is a plus.
  • Demonstrated ability to embrace emerging technologies—especially AI—and apply data-driven insights to drive innovation and continuous improvement.
  • A Collaborative Problem Solver :
  • Prefer writing robust code over GUI-based work; enjoy mentoring others.

  • Strong troubleshooting skills, analytical mindset, and process-driven approach.
  • Proactive team player with excellent communication and the ability to explain complex concepts clearly.
  • Champion reliability by promoting blameless postmortems, SLO tracking, and learning from incidents.
  • About Guidewire

    Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. We combine digital, core, analytics, and AI to deliver our platform as a cloud service. More than 540+ insurers in 40 countries, from new ventures to the largest and most complex in the world, run on Guidewire.

    As a partner to our customers, we continually evolve to enable their success. We are proud of our unparalleled implementation track record with 1600+ successful projects, supported by the largest R&D team and partner ecosystem in the industry. Our Marketplace provides hundreds of applications that accelerate integration, localization, and innovation.

    For more information, please visit and follow us on Twitter : @Guidewire_PandC. Guidewire Software, Inc. is proud to be an equal opportunity and affirmative action employer. We are committed to an inclusive workplace, and believe that a diversity of perspectives, abilities, and cultures is a key to our success. Qualified applicants will receive consideration without regard to race, color, ancestry, religion, sex, national origin, citizenship, marital status, age, sexual orientation, gender identity, gender expression, veteran status, or disability. All offers are contingent upon passing a criminal history and other background checks where applicable.

    #J-18808-Ljbffr

    Buat amaran kerja untuk carian ini

    Site Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia