Talent.com
This job offer is not available in your country.
Site Reliability Engineer (SRE) - PD

Site Reliability Engineer (SRE) - PD

Beyondsoft SingaporeKuala Lumpur, Kuala Lumpur, Malaysia
30+ days ago
Job description

Join to apply for the Site Reliability Engineer (SRE) - PD role at Beyondsoft Singapore

Responsibilities

We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of our systems. You will work closely with development and operations teams to design scalable, resilient architectures, implement automation, and manage Kubernetes and cloud infrastructure. This role involves proactive monitoring, incident response, and driving continuous improvements in system reliability.

Responsibilities

  • System Reliability
  • Partner with development teams to integrate reliability into the software development lifecycle.
  • Design and implement highly available and fault-tolerant architectures for mission-critical applications.
  • Kubernetes Operations
  • Design, implement, manage, and optimize Kubernetes clusters for availability, scalability, and security.
  • Perform upgrades, patches, and security hardening for Kubernetes infrastructure.
  • Automation & Infrastructure as Code (IaC)
  • Automate application deployment, scaling, and infrastructure provisioning.
  • Implement CI / CD pipelines for deploying and updating Kubernetes applications.
  • Develop and maintain IaC scripts (e.g., Terraform, Ansible) for provisioning and managing cloud and container resources.
  • Cloud Integration
  • Utilize AWS, GCP, or Azure services for Kubernetes deployments and integrations.
  • Apply cloud-native best practices for scalability and performance.
  • Monitoring & Alerting
  • Implement monitoring, logging, and alerting solutions (Prometheus, Grafana, ELK, etc.).
  • Proactively identify and resolve performance bottlenecks and reliability issues.
  • Incident Response
  • Respond to and resolve production incidents with minimal downtime.
  • Conduct post-incident analysis and implement preventive measures.
  • Capacity Planning
  • Perform capacity planning to ensure the Kubernetes infrastructure can accommodate current and future workloads in the cloud.
  • Security
  • Collaborate with the security team to implement and enforce Kubernetes and cloud security best practices.
  • Perform regular vulnerability assessments and compliance checks.
  • Collaboration & Documentation
  • Work cross-functionally with DevOps, security, and development teams.
  • Maintain comprehensive documentation for processes and configurations.

Qualifications

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Minimum 3 years of proven experience as a Site Reliability Engineer or similar functional role.
  • Strong programming or scripting skills, with proficiency in languages such as Bash, Python, Go, or Java.
  • Extensive experience with Kubernetes orchestration, including cluster setup, management, and troubleshooting.
  • Experience with infrastructure-as-code tools (e.g., Terraform, Ansible) and cloud platforms.
  • Solid understanding of virtualization and networking concepts and principles.
  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and collaboration skills.
  • Knowledge of cloud security best practices.
  • Familiarity with microservices frameworks.
  • Advantage : Certified Kubernetes Administrator (CKA) or equivalent certification.
  • Beyondsoft (Malaysia) Sdn. Bhd. is committed to being an equal opportunity employer and provides equal employment opportunities to all employees and applicants. We strive to cultivate a workplace that celebrates diversity and inclusion, where individuals of all backgrounds—regardless of nationality, ethnicity, religion, age, gender identity, sexual orientation, or any other distinguishing trait—can succeed and thrive. We prohibit discrimination and harassment of any type with regard to race, color, religion, age, national origin, disability status, genetics, sexual orientation, gender identity, or expression. This policy applies to all terms and conditions of employment, including recruiting, hiring, and the entire employee lifecycle. We are focused on creating an environment where everyone can reach their full potential.

    Employment offers from Beyondsoft (Malaysia) Sdn. Bhd. are contingent upon the successful completion of any required pre-employment processes, in line with applicable laws and regulations. Beyondsoft (Malaysia) Sdn. Bhd. does not ask for any recruitment fees, nor does it request any unauthorized payments from candidates as part of the hiring process.

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Russell TobinCyberjaya, Malaysia
    Senior Associate - Talent Acquisition - Corporate Strategy Hiring | Specialized in APAC.Strong hands-on experience with using and designing VMware solution such as NSX-T, vRealize Suite, vSphere / vC...Show moreLast updated: 6 days ago
    • Promoted
    Senior Supplier Quality Engineer

    Senior Supplier Quality Engineer

    TelecontinentSeremban, Negeri Sembilan, Malaysia
    Senior Supplier Quality Engineer.Get AI-powered advice on this job and more exclusive features.This range is provided by Telecontinent. Your actual pay will be based on your skills and experience — ...Show moreLast updated: 27 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    bpKuala Lumpur, Malaysia
    Senior Site Reliability Engineer bp Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Overview Who You Will Work With. A multi-disciplinary squad, engaging enterprise platform teams, data pl...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CanonicalSubang Jaya, Selangor, Malaysia
    Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our...Show moreLast updated: 26 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Experian GroupKuala Lumpur, Kuala Lumpur, Malaysia
    Experian Software Solutions (ESS) empowers businesses to make smarter decisions by integrating predictive data, analytics, AI and modern software. With a growing cloud-based platform, ESS is transfo...Show moreLast updated: 4 hours ago
    • Promoted
    Senior Reservoir Engineer (GaffneyCline)

    Senior Reservoir Engineer (GaffneyCline)

    Baker HughesKuala Lumpur, Kuala Lumpur, Malaysia
    Senior Reservoir Engineer (GaffneyCline).Do you have a passion for delivering new technology and problem solving to our customers in the field?. Do you like working in collaborative teams and solvin...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior / Staff / Principal Engineer

    Senior / Staff / Principal Engineer

    CanonicalSepang, Selangor, Malaysia
    Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Join or sign in to find your next job.Senior / Staff / Principal Engineer. Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, ...Show moreLast updated: 4 hours ago
    Site Reliability Engineer

    Site Reliability Engineer

    Unison GroupKuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Show moreLast updated: 27 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    SwiftKuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Show moreLast updated: 4 hours ago
    • Promoted
    Site Reliability / Gitops Engineer

    Site Reliability / Gitops Engineer

    CanonicalPort Klang, Malaysia
    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...Show moreLast updated: 6 days ago
    • Promoted
    Sr. Systems Engineer

    Sr. Systems Engineer

    Two95 International Inc.Kuala Lumpur, Kuala Lumpur, Malaysia
    To ensure successful implementation of projects within schedule.To ensure SLAs are met and achieved the highest customer satisfaction. Oversee the design, development and implementation of clients s...Show moreLast updated: 27 days ago
    • Promoted
    • New!
    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia

    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia

    SpeechifySepang, Selangor, Malaysia
    Senior Software Engineer, AI Model serving - Kuala Lumpur, Malaysia.Speechify Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia. Join or sign in to find your next job.Senior Software Enginee...Show moreLast updated: 4 hours ago
    Site Engineer

    Site Engineer

    Agensi Pekerjaan Great Pyramid Sdn BhdPetaling Jaya, Selangor, Malaysia
    Quick Apply
    We are seeking a skilled and experienced.As a Site Engineer, you will be responsible for overseeing and managing construction projects involving post-tensioned and pre-stressed structures, ensuring...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Software Engineering Manager, Ubuntu Gaming

    Software Engineering Manager, Ubuntu Gaming

    CanonicalSepang, Selangor, Malaysia
    Canonical is hiring for the Software Engineering Manager, Ubuntu Gaming.Location : Remote in the Americas or EMEA region. As the Software Engineering Manager for Ubuntu Gaming, your mission is to hel...Show moreLast updated: 4 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    CanonicalSepang, Selangor, Malaysia
    Site Reliability Engineer role at Canonical.We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices. To succeed in this role, you need to ...Show moreLast updated: 4 hours ago
    Site Reliability Engineer (SRE) / Devops Engineer

    Site Reliability Engineer (SRE) / Devops Engineer

    Unison GroupKuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Show moreLast updated: 27 days ago
    • Promoted
    Site Reliability Engineer (L2 Support)

    Site Reliability Engineer (L2 Support)

    CareCone GroupKuala Lumpur, Malaysia
    Site Reliability Engineer (L2 Support) role at CareCone Group in Kuala Lumpur, Malaysia.Responsible for end-to-end application support, production incident handling, platform monitoring, and coordi...Show moreLast updated: 6 days ago
    • Promoted
    • New!
    Senior Software Engineer (Remote)

    Senior Software Engineer (Remote)

    PortcastKepong, Kuala Lumpur, Malaysia
    Our mission is to transform international supply chains to be more resilient by helping logistics companies realize the full potential of their data. We cater to both shipping lines and cargo airlin...Show moreLast updated: 4 hours ago