Site Reliability Engineer (SRE) - PD at Beyondsoft Singapore
We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of our systems. You will work closely with development and operations teams to design scalable, resilient architectures, implement automation, and manage Kubernetes and cloud infrastructure. This role involves proactive monitoring, incident response, and driving continuous improvements in system reliability.
Responsibilities
- System Reliability
  - Partner with development teams to integrate reliability into the software development lifecycle.
  - Design and implement highly available and fault-tolerant architectures for mission-critical applications.
- Kubernetes Operations
  - Design, implement, manage, and optimize Kubernetes clusters for availability, scalability, and security.
  - Perform upgrades, patches, and security hardening for Kubernetes infrastructure.
- Automation & Infrastructure as Code (IaC)
  - Automate application deployment, scaling, and infrastructure provisioning.
  - Implement CI/CD pipelines for deploying and updating Kubernetes applications.
  - Develop and maintain IaC scripts (e.g., Terraform, Ansible) for provisioning and managing cloud and container resources.
- Cloud Integration
  - Utilize AWS, GCP, or Azure services for Kubernetes deployments and integrations.
  - Apply cloud-native best practices for scalability and performance.
- Monitoring & Alerting
  - Implement monitoring, logging, and alerting solutions (Prometheus, Grafana, ELK, etc.).
  - Proactively identify and resolve performance bottlenecks and reliability issues.
- Incident Response
  - Respond to and resolve production incidents with minimal downtime.
  - Conduct post-incident analysis and implement preventive measures.
- Capacity Planning
  - Perform capacity planning to ensure the Kubernetes infrastructure can accommodate current and future workloads in the cloud.
- Security
  - Collaborate with the security team to implement and enforce Kubernetes and cloud security best practices.
  - Perform regular vulnerability assessments and compliance checks.
- Collaboration & Documentation
  - Work cross-functionally with DevOps, security, and development teams.
  - Maintain comprehensive documentation for processes and configurations.
Qualifications
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Minimum 3 years of proven experience as a Site Reliability Engineer or similar functional role.
- Strong programming or scripting skills, with proficiency in languages such as Bash, Python, Go, or Java.
- Extensive experience with Kubernetes orchestration, including cluster setup, management, and troubleshooting.
- Experience with infrastructure-as-code tools (e.g., Terraform, Ansible) and cloud platforms.
- Solid understanding of virtualization and networking concepts and principles.
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
- Knowledge of cloud security best practices.
- Familiarity with microservices frameworks.
- Advantage: Certified Kubernetes Administrator (CKA) or equivalent certification.
Beyondsoft (Malaysia) Sdn. Bhd. is committed to being an equal opportunity employer and provides equal employment opportunities to all employees and applicants. We strive to cultivate a workplace that celebrates diversity and inclusion, where individuals of all backgrounds, regardless of nationality, ethnicity, religion, age, gender identity, sexual orientation, or any other distinguishing trait, can succeed and thrive. We prohibit discrimination and harassment of any type with regard to race, color, religion, age, national origin, disability status, genetics, sexual orientation, gender identity, or expression. This policy applies to all terms and conditions of employment, including recruiting, hiring, and the entire employee lifecycle. We are focused on creating an environment where everyone can reach their full potential.
Employment offers from Beyondsoft (Malaysia) Sdn. Bhd. are contingent upon the successful completion of any required pre-employment processes, in line with applicable laws and regulations. Beyondsoft (Malaysia) Sdn. Bhd. does not ask for any recruitment fees, nor does it request any unauthorized payments from candidates as part of the hiring process.