Talent.com
Specialist, Site Reliability Engineer (SRE)

Specialist, Site Reliability Engineer (SRE)

TNG DigitalKuala Lumpur, Kuala Lumpur, Malaysia
15 days ago
Job description

Specialist, Site Reliability Engineer (SRE)

We are hiring for a Specialist, Site Reliability Engineer (SRE) to join our team.

Overview

Role focuses on network administration, cloud infrastructure management, security and compliance, and incident response to ensure reliable and secure services.

Responsibilities

  • Network administration

Design, implement and manage network infrastructure

  • Monitor network performance and ensure security compliance
  • Maintain and implement cross / multi cloud networking
  • Implement network segmentation and access control to improve security
  • Implement and maintain monitoring systems to proactively identify performance bottlenecks, security vulnerabilities, and other issues
  • Cloud infrastructure management, capacity planning and monitoring
  • Maintain and optimize cloud-based infrastructure

  • Deploy, configure, and manage Linux and Windows servers using automation tools
  • Monitor and troubleshoot infrastructure performance, security, scalability
  • Assess system capacity and performance requirements, implement scalable solutions that meet future growth needs
  • Monitor and analyze cloud spending across multi-cloud environments (AWS, Azure, Alibaba Cloud)
  • Work with finance teams to ensure accurate cloud budgeting, forecasting, and chargeback models
  • Optimize storage, networking, and computing costs without impacting performance
  • Security and compliance
  • Collaborate with SRE and Security teams to manage and optimize resources and apply industry-standard security

  • Collaborate with development teams to design and implement secure infrastructure architectures, ensuring confidentiality, integrity, and availability
  • Ensure compliance with security, audits and regulatory requirements
  • Plan and execute disaster recovery plans
  • Incident response and troubleshooting
  • Collaborate with cross-functional teams to address incidents and restore services quickly

  • Investigate and resolve incidents, perform root cause analysis to prevent recurrence
  • Qualifications

  • Bachelor's degree in Computer Science, Network or related field
  • Professional cloud certification
  • Proven 5+ years of experience in a Cloud Network or Cloud Infrastructure role
  • Strong experience in site reliability engineering, infrastructure engineering or a similar role
  • Strong knowledge of networks, protocols, network security and cloud networking
  • Proven track record of cloud cost optimization
  • Experience with cloud platforms (e.g., AWS, Azure, GCP, Alibaba Cloud) and infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation); basic or advanced cloud certification is a plus
  • Experience with containerization technologies like Docker and container orchestration platforms such as Kubernetes is a plus
  • Knowledge of networking principles and protocols
  • Deep knowledge of Linux / Unix systems and administration
  • Strong problem-solving skills and the ability to handle high-pressure situations calmly and effectively
  • Strong attention to detail and a commitment to delivering high-quality results
  • Monthly eWallet allowance
  • Additional 1% employer EPF contribution from 1st to 3rd year, with increases based on continued service
  • Unlimited office pantry fruits, snacks and drinks
  • Mobile and broadband subscription reimbursement
  • Flexibility to opt dependants coverage for outpatient medical benefits
  • Additional leave including family leave and paid care leave
  • Medical coverage including dental, optometrist, mental care, maternity, traditional Chinese medicine (TCM) and Chiropractic
  • Corporate membership discount and more to explore
  • Seniority level

  • Mid-Senior level
  • Employment type

  • Full-time
  • Job function

  • Information Technology
  • We believe you have what it takes to fit into the Touch ’ n Go family and help revolutionize the Fintech industry. If you're ready to take the next step, apply now!

    Touch ’ n Go is an organization that strives to provide Equal Opportunity Employment, based on merit, qualifications, capabilities, and calibre. It is Touch ’ n Go’s policy to not discriminate based on age, race, religion, colour or other personal status, identity or characteristics. Fair Opportunity is Our Value and Practice. Please advise us of any accommodations you may need by e-mailing :

    Note : Only shortlisted candidates will be contacted.

    Let’s keep LEAP-ing forward together!

    Location : Kuala Lumpur, Malaysia

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia