6 days ago Be among the first 25 applicants.
Get AI-powered advice on this job and more exclusive features.
Who are Tyk, and what do we do?
The Tyk API Management platform is helping to drive the connected world and power new products and services. We’re changing the way that organisations connect any number of their systems and services. Whether internal, external, public or highly encrypted systems, Tyk helps businesses drive value across the retail, finance, telecoms, healthcare, or media industries.
Our Mission
Tyk is on a mission to connect every system in the world. We started by building an API Management platform.
The role
At Tyk, we’re obsessed with building software that solves problems. We count on our Site Reliability Engineers (SREs) to empower users with a rich feature set, high availability, and stellar performance to pursue their missions. We’re looking for an experienced Senior SRE to optimise, automate, and improve our performance using real‑time, massive‑scale data insights.
Responsibilities
- Lead hands‑on maintenance and optimisation of our global cloud platform within defined SLAs.
- Collaborate to shape SRE strategy and translate it into actionable technical plans coordinated through SCRUM.
- Identify reliability issues, drive root‑cause analysis, and implement solutions alongside your squad.
- Lead performance tuning and fault finding through analysis of OS and application metrics.
- Design and implement automation for common operational tasks and cloud‑operations workflows.
- Develop proactive alerting, monitoring roadmap, and relevant dashboards; define and track KPIs.
- Participate in on‑call rotation, ensuring effective incident response and resolution within SLAs.
- Conduct blame‑free postmortems, document findings, and maintain operational runbooks.
- Drive multi‑region and multi‑cloud platform expansion with focus on scalability and automation.
- Optimize infrastructure performance and cost efficiency without impacting service delivery.
- Engage with commercial teams on growth plans and translate them into technical SRE strategies.
- Coordinate penetration testing through provider liaison, technical setup, and environment configuration.
- Champion continuous improvement across processes, communication, and team practices.
- Model excellence in software design and knowledge sharing.
- Plan and execute software upgrades to enhance cloud services.
Qualifications
Experience in an SRE role.Strong knowledge of cloud technologies and SLA / SLO / SLI management.Excellent communication and leadership skills.Ability to analyse and improve operational processes and performance metrics.Experience in software design, automation, and root‑cause analysis.On‑call support experience and customer‑focused mindset.Collaborative attitude with commercial and technical teams.Launching and operating production Kubernetes clusters.Designing and operating infrastructure on AWS and other providers.Operating MongoDB or other document database clusters.Operating Redis or other key‑value storage clusters.Administering Linux servers.Operating Prometheus and Grafana.Operating logging collection and analysis systems.Participating in the on‑call rotation (4 : 00am – 16 : 00pm UTC).Skills
Kubernetes (administrator)Go and / or Python (advanced)AWS / EKS (advanced)Linux (advanced)Terraform and IaC (proficient)Helm (proficient)MongoDB (or similar)Redis (or similar)Monitoring – Prometheus, Grafana, Thanos (familiar)Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.)Common networking protocols (DNS, TCP / IP, HTTP, TLS, UDP)Proactive, energetic, innovative and change‑orientedA desire to lead / mentor a teamBenefits
Unlimited paid holidays.Flexible work hours and total flexibility to work from anywhere.Employee share scheme.Generous maternity and paternity leave.Volunteering days.Employee wellbeing platform.Tyk is an equal‑opportunity employer and we are determined to ensure that no applicant or employee receives less favourable treatment on the grounds of gender, age, disability, religion, belief, sexual orientation, marital status, or race, or is disadvantaged by conditions or requirements that cannot be shown to be justifiable.
Location : Kuala Lumpur, Malaysia
#J-18808-Ljbffr