Talent.com
Lead Site Reliability Engineer

Swift
Kuala Lumpur, Kuala Lumpur, Malaysia
1 day ago
Job description

About Us

We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value – across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we’re proud to support the global economy.

We’re unique too. We were established to find a better way for the global financial community to move value – a reliable, safe and secure approach that the community can trust, completely. We’re always striving to be better and are constantly evolving in an ever‑changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions.

What to Expect

  • Work through all phases of the system administration life cycle, including capacity planning, architecture design, compliance, deployment & configuration, monitoring, and incident management.
  • Develop automation scripts, infrastructure as code, and tooling using industry best practices to improve system reliability, reduce manual effort, and enable self‑service.
  • Review system architecture designs, deployment strategies, observability setups, and operational documentation to ensure reliability and operational excellence.
  • Analyze production issues, identify root causes, and implement long‑term reliability improvements through automation, monitoring, and architectural enhancements.
  • Work collaboratively with other team members and provide guidance to more junior team members.
  • Organize an efficient handover through high quality documentation and training.
  • Automate the deployment and operation of multi‑tenant infrastructure, handling tasks that ensure system resilience and availability.
  • Develop and maintain monitoring tools, dashboards, and self‑healing mechanisms.
  • Participate in on‑call rotations, conduct blameless postmortems, and drive continuous learning.
  • Work closely with developers, product teams, and engineering stakeholders to troubleshoot issues, improve systems, and integrate reliability improvements.
  • Provide accurate project estimates and strategically adapt plans throughout the project lifecycle.

Qualifications

  • Minimum 10 years of system administration experience, preferably in an international setting.
  • Minimum 2 years of experience leading projects.
  • Familiarity with, or experience in, data ingestion using big data technologies (Elasticsearch, Logstash, Kibana, and Kafka).
  • Experience with CI/CD development and deployment tools such as Maven, Jenkins, Nexus, Git, and Docker.
  • Proficiency in Linux OS.
  • Proficiency in scripting and automation (e.g. Python, PowerShell, YAML) with the ability to develop tools and infrastructure as code (preferably Ansible, Terraform, Kubernetes, OpenShift).
  • Understanding of distributed systems and microservices architectures, including REST and SOAP APIs.
  • Hands‑on experience with ITIL processes, including Incident, Problem, and Continual Improvement, is an advantage.
  • Experience working within an Agile‑driven environment.
  • Practical experience in building metrics for data‑driven reporting.
  • Strong interpersonal skills with a customer‑centric mindset and ability to work effectively across diverse cultures.
  • Proven ability to collaborate with both local and remote teams across different time zones.
  • Familiarity with or experience in managing VM hosts using vCenter is an advantage.
What We Offer

  • We put you in control of your career
  • We give you a competitive package
  • We help you perform at your best
  • We help you make a difference
  • We give you the freedom to be yourself

We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. A diverse and inclusive environment in which everyone’s voice counts and where you can reach your full potential.

If you believe you require a reasonable accommodation to participate in the job application or interview process, please contact us to request accommodation.

Don’t meet every single requirement? At Swift, we are dedicated to building a workplace where people can bring their full selves and ideas to the team, so if you are excited about this role, we encourage you to apply even if you do not meet every single qualification.

Seniority Level

  • Mid‑Senior level

Employment Type

  • Full‑time

Job Function

  • Engineering and Information Technology