Talent.com
Lead Site Reliability Engineer
Lead Site Reliability EngineerSwift • Kuala Lumpur, Kuala Lumpur, Malaysia
Lead Site Reliability Engineer

Lead Site Reliability Engineer

Swift • Kuala Lumpur, Kuala Lumpur, Malaysia
30+ days ago
Job description

About Us

We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value – across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we’re proud to support the global economy.

We’re unique too. We were established to find a better way for the global financial community to move value – a reliable, safe and secure approach that the community can trust, completely. We’re always striving to be better and are constantly evolving in an ever‑changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions.

Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia

What to Expect

  • Work through all phases of the system administration life cycle, including capacity planning, architecture design, compliance, deployment & configuration, monitoring, and incident management.
  • Develop automation scripts, infrastructure as code, and tooling using industry best practices to improve system reliability, reduce manual effort, and enable self‑service.
  • Review system architectures design, deployment strategies, observability setups, and operational documentation to ensure reliability and operational excellence.
  • Analyze production issues, identify root causes, and implement long‑term reliability improvements through automation, monitoring, and architectural enhancements.
  • Work collaboratively with other team members and provide guidance to more junior team members.
  • Organize an efficient handover through high quality documentation and training.
  • Automate the deployment and operation of multi‑tenant infrastructure, handling tasks that ensure system resilience and availability.
  • Develop and maintain monitoring tools, dashboards, and self‑healing mechanisms.
  • Participate in on‑call rotations, conduct blameless postmortems, and drive continuous learning.
  • Work closely with developers, product teams, and engineering stakeholders to troubleshoot issues, improve systems, and integrate reliability improvements.
  • Capable of providing accurate project estimates and strategically adapting plans throughout the project lifecycle.

Qualifications

  • Minimum 10 years of system administration experience in an (preferably) international setting.
  • Minimum 2 years of experience leading projects.
  • Familiarity or experience with data ingestion with big data technologies (Elastic Search, Logstash, Kibana and Kafka).
  • Experience with CI / CD development & deployment tools such as Maven, Jenkins, Nexus, Git, and Docker.
  • Proficiency in Linux OS.
  • Proficiency in scripting and automation (e.g. Python, PowerShell, YAML) with the ability to develop tools and infrastructure as code (preferably Ansible, Terraform, Kubernetes, OpenShift).
  • Understanding of distributed systems and microservices architectures, including REST and SOAP APIs.
  • Hands‑on experience with ITIL processes, including Incident, Problem, and Continual Improvement, is an advantage.
  • Experience working within an Agile‑driven environment.
  • Practical experience in building metrics for data‑driven reporting.
  • Strong interpersonal skills with a customer‑centric mindset and ability to work effectively across diverse cultures.
  • Proven ability to collaborate with both local and remote teams across different time zones.
  • Familiarity with or experience in managing VM hosts using vCenter is an advantage.
  • What We Offer

  • We put you in control of career
  • We give you a competitive package
  • We help you perform at your best
  • We help you make a difference
  • We give you the freedom to be yourself
  • We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. A diverse and inclusive environment in which everyone’s voice counts and where you can reach your full potential.

    If you believe you require a reasonable accommodation to participate in the job application or interview process, please contact us to request accommodation.

    Don’t meet every single requirement? At Swift, we are dedicated to building a workplace where people can bring their full selves and ideas to the team, so if you are excited about this role, we encourage you to apply even if you do not meet every single qualification.

    Seniority Level

  • Mid‑Senior level
  • Employment Type

  • Full‑time
  • Job Function

  • Engineering and Information Technology
  • #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

    Related jobs
    Senior site reliability engineer

    Senior site reliability engineer

    bp • Kuala Lumpur, Kuala Lumpur, Malaysia
    Senior Site Reliability Engineer – bp, Kuala Lumpur, Malaysia.We are looking for an experienced.Senior Site Reliability Engineer. As a core member of the engineering team, you will ensure our platfo...Show more
    Last updated: 11 days ago • Promoted
    Site Reliability Engineer III

    Site Reliability Engineer III

    Guidewire Software • Kuala Lumpur, Kuala Lumpur, Malaysia
    At Guidewire, we make software that offers Property and Casualty (P&C) Insurance companies the tools to take care of their customers when they need it the most, whether that’s a time of crisis, a n...Show more
    Last updated: 30+ days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SWIFT • Kuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    bp • Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Your mission is to ensure that systems are highly available, scalable, secure, and efficient. You’ll work closely with engineering teams to ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Swift Software • Kuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer page is loaded## Site Reliability Engineerlocations : Kuala Lumpur, Malaysiaposted on : Posted Todayjob requisition id : We’re the world’s leading provider of secure f...Show more
    Last updated: 25 days ago • Promoted
    Site Reliability Engineers (Middle & Senior)

    Site Reliability Engineers (Middle & Senior)

    PEOPLE PROFILERS • Kuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Site Reliability Engineers (Middle & Senior).People Profilers is hiring on behalf of. Site Reliability Engineers (Consultant & Senior Consul...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Razer Inc. • Kuala Lumpur, Kuala Lumpur, Malaysia
    Bangsar South, Federal Territory of Kuala Lumpur, Malaysia.Senior Site Reliability Engineer role at Razer Inc.The SRE will join the infrastructure and platform engineering team, with hands-on exper...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    BP PLC • Kuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer page is loaded## Site Reliability Engineerremote type : This position is a hybrid of office / remote workinglocations : Malaysia - Kuala Lumpurtime type : Full timeposted...Show more
    Last updated: 13 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SWIFT • Kuala Lumpur, Kuala Lumpur, Malaysia
    We’re the world’s leading provider of secure financial messaging services, headquartered in Belgium.We are the way the world moves value – across borders, through cities and overseas.No other organ...Show more
    Last updated: 24 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Unison Group • Kuala Lumpur, Federal Territory of Kuala Lumpur, MY
    Quick Apply
    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and op...Show more
    Last updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Swift • Kuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer (DevOps) – Swift.Location : Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Senior Level : Mid‑senior level. Job Function : Engineering and Information Technology.Sw...Show more
    Last updated: 16 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    RiDiK (a Subsidiary of CLPS. Nasdaq : CLPS) • Kuala Lumpur, Kuala Lumpur, Malaysia
    Site Reliability Engineer (SRE).We are seeking a skilled Site Reliability Engineer (SRE) to join our Cloud Engineering team in Cyberjaya. You will be responsible for ensuring the availability, perfo...Show more
    Last updated: 18 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Canonical • Subang Jaya, Subang Jaya, Malaysia
    Canonical Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy Services • Kuala Lumpur, Kuala Lumpur, Malaysia
    Talent Acquisition | Human Resource Executive | Tata Consultancy Service.Join Tata Consultancy Services, Asia Pacific and be part of an organization committed to sustainable development for our fut...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TIME dotCom Berhad • Shah Alam, Selangor, Malaysia
    We are looking for a skilled and passionate Senior or Lead, Site Reliability Engineer to join our dynamic retail IT team at TIME. In this critical role, you will be instrumental in integrating secur...Show more
    Last updated: 13 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Smart Teq Solution Sdn Bhd • Kuala Lumpur, Kuala Lumpur, Malaysia
    Ensure all our infrastructure are running at optimal condition.Provide deployment, patches and update on all services that running on public cloud and on premise. Identify and resolve support ticket...Show more
    Last updated: 30+ days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Swift Software • Kuala Lumpur, Kuala Lumpur, Malaysia
    Lead Site Reliability Engineer page is loaded## Lead Site Reliability Engineerlocations : Kuala Lumpur, Malaysiatime type : Full timeposted on : Posted Todayjob requisition id : We’re the worl...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Razer Inc. • Kuala Lumpur, Kuala Lumpur, Malaysia
    Bangsar South, Federal Territory of Kuala Lumpur, Malaysia.Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you ...Show more
    Last updated: 27 days ago • Promoted