Talent.com
Tawaran kerja ini tidak tersedia di negara anda.
Senior DevOps / Site Reliability Engineer

Senior DevOps / Site Reliability Engineer

MoneyLionKuala Lumpur, Malaysia
17 hari lalu
Penerangan pekerjaan

Responsibilities

  • Provide or develop the tooling that will allow the individual Product Teams to be autonomous, via shared Kubernetes platform, Codefresh CI / CD and self-services infra resources via Atlantis / Terraform
  • Participate in a 24 / 7 on-call rotation that supports our production Kubernetes platform running in AWS
  • Work to constantly improve our resiliency by developing self-healing, self-assembling infrastructure; proactively running load tests and Chaos Engineering experiments
  • Dive into problems with an eye to both immediate remediation as well as the follow-through changes and automation that will prevent future occurrences
  • Maintain day-to-day vigilance with regards to security while helping to enhance the intrinsic security of the overall production system.
  • Own and ensure that internal and external SLA's meet and exceed expectations, System centric KPIs are continuously monitored and improved
  • Provide consultation and support for Product Teams in achieving their OKRs : Availability and Service Excellence
  • Handle day-to-day duties : on-boarding, off-boarding, manage resource access permissions and maintain the shared tooling like CI / CD, inc. artifact repositories
  • Review architecture across teams; ensuring best practices are propagatedpany wide

About You :

  • Exposure to cloud IaaS (AWS, GCP or other relevant)
  • Linux administration (CoreOS, or any Linux in general)
  • Linux containers, orchestration (Docker, Kubernetes), and Immutable infrastructure
  • Familiarity with Infrastructure-as-Code principals and technologies like Terraform or CloudFormation
  • Ability to learn quickly, think critically and make snap judgements based on measured data in high pressure situations
  • Strongmunicator and have the ability to guide teams to troubleshoot and tune production performance issues
  • Working knowledge of industry best practices with regards to information security
  • Bonus Points :

  • Have prior experience working in high performance and highly available distributed systems
  • Are able to knowledgeably implement performance, and security inplex multi-teams scenarios
  • Are familiar with microservices architectures and able to understand the trade-offs
  • Have practical knowledge of event streaming and experience in designing systems to leverage SQS, Kafka, Kinesis correctly
  • Have good knowledge about Hashicorp stack; especially Vault
  • What's Next...

    After you submit your application, you can expect to prepare for the following steps in the recruitment process :

  • Interview - Talent Acquisition Team (Virtual or face-to-face)
  • Online Technical Test
  • Hiring Manager interview -(Virtual or face-to-face)
  • What We Value

    We value growth-minded and collaborative people with high learning agility who embody our core values of teamwork, customer-first and innovation . Every member of the MoneyLion Team is passionate about fintech and ready to give 100% in helping us achieve our mission.

    Working At MoneyLion

    At MoneyLion, we want you to be well and thrive. Our generous benefits package includes :

  • Wellness perks
  • Paid parental leave
  • Generous Paid Time Off
  • Learning and Development resources
  • Flexible working hours
  • MoneyLion ismitted to equal employment opportunities for all employees. Inside ourpany, every decision we make regarding our employees is based on merit,petence, and performance,pletely free of discrimination. We aremitted to building a team that represents a variety of backgrounds, perspectives, and skills. Within that team, no one will feel more "other" than anyone else. We realize the full promise of diversity and want you to bring your whole self to work every single day. Job ID 4334609004

    Buat amaran kerja untuk carian ini

    Reliability Engineer • Kuala Lumpur, Malaysia