Senior DevOps / Site Reliability Engineer

MoneyLion, Kuala Lumpur, Malaysia

Posted 17 days ago

Job description

Responsibilities

Provide or develop the tooling that allows individual Product Teams to be autonomous, via a shared Kubernetes platform, Codefresh CI/CD and self-service infrastructure resources through Atlantis / Terraform
Participate in a 24/7 on-call rotation that supports our production Kubernetes platform running in AWS
Work to constantly improve our resiliency by developing self-healing, self-assembling infrastructure and by proactively running load tests and Chaos Engineering experiments
Dive into problems with an eye to both immediate remediation and the follow-through changes and automation that will prevent future occurrences
Maintain day-to-day vigilance with regard to security while helping to enhance the intrinsic security of the overall production system
Own internal and external SLAs and ensure that they meet or exceed expectations, and that system-centric KPIs are continuously monitored and improved
Provide consultation and support for Product Teams in achieving their OKRs: Availability and Service Excellence
Handle day-to-day duties: on-boarding, off-boarding, managing resource access permissions and maintaining shared tooling such as CI/CD, including artifact repositories
Review architecture across teams, ensuring best practices are propagated company-wide

About You:

Exposure to cloud IaaS (AWS, GCP or other relevant)

Linux administration (CoreOS, or any Linux in general)

Linux containers, orchestration (Docker, Kubernetes), and immutable infrastructure

Familiarity with Infrastructure-as-Code principles and technologies like Terraform or CloudFormation

Ability to learn quickly, think critically and make snap judgements based on measured data in high-pressure situations

Strong communicator with the ability to guide teams to troubleshoot and tune production performance issues

Working knowledge of industry best practices with regard to information security

Bonus Points:

Have prior experience working in high-performance, highly available distributed systems

Are able to knowledgeably implement performance and security in complex multi-team scenarios

Are familiar with microservices architectures and able to understand the trade-offs

Have practical knowledge of event streaming and experience in designing systems that leverage SQS, Kafka and Kinesis correctly

Have good knowledge of the HashiCorp stack, especially Vault

What's Next...

After you submit your application, you can expect the following steps in the recruitment process:

Interview - Talent Acquisition Team (Virtual or face-to-face)

Online Technical Test

Interview - Hiring Manager (Virtual or face-to-face)

What We Value

We value growth-minded and collaborative people with high learning agility who embody our core values of teamwork, customer-first and innovation. Every member of the MoneyLion Team is passionate about fintech and ready to give 100% in helping us achieve our mission.

Working At MoneyLion

At MoneyLion, we want you to be well and thrive. Our generous benefits package includes:

Wellness perks

Paid parental leave

Generous Paid Time Off

Learning and Development resources

Flexible working hours

MoneyLion is committed to equal employment opportunities for all employees. Inside our company, every decision we make regarding our employees is based on merit, competence, and performance, completely free of discrimination. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. Within that team, no one will feel more "other" than anyone else. We realize the full promise of diversity and want you to bring your whole self to work every single day.

Job ID 4334609004
