What is your mission?
The Linux System Administrator team works with various internal teams to assist with deployment procedures, deploying code and configuration changes. The team also provides Tier‑1 support for the various mission critical applications and services. In the near future, the team will also be taking over monitoring aspects of various systems and services from other teams as well.
Core Responsibilities
- Collaborate with internal teams to deploy in‑house developed software and configuration changes to Linux / Unix servers, ensuring seamless completion with no service disruption.
- Troubleshooting server issues related to Linux / Unix.
- Provide operational support across a suite of production systems and services running on the cloud.
- Proactively monitor the system logs, system monitoring and performance of Linux / Unix systems and services, responding promptly to incidents and alerts, and keeping stakeholders informed of resolutions.
- Collaborate with DevOps, development, and support teams to deploy and manage applications.
- Participate in a 24 / 7 on‑call rotation, with flexibility for night shifts, weekends, variable schedules, and necessary overtime.
- Provide Tier‑1 Linux / Unix support for mission‑critical production services.
- Create and maintain documentation for daily processes, enhancing team collaboration, learning, and training.
- Manage and troubleshoot network services (DNS, DHCP, NFS, FTP, SSH, etc.)
Who are we looking for?
Bachelor’s degree in computer science or a related field.Experience in server and application support.Proficiency in Linux / Unix administration.Familiarity with ticketing and documentation tools like Jira and Confluence is preferred.Strong analytical skills and excellent written and verbal communication abilities.Proven ability to multitask, prioritize effectively, and perform well under pressure in a fast‑paced environment.Experience in configuring, deploying, and maintaining infrastructure and production services using automation tools to support scalable and resilient production systems.Experience supporting Kubernetes environments.Knowledge of Docker / container technologies and microservices architecture.Hands‑on experience with cloud technologies and supporting cloud‑based services.Demonstrated ability to work effectively within a team.Nice to have
Familiarity with incident review processes, to understand root cause of issues and follow‑up to ensure proper fixes are deployed.Monitoring experience with Grafana and other monitoring solutions.Experience with automation to automate day‑to‑day tasks.Experience working in a micro‑services environment.Comfortable partnering with cross‑functional teams to understand servers / services setup, and work with them to come up with monitoring solutions based on best practice.#J-18808-Ljbffr