Cloud Deployment Engineer (NOC)
Overview :
We are seeking a skilled NOC Engineer with a strong focus on Linux system administration and application support. This role involves troubleshooting a range of issues, including database performance, network connectivity, and deployment failures. The ideal candidate will have hands-on experience with compute platforms such as Kubernetes and virtual machines, along with a solid understanding of various storage solutions. We are looking for high-performance engineers who are curious and capable of solving real-world problems.
Key Responsibilities :
- Monitor and maintain system performance to ensure the stability and reliability of applications and infrastructure.
- Troubleshoot and resolve issues related to database performance, network connectivity, and deployment failures, including diagnosing problems at the underlying platform level (e.g., Kubernetes, virtual machines).
- Ensure that issues are resolved within the stipulated Service Level Agreements (SLAs), maintaining high standards of service delivery.
- Identify and address performance bottlenecks in applications and infrastructure.
- Conduct root cause analysis for recurring incidents to develop long-term solutions.
- Improve monitoring solutions to proactively identify and mitigate issues before they impact services.
- Assist in the deployment and configuration of new applications and services, ensuring adherence to best practices.
- Develop and maintain scripts for automation of routine tasks and monitoring processes.
- Participate in on-call rotations and respond to critical incidents as they arise.
- Analyze system logs and metrics to identify trends and potential areas for improvement.
- Assist in capacity planning and performance tuning to ensure optimal resource utilization.
Qualifications :
Strong expertise in Linux system administration.Proven experience in troubleshooting application support issues with a focus on performance and connectivity.Experience in Bash / Shell scripting or automation for system administration tasks.Solid understanding of database management and performance tuning.Hands-on experience with Kubernetes and virtual machines.Ability to diagnose and resolve complex technical issues across compute, storage, network, and database components.Strong analytical skills and intellectual curiosity; able to question existing processes and understand their implications.Self-motivated learner who can operate autonomously with minimal guidance.Excellent problem-solving abilities and a proactive approach to identifying and addressing challenges.Open to a rotational shift schedule across different time slots, with reasonable schedules shared in advance.Able to communicate effectively in Mandarin would be an added advantage.Preferred Skills :
Familiarity with monitoring tools and performance optimization techniques.Knowledge of networking concepts and troubleshooting methodologies.Hands-on knowledge of cloud platforms (e.g., AWS, Azure, Google Cloud) and their services.Familiarity with DevOps practices and frameworks, including CI / CD, infrastructure as code, and containerization.Familiarity with Big Data lifecycle (Big Data management / ingestion / processing / visualization) and the corresponding technologies (e.g., HDFS, YARN, Kafka, Spark, Flink, Hive, ELK)