Overview
We are Lenovo. We do what we say. We own what we do. We WOW our customers. Lenovo is a global technology company with a diverse portfolio and a focus on delivering Smarter Technology for All. We are seeking a skilled
Golang
Engineer to design and implement comprehensive
monitoring
and
observability
solutions for our cloud infrastructure. This role is responsible for building scalable monitoring systems that provide real-time visibility into system health and performance across Linux, OpenStack, and Kubernetes environments. Note :
This job description reflects the responsibilities and qualifications for the role and may be subject to change. Responsibilities
Design and develop core components of
Kubernetes -based container platforms using
Golang , focusing on control plane extensions, operators, and cloud-native service meshes. Implement and optimize
Kubernetes networking
(CNI plugins like Calico / Cilium) and
storage solutions
(CSI drivers, Rook / Ceph integration), addressing multi-tenant isolation and high-throughput data paths. Troubleshoot deep-level Kubernetes issues (e.g., etcd corruption, kube-scheduler deadlocks, CNI policy conflicts) using Golang debugging tools (pprof, delve) and log analysis. Build automation frameworks for cluster lifecycle management, security hardening, and observability using
Golang
(primary) and
Python
(secondary for scripting). Collaborate with infrastructure teams to align platform capabilities with AI workload requirements, optimizing resource scheduling for GPU / accelerator workloads. Qualifications
Technical Expertise : Mastery of
Golang
with 3+ years building production-grade systems using Goroutines, interfaces, and standard library (e.g., net / http, k8s.io / client-go). Kubernetes Internals : Deep understanding of control plane components (API server, scheduler, controller manager) and ability to extend via CRDs / Operators. Network / Storage Proficiency : Hands-on experience selecting and implementing CNI (VXLAN / BGP modes) and CSI solutions (RBD, iSCSI), with performance benchmarking skills. Linux / Container Expertise : Proficient in cgroups, namespaces, and container runtimes (containerd, CRI-O) for debugging resource leaks or security flaws. Experience
3+ years developing cloud infrastructure with
Golang
as primary language, including at least one major Kubernetes platform project (e.g., cluster autoscaler, custom scheduler). Demonstrated ability to resolve critical production issues (e.g., etcd leader election failures, network policy drops) in large-scale clusters (1k nodes). Soft Skills
Rigorous analytical approach to system design and failure root-cause analysis. Ability to document complex technical concepts for cross-team alignment. Preferred Add-ons
Kubernetes SIG contributions (e.g., networking, storage, or scheduling working groups). Experience with eBPF-based tools (Cilium, Pixie) for advanced network observability. Proficiency in Python for infrastructure scripting (Ansible / Terraform integrations) or Java for enterprise service interoperability. Familiarity with service meshes (Linkerd, Istio) and GitOps pipelines (Argo CD, Flux). Knowledge of cloud-native security (OPA / Gatekeeper, Kyverno) and AI / ML workload optimization. #NOVA #NovaCloud EEO Statement :
We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, religion, sexual orientation, gender identity, national origin, veteran status, disability or any federal, state, or local protected class. WD
Developer • Petaling Jaya, Malaysia