Responsibilities:
Documentation & Inventory Management
- Maintain accurate and up-to-date documentation of server racks, servers, and networking equipment across data centers.
- Provide detailed documentation to support server rack expansion, dismantling, or relocation.
Project & Process Coordination
- Coordinate between logistics and technical teams to ensure alignment on server rack deployment timelines and status.
- Facilitate smooth handovers and updates during infrastructure changes or new deployments.
- Act as a communication bridge between technical teams, ensuring information flows clearly and accurately, while also working closely with global teams to maintain alignment and collaboration.
GPU Cluster Management
- Manage and monitor clusters of GPU servers hosted on the company website.
- Perform basic troubleshooting on Ubuntu systems to ensure GPU servers remain operational.
- Verify and ensure that all hosted servers are legitimate and compliant with company policies.
Operational Excellence
- Ensure infrastructure-related processes are executed efficiently, with minimal downtime.
- Act as the primary point of contact for documentation, equipment details, and operational readiness.
Requirements:
- Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience).
- Proven experience in data center operations, infrastructure delivery, or IT service management.
- Familiarity with server rack deployment, networking equipment, and documentation best practices.
- Basic troubleshooting skills with Ubuntu/Linux systems.
- Experience managing GPU server clusters (preferred but not mandatory).
- Strong bilingual communication skills in English and Chinese (both written and verbal), with the ability to collaborate effectively with cross-border teams.
- Strong coordination and communication skills to bridge technical and logistics functions.
- Able to manage and lead a team effectively.
- Highly organized with strong attention to detail.