Talent.com
Cloud Data AI Ops Engineer (Tier 2)

Cloud Data AI Ops Engineer (Tier 2)

CloudMileKuala Lumpur, Kuala Lumpur, Malaysia
30+ days ago
Job description

CloudMile Federal Territory of Kuala Lumpur, Malaysia

CloudMile Federal Territory of Kuala Lumpur, Malaysia

Overview

CloudMile, a leading AI and Cloud technology company in Asia that focuses on digital transformation and growth for its corporate clients. We are the winner of the 2023 Google Cloud Sales Partner of the Year for the Greater China region, recognized for its innovative thinking, outstanding customer service, and best-in-class use of Google Cloud products and services. As a member of the “CloudMiler” team, you will be at the forefront of assisting companies in Asia in their digital transformation by leveraging cloud technology, data, and AI. We value collaboration and shared goals over the notion of a lone “superstar.”

As a Tier 2 Cloud, Data & AI Operations Engineer , you will be the second line of defense for our customers, responsible for resolving complex technical issues escalated from our Tier 1 team. You will proactively manage key customer environments, acting as an extension of their internal operations team. This role requires a deep understanding of cloud infrastructure, with an added focus on data pipelines and AI workloads. If you are passionate about solving complex problems, proposing innovative solutions, and building strong relationships with clients, we are interested in talking to you!

“CloudMiler” is a group of smart people, passionate about cloud computing, data, and AI. We believe that world-class support is critical to customer success.

Key Job Responsibilities

  • Handle Escalations : Serve as the primary escalation point for complex technical issues that are beyond the capabilities of the Tier 1 team.
  • Proactive Management : Proactively monitor, manage, and optimize the cloud environments for a portfolio of managed service customers.
  • Improvement Proposals : Continuously identify opportunities to enhance customer environments by proposing improvements related to cost optimization, security hardening, and performance tuning.
  • Customer Stakeholder Management : Act as a key technical contact for customer stakeholders, participating in regular reviews to discuss operational performance, upcoming changes, and new initiatives.
  • Troubleshooting : Apply advanced troubleshooting techniques to diagnose and resolve issues across cloud infrastructure, networking, security, and especially data and AI services.
  • Data & AI Operations : Provide operational support for data pipelines, ETL / ELT jobs, machine learning model deployments, and AI APIs, ensuring their stability and performance.
  • Automation : Develop and maintain scripts and automation to streamline operations, reduce manual tasks, and improve overall efficiency.
  • Mentorship : Coach and mentor Tier 1 CloudOps Engineers, sharing knowledge and providing guidance on complex issues.
  • Documentation : Create and maintain detailed technical documentation, including runbooks, standard operating procedures, and knowledge base articles.

A Day in the Life

  • Deep-dive into a complex networking issue escalated from Tier 1, collaborating with both the customer and the cloud provider to find a solution.
  • Proactively review a customer's environment, identifying a few idle resources that could be shut down to reduce costs and drafting a proposal to present to the customer.
  • Attend a virtual meeting with a managed service customer to provide an update on their operational health and discuss a planned change to their data pipeline.
  • Investigate an alert on a failed machine learning model deployment, troubleshooting the underlying issue and working with the data science team to get it back online.
  • Write a Python script to automate a common data transfer task, then add the script to the team's shared repository.
  • Work with leadership to define and implement new processes to improve the efficiency of the operations team.
  • We promote advancement opportunities horizontally and vertically across the organization to help you meet your career goals. We offer programs to help you acquire certification and develop the skills required to be successful in your role.

    Basic Qualifications

  • Bachelor's degree in computer science, information technology, or a related field.
  • At least 4 years of hands-on experience with any one of the major CSPs (Google Cloud, AWS, Azure, Alibaba Cloud).
  • Professional-level certification in at least one of the major CSPs (e.g., Google Cloud Professional Cloud Architect / DevOps Engineer, AWS Professional, Azure Professional).
  • Strong understanding of core cloud computing concepts, including networking, security, compute, and storage.
  • Proven ability to troubleshoot and resolve complex technical issues independently.
  • Hands-on experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
  • Foundational knowledge of data and AI concepts, including data pipelines, ETL / ELT processes, and machine learning model deployment.
  • Proficiency in one or more scripting languages (e.g., Python, Go, Bash).
  • Exceptional communication skills and proven customer-facing experience, with the ability to manage technical stakeholders.
  • Excellent written and verbal communication skills in English. Multilingual ability (English, Mandarin, Malay / Indonesian, other Southeast Asian languages) is an added advantage.
  • Preferred Qualifications

  • Experience in a managed services or consulting role.
  • Proven experience with a variety of data services (e.g., BigQuery, Dataproc, Vertex AI, SageMaker, EMR).
  • Experience with logging and monitoring platforms (e.g., Grafana, Cloud Logging, Datadog).
  • Experience with CI / CD tools and concepts.
  • Knowledge or experience of Site Reliability Engineering (SRE) principles.
  • Seniority level

  • Mid-Senior level
  • Employment type

  • Full-time
  • Job function

  • Information Technology
  • Industries

  • IT Services and IT Consulting
  • Referrals increase your chances of interviewing at CloudMile by 2x

    #J-18808-Ljbffr

    Create a job alert for this search

    Cloud Engineer • Kuala Lumpur, Kuala Lumpur, Malaysia

    Related jobs
    • Promoted
    AWS Cloud Architect

    AWS Cloud Architect

    Hilti GroupPetaling Jaya, Selangor, Malaysia
    AWS Cloud Architect at Hilti Group is responsible for designing, delivering and supporting scalable, secure and efficient cloud solutions. In collaboration with Global IT and other teams, you shape ...Show moreLast updated: 30+ days ago
    • Promoted
    Cloud Engineer

    Cloud Engineer

    ICM Services Sdn BhdKuala Lumpur, Kuala Lumpur, Malaysia
    Deploy and manage highly scalable, fault-tolerant cloud infrastructure on AWS.Implement event-driven architectures using EventBridge, and other AWS tools to enable real-time and asynchronous commun...Show moreLast updated: 4 days ago
    • Promoted
    Cloud Engineer (AWS)

    Cloud Engineer (AWS)

    HytechKuala Lumpur, Kuala Lumpur, Malaysia
    Hytech Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Hytech is a leading management consulting firm headquartered in Australia and Singapore, specializing in digital transformation for ...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer – BI, Analytics & Machine Learning

    Data Engineer – BI, Analytics & Machine Learning

    EvoloperSubang Jaya, Selangor, Malaysia
    Build, test, and optimize ETL / ELT pipelines for data integrity.Develop and enhance BI dashboards for actionable insights across departments. Collaborate with cross-functional teams to identify and a...Show moreLast updated: 16 days ago
    • Promoted
    Data Engineer (Gen AI )

    Data Engineer (Gen AI )

    MaybankKuala Lumpur, Kuala Lumpur, Malaysia
    Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.We are seeking a highly motivated and experienced Data Engineer to join our AI CoE team. The ideal candidate will have strong skills in data...Show moreLast updated: 30+ days ago
    • Promoted
    Cloud Engineer (Azure / AWS)

    Cloud Engineer (Azure / AWS)

    Accenture Southeast AsiaPetaling Jaya, Selangor, Malaysia
    Design, implement, and manage cloud-based solutions on Azure and AWS.Monitor and optimize cloud resources for performance and cost efficiency. Support troubleshooting and resolution of cloud-related...Show moreLast updated: 20 days ago
    • Promoted
    Cloud Engineer (Gen AI)

    Cloud Engineer (Gen AI)

    MaybankKuala Lumpur, Kuala Lumpur, Malaysia
    Design, implement, and manage cloud-based infrastructures to support scalable and secure applications, with a focus on deployments involving AI technologies. Develop and maintain scalable, secure cl...Show moreLast updated: 30+ days ago
    • Promoted
    Cloud Engineer (AWS)

    Cloud Engineer (AWS)

    Hytech Consulting ManagementKuala Lumpur, Kuala Lumpur, Malaysia
    We specialize in providing digital transformation for fintech and financial services companies.We work exclusively with globally recognized top-tier financial services companies, and we offer a com...Show moreLast updated: 2 days ago
    • Promoted
    Azure Engineer (Cloud Platform) Kuala Lumpur •

    Azure Engineer (Cloud Platform) Kuala Lumpur •

    K3 Capital GroupKuala Lumpur, Kuala Lumpur, Malaysia
    Role Purpose : Build and run secure, scalable Azure platforms and services used by multiple K3 brands.Drive reliability, performance, and cost efficiency across the group's technology ecosystem.Desi...Show moreLast updated: 30+ days ago
    • Promoted
    Azure Data Engineer

    Azure Data Engineer

    AvanadeKuala Lumpur, Kuala Lumpur, Malaysia
    Azure Data Engineer role at Avanade in Kuala Lumpur, Malaysia.Focus on designing and building modern data pipelines, data streams, and reporting tools to enable insightful decision-making for Avana...Show moreLast updated: 30+ days ago
    • Promoted
    Azure Engineer (Cloud Platform)

    Azure Engineer (Cloud Platform)

    QuantumaKuala Lumpur, Kuala Lumpur, Malaysia
    Quantuma Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia.Azure Engineer (Cloud Platform).Role Purpose : Build and run secure, scalable Azure platforms and services used by multiple K3 bran...Show moreLast updated: 30+ days ago
    • Promoted
    Data Scientist : AWS / Python / Redshift

    Data Scientist : AWS / Python / Redshift

    Capcon AsiaKuala Lumpur, Kuala Lumpur, Malaysia
    About the job Data Scientist : AWS / Python / Redshift.Spearheading the digital transformation initiative for one of KLs biggest insurance companies / insure-tech. Customizing the existing health tec...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer (AI & ML)

    Data Engineer (AI & ML)

    Property Register Pte LtdPetaling Jaya, Selangor, Malaysia
    Artificial Intelligence & Machine Learning).We are seeking a Data Engineer to support our product, sales, and leadership teams by providing insights from analyzing company data.The ideal candidate ...Show moreLast updated: 30+ days ago
    • Promoted
    AI Engineer

    AI Engineer

    RHB BankKuala Lumpur, Kuala Lumpur, Malaysia
    Collaborate with product owners and domain experts to understand business requirements and tailor AI solutions accordingly. Work closely with Data Scientists / Cloud Platform teams to deploy models se...Show moreLast updated: 19 days ago
    • Promoted
    Data Engineer (Google cloud platform)

    Data Engineer (Google cloud platform)

    CognizantKuala Lumpur, Kuala Lumpur, Malaysia
    APAC Talent Acquisition Lead at Cognizant.Responsible for building and maintaining data pipelines, managing data storage solutions, and ensuring efficient data processing using Google Cloud Platfor...Show moreLast updated: 20 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Involve AsiaKuala Lumpur, Kuala Lumpur, Malaysia
    Involve Asia is seeking a Data Engineer who will be responsible for supporting data analysts and software engineers by providing maintainable infrastructure and tooling to deliver end-to-end soluti...Show moreLast updated: 30+ days ago
    • Promoted
    Cloud Engineer

    Cloud Engineer

    Hong Leong Bank BerhadKuala Lumpur, Kuala Lumpur, Malaysia
    Get AI-powered advice on this job and more exclusive features.Infrastructure Design & Implementation : .Design, build, and deploy scalable, highly available, and secure cloud infrastructure solutions...Show moreLast updated: 30+ days ago
    • Promoted
    AI Operations Engineer (Web3 Greenfield Project)

    AI Operations Engineer (Web3 Greenfield Project)

    ReapKuala Lumpur, Kuala Lumpur, Malaysia
    AI Operations Engineer (Web3 Greenfield Project) – Kuala Lumpur, Malaysia.Reap is a global financial technology company headquartered in Hong Kong with employees across multiple countries.We enable...Show moreLast updated: 20 days ago