← all jobs

[Remote] AI Infrastructure Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. vCluster is a venture-backed tech startup pioneering Kubernetes virtualization for the AI era. As an AI Infrastructure Engineer, you will work directly with customers to drive technical deployments and optimize GPU infrastructure, ensuring a smooth transition to production-ready environments.

Responsibilities

  • Lead Technical Deployments: Drive end-to-end technical deployments for GPU neocloud and AI Factory customers, from initial bare metal configuration to a validated vCluster environment
  • Infrastructure Optimization: Configure and troubleshoot bare metal GPU node infrastructure, including CNI configuration, GPU Operator setup, distributed storage backends, and RDMA/InfiniBand
  • Validation: Deploy and validate Kubernetes and vCluster to provide GPU-powered managed K8s
  • Knowledge Transfer: Work alongside customer teams to build self-sufficiency, ensuring they can operate and grow the platform independently
  • Scaling through Documentation: Document reusable playbooks and deployment architectures so your learnings become the next customer's head start
  • Feedback Loop: Collaborate with Engineering and Product to surface recurring infrastructure challenges, acting as a direct feedback loop from the field into the roadmap
  • Strategic Partnering: Join Sales in the pre-sales process where deep infrastructure work is required to achieve a meaningful proof of value

Skills

  • 5+ years of experience deploying and operating Kubernetes in production, ideally on bare metal or in high-complexity environments
  • Practical knowledge of NVIDIA GPU Operators, CUDA tooling, and systems-level configuration for GPU nodes
  • Deep understanding of CNI plugins, overlay networks, load balancing, and connectivity diagnosis in layered environments
  • Experience with persistent volume configuration, CSI drivers, and distributed systems like Ceph, Rook, Weka, or Longhorn
  • Comfort operating in ambiguous, fast-moving environments where you are often writing the playbook in real time
  • You thrive in environments that reject legacy tech and prefer a modern stack where you can solve a variety of problems from pipelines to internal services
  • Experience writing automation scripts with Bash, Python, or Go
  • Relevant certifications such as CKA (Certified Kubernetes Administrator) or experience writing Kubernetes Operators
  • Experience with inference serving, GPU scheduling, and the tooling around LLM deployment
  • Experience building AI Automation in documentation to contribute to a shared knowledge base

Benefits

  • Offers Equity
  • Offers Bonus
  • Health, dental, vision, and life Insurance, including plans for you and eligible dependents (benefits vary depending on country)
  • Flexible Working Schedule: You have a doctor’s appointment or need to head to the supermarket to get groceries at 2pm? We won’t have an issue with that. To us, results matter more than clocking in and out at the same time every day.
  • Workplace Flexibility: We’re very flexible about where you work. We know things can change in life and we’re happy to adjust the work environment for you along the way.

Company Overview

  • vCluster helps companies build flexible infrastructure tenancy for GPU and AI infra as well as for K8s in private, public and hybrid clouds. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees. Its website is https://vcluster.com/.
  • Company H1B Sponsorship

  • vCluster has a track record of offering H1B sponsorships, with 1 in 2024. Please note that this does not guarantee sponsorship for this specific role.
  • More open positions

    [Remote] Clinical Project Manager - Future Opportunities

    Work from home Full-time role

    [Remote] Float Clinical Research Coordinator II

    Work from home Full-time role

    [Remote] Platform Engineer II/III

    Work from home Full-time role

    [Remote] Senior Enterprise Account Executive, Northwest Territory

    Work from home Full-time role

    [Remote] Senior Account Executive - Commercial ITSM Sales

    Work from home Full-time role

    Reporting Operations Associate

    Work from home Full-time role

    Occupational Therapist (OT) – Clinical Record Review Consultant (Remote/PRN)

    Work from home Full-time role

    Adjunct Faculty - Education

    Work from home Full-time role

    Entry-Level Remote Data Entry & Research Participant Specialist – Flexible Hours, High Earnings Potential

    Work from home Full-time role

    Medical Strategist

    Work from home Full-time role

    Remote Customer Service Representative – careerzynith Call Center & Patient Scheduling Specialist (Full‑Time, Temp‑to‑Perm, 8 AM – 5 PM EST)

    Work from home Full-time role

    [Remote] Data Research Engineer

    Work from home Full-time role

    Manager, Customer Success Team (Mid-Market)

    Work from home Full-time role

    VP, Management Supervisor

    Work from home Full-time role

    Mid Graphic/Motion Designer (Part-Time, Remote LATAM)

    Work from home Full-time role

    Customer Service Representative

    Work from home Full-time role

    Entry-Level Remote Live Chat Support Specialist – Customer Care at careerzynith – $25‑$35/hr

    Work from home Full-time role

    Remote Customer Support Representative – Seasonal Work From Home Opportunity with careerzynith (Phone, Written, or Full Support)

    Work from home Full-time role

    Digital Travel Advisor (59285)

    Work from home Full-time role

    Experienced Medical Intake / Data Entry Specialist – Healthcare Administration Support

    Work from home Full-time role

    IT Security Analyst (Cyber Security)-Remote

    Work from home Full-time role