Google logo

Staff Software Engineer, GPU Infrastructure, Operating Systems

Google

Sunnyvale, CA
Full Time
Senior
197k-291k
12 days ago

Job Description

About the Role

Google Cloud's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. In this role, you will work on a specific project critical to Google Cloud's needs, with opportunities to switch teams and projects as the business evolves. You will apply your expertise in NVIDIA GPU architectures and Operating System internals to develop foundational software for the Google distributed cloud team, enabling efficient AI, ML, and HPC workloads on NVIDIA GPUs across distributed, on-premises, and edge environments. The role involves collaboration with hardware, kernel, Kubernetes, and AI/ML teams to ensure optimal performance, scalability, and stability.

Key Responsibilities

  • Integrate and optimize NVIDIA GPU software for Google Distributed Cloud's Kubernetes-native orchestration layer, including CRDs and operators.
  • Develop Linux kernel modules, device drivers, and system software to enable secure GPU-OS interaction.
  • Identify and resolve performance bottlenecks across the GPU stack, optimizing data transfer, memory, and power efficiency.
  • Build fault-tolerant GPU life-cycle management solutions for high availability and security best practices.
  • Validate and support new NVIDIA GPU interfaces, ensuring compatibility and performance.

Requirements

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience in Cloud Infrastructure Systems and Distributed Systems architecture, including deployment, scaling, reliability, and performance optimization.
  • Experience implementing security best practices for cloud environments and working with Compliance frameworks.
  • Experience with NVIDIA GPU architectures and Operating System internals.
  • Ability to develop and optimize system software, including kernel modules and device drivers.
  • Knowledge of container orchestration, specifically Kubernetes, including cluster management and workload deployment.
  • Understanding of core networking principles (e.g., TCP/IP, DNS, load balancing) and cloud-based storage solutions.
  • Experience managing the life-cycle of infrastructure components from provisioning to secure decommissioning.

Nice to Have

  • Experience with data center architecture and its impact on large-scale infrastructure deployments.

Qualifications

  • Formal educational background with a Bachelor's degree or equivalent experience.

Benefits & Perks

  • Competitive salary range of $197,000-$291,000 plus bonus, equity, and benefits.
  • Work environment that supports innovation and growth.
  • Opportunities to work on cutting-edge cloud and GPU technologies.

Working at Google

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.

Apply Now

Job Details

Posted AtJul 12, 2025
Salary197k-291k
Job TypeFull Time
ExperienceSenior

Job Skills

AI Insights

Key skills identified from this job posting

Sign upto access all insights for this job

About Google

Website

google.com

Location

Sunnyvale, CA

Industry

Web Search Portals and All Other Information Services

Get job alerts

Set up personalized alerts for your job search and get tailored job digests for close matches