Senior Engineering Manager, ML Performance and Observability Services

Google

Sunnyvale, CA
Full Time
Director
$248k-$349k
15 days ago

Job Description

About the Role

The Senior Engineering Manager for ML Performance and Observability Services at Google is responsible for leading the development of infrastructure and tools that help ML developers understand and optimize their workloads. This role involves managing multiple engineering teams, shaping technical strategy, and overseeing large-scale projects across various Google services and Google Cloud, with a focus on reliability, scalability, and security in a hyperscale environment.

Key Responsibilities

  • Manage and build team(s) of software engineers developing debugging, observability, performance diagnostics, and related tools and services for ML developers.
  • Develop the long-term technical vision and roadmap to meet future infrastructure needs in the fast-moving ML space.
  • Lead AI/ML technical strategy, large-scale infrastructure optimization, and specialized solution design for internal and third-party developers.
  • Collaborate closely with cross-functional Product Managers, Technical Program Managers, peer Engineering Managers, and multiple Google product areas and GCP customers.
  • Architect services with native JAX/PyTorch support for ML users and open-source third-party services to foster an external developer community.
  • Oversee the deployment of large-scale projects across multiple sites internationally.

Requirements

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience with software development.
  • 7 years of experience leading technical project strategy, ML design, and optimizing ML infrastructure (e.g., model deployment, evaluation, data processing, debugging, fine-tuning).
  • 5 years of experience in a technical leadership role overseeing projects and managing teams.
  • Experience designing, developing, and servicing enterprise products emphasizing reliability, scalability, and ease of use.

Nice to Have

  • Master's degree or PhD in Engineering, Computer Science, or a related field.
  • 5 years of experience working in a complex, matrixed organization.
  • Experience with ML infrastructure, ML performance/optimization, ML architecture, or related ML fields.

Qualifications

  • Formal educational background in engineering or computer science (implied by degree requirements).

Benefits & Perks

  • US base salary range of $248,000-$349,000 plus bonus, equity, and benefits.
  • Comprehensive benefits package as detailed at https://careers.google.com/benefits/.

Working at Google

Google values diversity and is an equal opportunity workplace, committed to inclusive employment regardless of race, gender, age, disability, or background. The company emphasizes security, efficiency, and reliability across its hardware and software infrastructure, fostering a collaborative environment that drives innovation in AI and ML technologies.

Job Details

Posted At: Jul 9, 2025
Salary: $248k-$349k
Job Type: Full Time
Experience: Director

About Google

Website

google.com

Location

Sunnyvale, CA

Industry

Web Search Portals and All Other Information Services
