AMD logo

Full Stack Automation Tech Lead - Data Center GPU

AMD

Austin, TX
Full Time
Senior
10 days ago

Job Description

About the Role

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Our growing team plays a major role in architecting and shaping every high-performance computing GPU offered by AMD. As a key member of our Automation Solutions team, you will help shape the future of compute platforms by building scalable, resilient, and intelligent systems that support validation, automation, lab fleet management, and data-driven decision-making. You will work across full stack development, automation frameworks, DevOps, and data engineering to deliver solutions that power AI/HPC silicon validation at scale. Your work will also contribute to open-source initiatives and cross-functional strategic programs.

Key Responsibilities

  • Develop a deep understanding of the wider automation development space and its stakeholders both within DCGPU and across the company to drive holistic requirements, architecture and roadmaps.
  • Work with post silicon teams and leadership to identify upcoming validation program needs and user base needs.
  • Collaborate with DevOps, data engineering, and validation teams to align architecture, tooling, and best practices.
  • Lead strategic initiatives that span multiple business units and engineering domains.
  • Contribute to the development of internal tools and frameworks that may be released as open-source or shared externally with partners.
  • Mentor junior and senior engineers, fostering a culture of technical excellence and continuous improvement.
  • Improve scalability, resiliency, and cost-effectiveness of frameworks and cloud-based services.

Requirements

  • Extensive experience in full stack development with Python and Angular or similar technologies.
  • Proficiency with containerization and orchestration technologies (Docker, Kubernetes) and managing compute clusters (e.g., Slurm).
  • Experience with SQL and database design.
  • Proven track record in using or managing cloud infrastructure (e.g., AWS, Azure, Google Cloud) and/or on-premises edge infrastructure.
  • Background in DevOps practices, CI/CD pipelines, and infrastructure as code (e.g., Ansible, Terraform).
  • Experience with data engineering tools (e.g., Spark, Kafka, Airflow) and database systems (SQL/NoSQL).
  • Proficiency in Linux, including system administration and scripting.
  • Excellent problem-solving skills and the ability to work both independently and collaboratively.
  • Proven track record of mentoring engineers and driving cross-functional collaboration.
  • Strong communication skills to effectively convey technical concepts to diverse audiences.
  • Experience working in HPC environments.
  • Experience owning and releasing high-availability production services with strict uptime requirements.

Nice to Have

  • Background in HPC environments.
  • Experience owning and releasing high-availability production services with strict uptime requirements.

Qualifications

  • Bachelor's or master's degree in Electrical/Computer Engineering, Mathematics, Computer Science or an equivalent preferred.

Benefits & Perks

  • Benefits offered are described: AMD benefits at a glance.

Working at AMD

We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

Apply Now

Job Details

Posted AtJun 12, 2025
SalaryCompetitive salary
Job TypeFull Time
ExperienceSenior

About AMD

Website

amd.com

Company Size

10000+ employees

Location

Austin, TX

Industry

Semiconductor and Other Electronic Component Manufacturing

Get job alerts

Set up personalized alerts for your job search and get tailored job digests for close matches