Data Engineer II

Talentburst

Cupertino, CA
Contract
Mid Level
5 days ago

Job Description

About the Role

The Data Engineer II role focuses on designing, building, and maintaining scalable data pipelines for processing large-scale audio and acoustic datasets. The position involves collaboration with machine learning researchers and acoustic scientists to support data collection, annotation, transformation, and curation, with an emphasis on audio signal processing and cloud infrastructure deployment.

Key Responsibilities

  • Design, build, and maintain scalable and efficient data pipelines for processing large-scale audio and acoustic datasets.
  • Collaborate with ML researchers and acoustic scientists to collect, annotate, transform, and curate high-quality training and evaluation datasets.
  • Implement signal processing algorithms for feature extraction.
  • Work on real-time and batch processing frameworks for streaming and static audio data.
  • Support model training and evaluation through optimized data loaders and preprocessing steps.
  • Ensure data quality, versioning, and reproducibility using best practices in data engineering.
  • Deploy and maintain cloud-based infrastructure for data workflows (e.g., AWS, Google Cloud Platform, Azure).
  • Develop tools for data visualization and annotation specific to acoustic events.

Requirements

  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, Acoustics, or a related field.
  • Strong experience with audio signal processing libraries (e.g., Librosa, PyDub, SciPy, torchaudio).
  • Proficiency in Python and relevant data engineering frameworks (e.g., Airflow, Apache Beam, Spark).
  • Experience working with large-scale data pipelines and cloud infrastructure.
  • Familiarity with machine learning workflows, especially in audio or time-series domains.
  • Understanding of acoustic features, audio file formats (e.g., WAV, FLAC), and sampling rates.
  • Strong knowledge of databases, data storage formats (e.g., Parquet, HDF5), and data management tools.

Nice to Have

  • Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) for audio modeling.
  • Knowledge of acoustic modeling, speech recognition, or sound classification.
  • Experience with edge deployment and real-time audio processing.
  • Familiarity with tools like Weights & Biases, MLflow, or DVC for ML operations.

Job Details

Posted At: Jul 18, 2025
Job Category: Data Engineering
Salary: Competitive salary
Job Type: Contract
Work Mode: Onsite
Experience: Mid Level

About Talentburst

Website: linkedin.com
Location: Cupertino, CA
Industry: Temporary Help Services
