Zurück zur Stellenangebote
XX
Synthetic Data Engineer (AI Data/Training)Hyphen ConnectUnited States
XX

Synthetic Data Engineer (AI Data/Training)

Hyphen Connect
  • US
    United States
  • US
    United States

Über

Synthetic Data Engineer (AI Data/Training)
Oregon, USA We are seeking a talented and innovative Synthetic Data Engineer. In this role, you will design and implement domain-specific synthetic data generation pipelines, ensuring high-quality data management for training loops. Your expertise will drive the success of data processing and model training within the organization. Responsibilities: Design domain-specific synthetic data generation (SDG) pipelines via self-instruct and constitutional prompting. Implement automated quality scoring and de-duplication systems. Manage data pipelines that feed directly into SFT and DPO training loops. Qualifications: Proven experience building large-scale data pipelines (Airflow, Spark, Ray). Deep knowledge of prompt engineering for data generation. Familiarity with dataset distillation and bias mitigation.
  • United States

Sprachkenntnisse

  • English
Hinweis für Nutzer

Dieses Stellenangebot stammt von einer Partnerplattform von TieTalent. Klick auf „Jetzt Bewerben”, um deine Bewerbung direkt auf deren Website einzureichen.