Data Engineer - Manufacturing
Company: ChatGPT Jobs
Location: Location not specified (Remote)
Type: Full-time
Level: Senior
Remote: Yes
Posted: 2026-02-23
About this role
Job Description
Data Engineer - Manufacturing
Foundation EGI
Remote Nationwide,
- Remote
- Full-Time
Job Description
We are an MIT-born, venture-backed Silicon Valley startup building Engineering General Intelligence (EGI)—an AI Copilot for design and manufacturing. Our mission is to fundamentally reinvent how physical products are designed and built, dramatically accelerating the pace of product development.
As an Individual Contributor on the Data Studio team, you will play a key role in transforming raw customer data into structured, high-fidelity datasets that power model training, evaluation, and customer delivery. This role is deeply hands-on and sits at the intersection of product, research, and engineering. You will apply your mechanical engineering and manufacturing expertise to create data pipelines, labeling workflows, reference models, and quality checks that ensure the accuracy and reliability of our AI systems. Mechanical engineering or manufacturing design experience is essential; candidates without this background will not be considered.
Key Responsibilities
- Data Creation, Processing & Quality
- Ingest, clean, transform, and structure customer and internally generated engineering data for AI training and inference.
- Design and build high-quality mechanical components and assemblies in CAD to serve as authoritative ground truth for evaluating and training AI systems.
- Produce labeled datasets, reference designs, annotations, exploded views, sequences, and other engineering artifacts that encode real-world reasoning.
- Apply engineering judgment to define and assess output quality across datasets.
- Continuously refine standards for metadata, annotation, and model quality, maintaining a living “definition of quality” for ME datasets.
- Workflow & Tooling Contributions
- Collaborate with Product Managers to shape tooling used for annotation, data correction, model-output review, and pipeline automati...