AI training data infrastructure, RLHF systems, and dataset engineering for high-performance AI models.
Updated Mar 31, 2026
Self-study notes from Chip Huyen's AI Engineering (ch.1-10) + an internal talk on dataset engineering for LLMs
🌱 Community-driven dataset projects for inclusive, safe, and open AI engineering.