Share your thoughts/anecdotes on:
- Biggest recurring bottleneck (collection, cleaning, labeling, drift, compliance, etc.)
- Whether RLHF or synthetic data has actually cut your need for fresh domain data.
- Hard-to-source domains (finance, healthcare, logs, multimodal, whatever) and why they're hard to source.
- Tasks you’d automate first if you could.