AI Dose
0
Likes
0
Saves
Back to updates

[HN] Ask HN: How AI teams source and license training data?

Impact: 4/10
Swipe left/right

Summary

A researcher is actively investigating the practical, day-to-day processes AI teams use to source and license various types of training data, including text, audio, video, and synthetic data. They are conducting short interviews to gather real-world insights and are open to sharing their findings in return for participation or introductions. This initiative aims to understand the messy realities of data acquisition in AI development.

Continue Reading

Explore related coverage about community news and adjacent AI developments: [r/ML] [D] MYTHOS-INVERSION STRUCTURAL AUDIT, [r/LocalLLaMA] karpathy / autoresearch, [r/ML] [R] Agentic AI and Occupational Displacement: A Multi-Regional Task Exposure Analysis (236 occupations, 5 US metros), [r/ML] Building behavioural response models of public figures using Brain scan data (Predict their next move using psychological modelling) [P].

Related Articles

Comments

Sign in to leave a comment.

Loading comments...