The first users of this dataset will be Big Tech corps. Meta, Alphabet, OpenAI, ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		madduci 33 days ago \| parent \| context \| favorite \| on: Backing up Spotify The first users of this dataset will be Big Tech corps. Meta, Alphabet, OpenAI, Microsoft, Apple will all be happy to use this dataset for training their LLMs. For them, 300TB is just cheap

ipsum2 33 days ago [–]

They already have this data. See jukebox from OpenAI, released before chatgpt.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact