Someone Made a Dataset of One Million Bluesky Posts for 'Machine Learning Research'

Stopthatgirl7@lemmy.world · 11 hours ago

foremanguy@lemmy.ml · 6 hours ago

If you post something publicly, it will be used to train AI. Still, how a company handles privacy says a lot about that company.

Brumefey@sh.itjust.works · 5 hours ago

I don’t know why social media is used for training. It’s like the worst-quality data ever, and it results in answers like “go kill yourself” when the model is prompted about something sad…

foremanguy@lemmy.ml · 26 minutes ago

They are used because they are examples of “real life” (not really, but you know) conversation.