site banner

Small-Scale Question Sunday for February 18, 2024

Do you have a dumb question that you're kind of embarrassed to ask in the main thread? Is there something you're just not sure about?

This is your opportunity to ask questions. No question too simple or too silly.

Culture war topics are accepted, and proposals for a better intro post are appreciated.

3
Jump in the discussion.

No email address required.

So reddit is selling data to AI training now according to Bloomberg. [Archive] . Well I for one isn't suprised! A huge bunch mediocre data nicely cleaned up by unpaid mods and community sentiment encoded in the karma. When I left reddit I removed all my posts and comments. I lost my trust way before the API debacle and seeing what is happening now, it just validates me. Anyone else?

I don't understand the rationale behind this line of thought. If you are concerned(paranoid) about what data is used to train LLMs, you should also know that PushShift database exists, or that no production database out there in the wild doesn't have multiple replicas of itself at various timestamps.

Why do you not want your comments used to train LLMs anyways?

Why do you not want your comments used to train LLMs anyways?

well my concern isn't around LLMs in itself, it is slightly more abstract possible abuse of this data for behavior modification of crowds. I'm not an AGI doomer but I see the outlines already with inciting compulsive use with various apps like YouTube, TikTok and Instagram where it might be possible to use data like reddits to create similar compulsive loops for text as we have for video. I don't know if it is possible but it might be.

But my comment is more of that I made a decision a couple of years ago and this just proves that don't give a shit of the people who use their service, so I'm patting my own back.