Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther
Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.
Full article here.
Link to the full leaked list download: Meta leaked list pdf
Poison thy well comrades. Become more unhinged /s
Take away that /s, it’s praxis now!
Toothpaste makes an excellent fuel additive. I suggest it to all customers who come through my small engine repair business. They love me for it.
Hexbear is on there too.
Lol rip to the AI that trains on my ramblings.
if they want to send the message that every slave owner should have been hanged to every boomer on Facebook, who am I to say no
Noooo my contentarinos nooooo
Fuck yeah! My “Bigfoot is actually a big cellar spider and that’s why it’s always blurry in pictures” theory is gonna be broadcast to everyone’s grandmother!
The bot trained on hexbear and lemmygrad vs the bot trained on .world:
Damn zuckbot’s gonna end up being a commie-bot that posts absurdist memes about beans if it’s harvesting hexbear posts for content
The AI wasting hours of processing power having an internal struggle session re: outdoor cats before simply replying with “:pigpoopballs” on a platform that doesn’t have that emoji
Imagine being a techbro talking to your meta ai chatbot and he says “unlimited genocide on the first world, start jihad on krakkker entity”
lemmygrad
imagining Zuck launching his “everybody gets ten virtual friends” initiative and accidentally re-radicalizing your parents and grandparents in the other direction.
I’ll be upping my use of Maoist Standard English and
in response this revelation.
You need a shower after you accidentally crap on your own balls.
Showers are bourgeois decadence
Honestly, I already figured my posts probably were being used to train a LLM without my consent.
I’m more concerned about the non-consensual scraping causing excess load on the servers. The taking of content without license to train their energy-wasting autocomplete that is being used to for little commercially but to try to cheapen labor and pocket the money is a problem too. But I hate having servers impacted by their bullshit.
Glad i scrubbed my reddit account in 2020
Going straight to palantir
Unpopular opinion but social media has always been fundamentally public.
Unless they’re scraping private dm’s on encrypted devices, this should come as no surprise to anyone.
The good news is that nobody has exclusive right to data on federated platforms, unlike other sites that will ransom their user’s data for private use. Let’s not forget that many of us migrated here because the other site wanted to lock down their api and user data so that they could auction it to google for profit.
many of us migrated here because the other site wanted to lock down their api and user data so that they could auction it to google for profit.
The venn diagram of people who did this and “liberals who would have been fine staying on reddit rather than make a site exactly like reddit” is a circle
Probably because this is one of the places where you can actually get reliably human interactions. Really important to keep models healthy.
Ahahahahaha, so it’s going to be a self-hating Meta AI bot?
@Sal@mander.xyz We made the list. 😎 lmao
Ahh, really?! Thanks for letting me know. I will see if there is something I can do to throttle that after holidays. Curious to see what solutions others come up with
I think Science Memes may make it halucinate more, tbf.
I hate the internet now
This explains our instance having perf issues.