Dropsitenews published a list of websites Facebook uses to train its AI. Multiple Lemmy instances are on the list, as noticed by user BlueAEther.

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

    • CloutAtlas [he/him]@hexbear.net
      English
      arrow-up
      26
      arrow-down
      1
      ·
      3 months ago

      The AI wasting hours of processing power having an internal struggle session re: outdoor cats before simply replying with “:pigpoopballs” on a platform that doesn’t have that emoji

  • Sandouq_Dyatha@lemmy.ml
    English
    arrow-up
    33
    ·
    3 months ago

    Imagine being a techbro talking to your meta ai chatbot and he says “unlimited genocide on the first world, start jihad on krakkker entity”

  • Carl [he/him]@hexbear.net
    English
    arrow-up
    32
    ·
    edit-2
    3 months ago

    lemmygrad

    imagining Zuck launching his “everybody gets ten virtual friends” initiative and accidentally re-radicalizing your parents and grandparents in the other direction.

    • nickwitha_k (he/him)@lemmy.sdf.org
      arrow-up
      14
      ·
      3 months ago

      I’m more concerned about the non-consensual scraping causing excess load on the servers. Taking content without a license to train their energy-wasting autocomplete, which is used commercially for little besides trying to cheapen labor and pocket the money, is a problem too. But I hate having servers impacted by their bullshit.

  • anarchiddy@lemmy.dbzer0.com
    English
    arrow-up
    23
    ·
    3 months ago

    Unpopular opinion but social media has always been fundamentally public.

    Unless they’re scraping private DMs on encrypted devices, this should come as no surprise to anyone.

    The good news is that nobody has exclusive right to data on federated platforms, unlike other sites that will ransom their users’ data for private use. Let’s not forget that many of us migrated here because the other site wanted to lock down their api and user data so that they could auction it to google for profit.

    • LeeeroooyJeeenkiiins [none/use name]@hexbear.net
      English
      arrow-up
      7
      ·
      3 months ago

      many of us migrated here because the other site wanted to lock down their api and user data so that they could auction it to google for profit.

      The venn diagram of people who did this and “liberals who would have been fine staying on reddit rather than make a site exactly like reddit” is a circle

  • HiddenLayer555@lemmy.ml
    English
    arrow-up
    15
    ·
    3 months ago

    Probably because this is one of the places where you can actually get reliably human interactions. Really important to keep models healthy.

    • Salamander@mander.xyz
      arrow-up
      5
      ·
      3 months ago

      Ahh, really?! Thanks for letting me know. I will see if there is something I can do to throttle that after holidays. Curious to see what solutions others come up with

  • flamingos-cant (hopepunk arc)@feddit.uk
    English
    arrow-up
    6
    ·
    3 months ago

    There’s like half a dozen feddits and somehow feddit.uk is the only one to make it onto this?

    Here’s a list of feddit.uk’s linked instances that appear in the list:

    List of instances
    beehaw.org
    furry.engineer
    ibe.social
    fediworld.de
    framatube.org
    trailers.ddigest.com
    nrw.social
    lemmynsfw.com
    video.hardlimit.com
    digitalcourage.social
    xn--baw-joa.social
    tube.kockatoo.org
    equestria.social
    wisskomm.social
    social.anoxinon.de
    freiburg.social
    toobnix.org
    toot.bike
    mstdn.lalafell.org
    peertube.linuxrocks.online
    social.rebellion.global
    mastodon.cipherbliss.com
    social.sdf.org
    corteximplant.com
    typo.social
    www.404media.co
    mastodon.ml
    video.liberta.vip
    tilvids.com
    todon.eu
    hessen.social
    digipres.club
    shigusegubu.club
    mastodon.me.uk
    zdf.social
    mastodon.sdf.org
    spore.social
    kolektiva.media
    gruene.social
    share.tube
    nso.group
    mastouille.fr
    masto.es
    vivaldi.com
    literatur.social
    mstdn.mx
    kirche.social
    mastodon.hams.social
    federation.network
    lile.cl
    todon.nl
    betweenthelions.link
    ipv6.social
    linuxrocks.online
    peertube.otakufarms.com
    pawb.social
    mastodon-belgium.be
    jasette.facil.services
    machteburch.social
    mastodont.cat
    mastodon.eus
    eupolicy.social
    social.bau-ha.us
    toot.berlin
    amicale.net
    hexbear.net
    mastodon.bida.im
    reddthat.com
    shelter.moe
    mastodon.nl
    dju.social
    bonn.social
    mstdn.chrisalemany.ca
    social.sciences.re
    tldr.nettime.org
    lemy.lol
    climatejustice.social
    rollenspiel.social
    mastodon.org.uk
    social.kyiv.dcomm.net.ua
    pouet.chapril.org
    ecoevo.social
    social.politicaconciencia.org
    darmstadt.social
    peertube.tv
    lemmus.org
    libretooth.gr
    hackers.town
    tooter.social
    anarchism.space
    diode.zone
    video.infosec.exchange
    mastodon.thirring.org
    aussie.zone
    social.bund.de
    apobangpo.space
    shitpost.cloud
    berlin.social
    toot.aquilenet.fr
    social.beachcom.org
    lemmygrad.ml
    mastodon.radio
    nerdculture.de
    programming.dev
    decayable.ink
    kafeneio.social
    functional.cafe
    things.uk
    fuzzies.wtf
    diaspodon.fr
    dalek.zone
    sunbeam.city
    tooting.ch
    fediscience.org
    mastodon.tetaneutral.net
    social.librem.one
    im-in.space
    lemmy.sdf.org
    legal.social
    post.lurk.org
    mastodon.uy
    noc.social
    tube.pol.social
    lemmy.ml
    don.linxx.net
    infosec.pub
    kolektiva.social
    masto.bike
    furries.club
    zhub.link
    lemmy.world
    openbiblio.social
    mastodon.zaclys.com
    mamot.fr
    clacks.link
    discuss.tchncs.de
    cyberplace.social
    graz.social
    pl.kitsunemimi.club
    mastodonczech.cz
    masto.nobigtech.es
    hostux.social
    pawb.fun
    mastodon.trueten.de
    norden.social
    systemli.social
    mander.xyz
    ciberlandia.pt
    woem.men
    sopuli.xyz
    lemmy.ca
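
    A cross-check like the one above can be sketched as a set intersection. This is only a minimal illustration, not how the list above was actually produced: the function names and the sample inputs are hypothetical, and it assumes both lists are already available as plain hostnames.

```python
def normalize(domain: str) -> str:
    """Trim whitespace and lowercase a hostname so comparisons are consistent."""
    return domain.strip().lower()


def instances_in_list(linked_instances, leaked_domains):
    """Return the linked instances that also appear in the leaked list, sorted.

    Both arguments are assumed to be iterables of plain hostnames.
    """
    leaked = {normalize(d) for d in leaked_domains}
    linked = {normalize(d) for d in linked_instances}
    return sorted(linked & leaked)


# Hypothetical sample inputs, not the real data.
linked = ["lemmy.ml", "hexbear.net", "Lemmy.World ", "example.social"]
leaked = ["hexbear.net", "lemmy.world", "lemmy.ml", "beehaw.org"]

for host in instances_in_list(linked, leaked):
    print(host)
```

    Normalizing before comparing avoids missing a match over stray whitespace or capitalization. Anything beyond that (subdomains, "www." prefixes) would need its own rule.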