• Alex@feddit.ro
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    6 months ago

    what about instances that need you to be logged in to view posts and require authorized requests for federation?

    • kalpol@lemm.ee
      link
      fedilink
      English
      arrow-up
      6
      ·
      6 months ago

      All it needs is an account to access troves of training data?

        • Saledovil@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          5
          ·
          6 months ago

          How restrictive do you want to be with the accounts? If you’re too restrictive, there won’t be enough users. If you’re not restrictive enough, the data will be used for AI training.

    • Aux@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      3
      ·
      6 months ago

      That defeats the purpose of a knowledge base. The whole reason why everyone is using SO is that you don’t need an account to access it and it’s fully indexed by Google.

      The real question is why the fuck are people ok with Google indexing SO and not OpenAI? Doesn’t make any fucking sense.

      • irreticent@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        6 months ago

        The real question is why the fuck are people ok with Google indexing SO and not OpenAI? Doesn’t make any fucking sense.

        Because Google is free and OpenAI isn’t. It’s one thing to take free content, index it, then allow anyone to access that index. It’s another thing when you take free content, index it, then hide that index behind a paywall.

        • Aux@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          4
          ·
          6 months ago

          Are you sure? Because Google is not free at all, you’re paying for it through privacy invasion and ads. While ChatGPT is actually free to use for end users - no ads, nothing.

              • Zacryon@lemmy.wtf
                link
                fedilink
                English
                arrow-up
                4
                arrow-down
                1
                ·
                6 months ago

                It’s “freemium”, not free. There is a difference. You can’t use ChatGPT 4 without paying as well as the API. Also, you are limited in the number of prompts you can make per hour before you are put on pause and asked to pay.

                Search engines like Ecosia, DuckDuckGo, etc. don’t ask you for money. Regardless how intensively you use it. (They might come with other drawbacks though like Google with privacy, environment, ethical principles, …)

                  • Zacryon@lemmy.wtf
                    link
                    fedilink
                    English
                    arrow-up
                    4
                    arrow-down
                    1
                    ·
                    6 months ago

                    I’ve never been asked to pay for using one of the aforementioned search engines. I have been asked to pay for OpenAI products.

                    So I don’t see how you come to that conclusion.

              • irreticent@lemmy.world
                link
                fedilink
                English
                arrow-up
                3
                arrow-down
                1
                ·
                edit-2
                6 months ago

                What does Google’s cloud service have to do with what we’re discussing (Google indexing content vs. SO OpenAI doing it)? They’re not even similar services.

                Edit: SO -> OpenAI

          • jnk@sh.itjust.works
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            1
            ·
            6 months ago

            The price difference is that google steals your data. That’s it. OpenAI steals data, ask for money to use most of their models, and buy even more data from other companies stealing user data (like google and SO). Also indexing web pages is not even the “stealing” part of google, it’s just not comparable.

            Yes, training AI on user data for free then selling the end product is a reasonable thing to be concerned about. It’d be different if the product was free or the data was sold to them with user consent.

            SO has announced a subscription-based service trained on user data for free, and not only there’s not even opt-out, they’re mass-banning users for trying to “opt-out” manually. Tell me one thing here that’s not completely fucked up.