• Trigg@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 months ago

    Also what’s more American than taking a loss to under cut competition and then hiking when everyone else goes out of business

  • marcos@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 months ago

    to make american ai unprofitable

    Lol! If somebody manage to divide the costs by 40 again, it may even become economically viable.

    • alcoholicorn@lemmy.ml
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      5 months ago

      spying

      ChatGPT was only released as SaaS, every thing you use it for goes through OpenAI’s servers.

      Deepseek was open-sourced, you can run it on a local machine where it is physically impossible for China to spy on you.

      They also released how they trained Deepseek, so you could even make your own Deepseek, as these guys are doing.

      • Mubelotix@jlai.lu
        link
        fedilink
        English
        arrow-up
        0
        ·
        5 months ago

        Of course chatgpt is saas, because it’s a service built on top of a gpt model, which they made public

            • alcoholicorn@lemmy.ml
              link
              fedilink
              English
              arrow-up
              0
              arrow-down
              1
              ·
              edit-2
              5 months ago

              Roflmasterbigpimp accused the Chinese company of spying.

              I pointed out that the Chinese company can’t spy becaause it’s model was open source and could be locally run, while the US company set up its operation to allow it to spy on any use of its model.

              Mubelotix claimed ChatGPT made their model public, which would only be relevant to the conversation as a evidence that the US is not spying either.

              I said the model isn’t public.

              • Treczoks@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                5 months ago

                Well, there is an Android client that sends keystrokes (and loads of other data) back to Chinese servers. Which very much fulfils my definitions of spying.

                • alcoholicorn@lemmy.ml
                  link
                  fedilink
                  English
                  arrow-up
                  0
                  arrow-down
                  1
                  ·
                  5 months ago

                  The app uses Deepseek’s servers. It physically could not function if it didn’t send your input data to their servers.

                  What other data does it send?

  • hoshikarakitaridia@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    5 months ago

    It’s models are literally open source.

    People have this fear of trusting the Chinese government, and I get it, but that doesn’t make all of china bad. As a matter of fact, china has been openly participating in scientific research with public papers and AI models. They might have helped ChatGPT get to where it’s at.

    Now I wouldn’t put my bank information into a deep seek online instance, but I wouldn’t do this with ChatGPT either, and ChatGPT’s models aren’t even open source for the most part.

    I have more reasons to trust deep seek as opposed to chatgpt.

    • SkyeStarfall@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      1
      ·
      5 months ago

      Yeah. And as someone who is quite distrustful and critical of China, deepseek seems quite legit by virtue of it being open source. Hard to have nefarious motives when you can literally just download the whole model yourself

      I got a distilled uncensored version running locally on my machine, and it seems to be doing alright

        • Binette@lemmy.ml
          link
          fedilink
          English
          arrow-up
          1
          ·
          5 months ago

          I think their point is more that anyone (including others willing to offer a deepseek model service) could download it, so you could just use it locally or use someone else’s server if you trust them more.

    • vrighter@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      1
      ·
      5 months ago

      It’s just free, not open source. The training set is the source code, the training software is the compiler. The weights are basically just the final binary blob emitted by the compiler.

      • Fushuan [he/him]@lemm.ee
        link
        fedilink
        English
        arrow-up
        0
        arrow-down
        1
        ·
        5 months ago

        That’s wrong by programmer and data scientist standards.

        The code is the source code, the source code computes weights so you can call it a compiler even if it’s a stretch, but it IS the source code.

        The training set is the input data. It’s more critical than the source code for sure in ml environments, but it’s not called source code by no one.

        The pretrained model is the output data.

        Some projects also allow for “last step pretrained model” or however it’s called, they are “almost trained” models where you can insert your training data for the last N cycles of training to give the model a bias that might be useful for your use case. This is done heavily in image processing.

        • vrighter@discuss.tchncs.de
          link
          fedilink
          English
          arrow-up
          1
          ·
          5 months ago

          no, it’s not. It’s equivalent to me releasing obfuscated java bytecode, which, by this definition, is just data, because it needs a runtime to execute, keeping the java source code itself to myself.

          Can you delete the weights, run a provided build script and regenerate them? No? then it’s not open source.

          • Fushuan [he/him]@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            ·
            5 months ago

            The model itself is not open source and I agree on that. Models don’t have source code however, just training data. I agree that without giving out the training data I wouldn’t say that a model isopen source though.

            We mostly agree I was just irked with your semantics. Sorry of I was too pedantic.

  • Hackerman_uwu@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    5 months ago

    We literally are at the stage where when someone says: “this is a psyop” then that is the psyop. When someone says: “these drag queens are groomers” they are the groomers. When someone says: “the establishment wants to keep you stupid and poor” they are the establishment who want to keep you stupid and poor.

    • Krudler@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 months ago

      It’s so important to realize that most of “the establishment” are the pawns who are just as guilty. Thank you.

      • Carighan Maconar@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        5 months ago

        Also “The establishment” when used in accusations can be replaced by “Rich bastards and right-wingers” and the accusations are usually spot on. Child abuse, sexual assault, market manipulation, bribery, always checks out perfectly.

        • Krudler@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          5 months ago

          That’s where I feel our perspectives diverge. I think there is a sickness that is rooted within the heart of the human experience and it is a fear-based compulsion based on power/control. Some (the rich) have means to express this sickness in larger domains, but it is a characteristic that seems to be independent of wealth. I think there are people with this illness, and those without, and those somewhere on a curve.