• trungulox@lemm.ee
    link
    fedilink
    arrow-up
    42
    arrow-down
    10
    ·
    2 days ago

    Yes I have.

    With a model I fine tuned myself and ran locally on my own hardware.

    Suck it

    • utopiah@lemmy.world
      link
      fedilink
      arrow-up
      11
      arrow-down
      25
      ·
      2 days ago

      Just curious, do you know even as a rough estimation (maybe via the model card) how much energy was used to train the initial model and if so how do you believe it was done so in an ecologically justifiable way?

        • utopiah@lemmy.world
          link
          fedilink
          arrow-up
          4
          arrow-down
          6
          ·
          edit-2
          2 days ago

          Apologies for my sarcastic answer, I did actually search for that a little while ago so I do assume most people do know but that’s incorrect. The most useful tool I know of would probably be https://www.aspistrategist.org.au/uyghurs-for-sale-re-education-forced-labour-and-surveillance-beyond-xinjiang/ It wasn’t specific to children but it does show the process and I’d argue can be apply for the different criteria one would want to focus on. It dates back few years ago, when I learned about the problem so there also you might want to prefer a more up to date source.

          Let me know if you are looking for something more precise. I know of few other tools which do help better understand who builds what and how, for electronics but other products too.

        • utopiah@lemmy.world
          link
          fedilink
          arrow-up
          7
          arrow-down
          11
          ·
          2 days ago

          Straw-hat much or just learning about logistics and sourcing in our globalized supply chain?

          • surewhynotlem@lemmy.world
            link
            fedilink
            arrow-up
            9
            arrow-down
            2
            ·
            2 days ago

            Satirically pointing out that worrying about electricity usage for model creation is ridiculous.

            It’s already spent. The model exists. It’s probably MORE moral to use it as much as possible to get some positive value out of it. Otherwise it was just wasted.

            • utopiah@lemmy.world
              link
              fedilink
              arrow-up
              5
              arrow-down
              2
              ·
              edit-2
              2 days ago

              Yes indeed, yet my point is that we keep on training models TODAY so if keep on not caring, then we do postpone the same problem, cf https://lemmy.world/post/30563785/17400518

              Basically yes, use trained model today if you want but if we don’t set a trend then despite the undeniable ecological impact, there will be no corrective measure.

              It’s not enough to just say “Oh well, it used a ton of energy. We MUST use it now.”

              Anyway, my overall point was that training takes a ton of energy. I’m not asking your or OP or anyone else NOT to use such models. I’m solely pointing out that doing so without understand the process that lead to such models, including but not limited to energy for training, is naive at best.

              Edit: it’s also important to point out alternatives that are not models, namely there are already plenty of specialized tools that are MORE efficient AND accurate today. So even if the model took a ton of energy to train, in such case it’s still not rational to use it. It’s a sunk cost.

              • piecat@lemmy.world
                link
                fedilink
                arrow-up
                2
                arrow-down
                3
                ·
                1 day ago

                How much electricity was wasted for you to post, and us to receive, your human slop

      • mc900ftJesus@lemy.lol
        link
        fedilink
        arrow-up
        9
        arrow-down
        2
        ·
        edit-2
        2 days ago

        Just curious, do you know how much energy went into powering every computer and office room for 3 years while the latest videogame/hollyowood movie/etc was being made used up?

        Should we ban every single non-essential thing in the world or only the ones you don’t enjoy?

        And please hop-off Lemmy, do you know how much power the devs used to program this site!

        • utopiah@lemmy.world
          link
          fedilink
          arrow-up
          2
          ·
          1 day ago

          That’s been addressed few times already so I let you check the history if you are actually curious.

      • utopiah@lemmy.world
        link
        fedilink
        arrow-up
        6
        arrow-down
        14
        ·
        2 days ago

        Feel free to explain the down votes.

        If it wasn’t clear the my point was that self hosting addresses mostly privacy for the user but that is only one dimension addressed. It does not necessarily address the ecological impact. I was honestly hoping this community to care more.

        • Maalus@lemmy.world
          link
          fedilink
          arrow-up
          9
          arrow-down
          2
          ·
          2 days ago

          What’s clear is that you don’t realize how much energy AI actually uses up and you ate up propaganda that you are spreading right now. A querry that runs for 20s to generate an image on a card that uses up at most 350W/h during heavy gaming sessions isn’t magically going to doom the world. Chill out.

          • utopiah@lemmy.world
            link
            fedilink
            arrow-up
            3
            arrow-down
            7
            ·
            2 days ago

            I’ll assume you didn’t misread my question on purpose, I didn’t ask about inference, I asked about training.

            • Maalus@lemmy.world
              link
              fedilink
              arrow-up
              8
              arrow-down
              1
              ·
              2 days ago

              How much energy was used to bring the truckload of groceries into the shop that one time so hundreds of people can use it?

              • utopiah@lemmy.world
                link
                fedilink
                arrow-up
                2
                arrow-down
                5
                ·
                edit-2
                2 days ago

                Great point, so are you saying there is a certain threshold above which training is energetically useful but under which it is not, e.g. if training of a large model is used by 1 person, it is not sustainable but if 1 million people use it (assuming it’s done productively, not spam or scam) then it is fine?

                • Killer@lemmy.world
                  link
                  fedilink
                  arrow-up
                  6
                  arrow-down
                  1
                  ·
                  2 days ago

                  So you’re saying if 1 guy made 1 million results it would offset the training?

                  • utopiah@lemmy.world
                    link
                    fedilink
                    arrow-up
                    1
                    arrow-down
                    6
                    ·
                    2 days ago

                    Results? I have no idea what you are talking about. I thought we were discussing the training cost (my initial question) and that the truckload was an analogy to argue that the impact from that upfront costs is spread among users.

        • Ascend910@lemmy.ml
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          8
          ·
          edit-2
          2 days ago

          “I was honestly hoping this community to care more.”

          So you want us to all go destroy paintings in art gallery and super glue our hands to the road with you?

          • utopiah@lemmy.world
            link
            fedilink
            arrow-up
            7
            arrow-down
            4
            ·
            2 days ago

            Please, do whatever you want to protect the environment you cherish. My point though was literally asking somebody who did point a better way to do it if they were aware of all the costs of their solution. If you missed it, their answer was clear : they do not know and they do not care. I was not suggesting activism, solely genuinely wondering if they actually understood the impact of the alternative they showcased. Honestly, just do whatever you can.

          • Squirrelanna@lemmynsfw.com
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            1
            ·
            2 days ago

            You looking for an excuse? No one else brought it up but if you need permission, go ahead. Thumbs up from me.

          • utopiah@lemmy.world
            link
            fedilink
            arrow-up
            3
            arrow-down
            3
            ·
            2 days ago

            You know what, again maybe I’m misreading you.

            If you do want to help, do try with me to answer the question. I did give a path to the person initially mentioning the Model Card. Maybe you are aware of that but just in cased a Model Card is basic meta-data about a model, cf https://huggingface.co/docs/hub/model-cards

            Some of them do mention CO2 equivalent, see https://huggingface.co/docs/hub/model-cards-co2 so here I don’t know which model they used but maybe finding a way have CO2 equivalent for the most popular models, e.g DeepSeek, and some equivalent (they mentioned not driving a car) would help us all grasping at least some of the impact.

            What do you think?

      • trungulox@lemm.ee
        link
        fedilink
        arrow-up
        10
        arrow-down
        16
        ·
        2 days ago

        Don’t know. Don’t really care honestly. I dont pay for hydro, and whatever energy expenditures were involved in training the model I fine tuned is more than offset by the fact that I don’t and never will drive.

        • utopiah@lemmy.world
          link
          fedilink
          arrow-up
          10
          arrow-down
          5
          ·
          2 days ago

          Don’t know. Don’t really care honestly […] offset by the fact that I don’t and never will drive.

          That’s some strange logic. Either you do know and you can estimate that the offset will indeed “balance it out” or you don’t then you can’t say one way or the other.

          • jfrnz@lemm.ee
            link
            fedilink
            arrow-up
            4
            arrow-down
            3
            ·
            2 days ago

            Running a 500W GPU 24/7 for a full year is less than a quarter of the energy consumed by the average automobile in the US (in 2000). I don’t know how many GPUs this person has or how long it took to fine tune the model, but it’s clearly not creating an ecological disaster. Please understand there is a huge difference between the power consumed by companies training cutting-edge models at massive scale/speed, compared to a locally deployed model doing only fine tuning and inferencing.

              • jfrnz@lemm.ee
                link
                fedilink
                arrow-up
                2
                ·
                2 days ago

                The point is that OP (most probably) didn’t train it — they downloaded a pre-trained model and only did fine-tuning and inference.

                • utopiah@lemmy.world
                  link
                  fedilink
                  arrow-up
                  2
                  arrow-down
                  2
                  ·
                  2 days ago

                  Right, my point is exactly that though, that OP by having just downloaded it might not realize the training costs. They might be low but on average they are quite high, at least relative to fine-tuning or inference. So my question was precisely to highlight that running locally while not knowing the training cost is naive, ecologically speaking. They did clarify though that they do not care so that’s coherent for them. I’m insisting on that point because maybe others would think “Oh… I can run a model locally, then it’s not <<evil>>” so I’m trying to clarify (and please let me know if I’m wrong) that it is good for privacy but the upfront training cost are not insignificant and might lead some people to prefer NOT relying on very costly to train models and prefer others, or a even a totally different solution.

                  • jfrnz@lemm.ee
                    link
                    fedilink
                    arrow-up
                    3
                    ·
                    2 days ago

                    The model exists already — abstaining from using it doesn’t make the energy consumption go away. I don’t think it’s reasonable to let historic energy costs drive what you do, else you would never touch a computer.

            • utopiah@lemmy.world
              link
              fedilink
              arrow-up
              6
              arrow-down
              3
              ·
              2 days ago

              I see. Well, I checked your post history because I thought “Heck, they sound smart, maybe I’m the problem.” and my conclusion based on the floral language you often use with others is that you are clearly provoking on purpose.

              Unfortunately I don’t have the luxury of time to argue this way so I’ll just block you, this way we won’t have to interact in the future.

              Take care and may we never speak again.