Great point, so are you saying there is a certain threshold above which training is energetically useful but under which it is not, e.g. if training of a large model is used by 1 person, it is not sustainable but if 1 million people use it (assuming it’s done productively, not spam or scam) then it is fine?
Results? I have no idea what you are talking about. I thought we were discussing the training cost (my initial question) and that the truckload was an analogy to argue that the impact from that upfront costs is spread among users.
How much energy was used to bring the truckload of groceries into the shop that one time so hundreds of people can use it?
Great point, so are you saying there is a certain threshold above which training is energetically useful but under which it is not, e.g. if training of a large model is used by 1 person, it is not sustainable but if 1 million people use it (assuming it’s done productively, not spam or scam) then it is fine?
So you’re saying if 1 guy made 1 million results it would offset the training?
Results? I have no idea what you are talking about. I thought we were discussing the training cost (my initial question) and that the truckload was an analogy to argue that the impact from that upfront costs is spread among users.