Lauren:
What kind of hardware should WMCS buy to support such a project?
I can't comment on the hardware requirements, but I would note that in addition to the llama.cpp repository (https://github.com/ggerganov/llama.cpp), which currently focuses on LLaMA/Alpaca, there are other efforts to reduce the computational requirements for running LLMs. https://github.com/NolanoOrg/cformers looks promising and supports many of the open models. Fabrice Bellard of FFmpeg fame was one of the first implementers of a highly optimized LLM at https://textsynth.com/; sadly much of the work is proprietary (there is a binary-only distribution for CPU under MIT license).
https://textsynth.com/playground.html remains one of the most accessible ways to explore the performance of the open models with only a rate limitation, and no requirement to purchase credits.
I disagree, which is why I have been pushing RARR and ROME.
That approach does look promising, thanks for continuing to point it out. The publicly available implementations I've seen of citing sources (Google/Microsoft/You.com/character.ai) still hallucinate heavily and attribute claims to the sources that they don't make, but if an LLM can be steered to cite reliable sources and not fabricate information that isn't in the sources, that would be a huge win!
Chris:
However, not clicking a button to indicate acceptance of terms and conditions does not mean I can do whatever I want. It means that I either have to find other evidence of an explicit licence (maybe text on the document?), or consider whether there is an implicit licence or exemption. An implicit licence might well exist but is qite likely to be minimal in scope.
I broadly agree with your reasoning. My core argument is that using a software license to restrict "ethical use", without requiring clickthrough agreements, is either unenforceable in many cases, or it is enforceable only in a manner that otherwise hollows out consumer rights. That's because it is a _reasonable expectation_ that, at the very least, if you give me a piece of software without any obvious restrictions attached to it, I can run that piece of software (by virtue of implied license, exemption, or both).
In my view, a fundamental goal of free licenses is to expand and protect user rights for covered works. Both the GPL and CC-BY-SA make it explicit that none of their provisions are intended to limit fair use / fair dealing, and as previously noted, the GPL even says explicitly that you can receive and run GPL software without accepting the GPL itself.
To fully enforce the "ethical use" provisions of OpenRail-M (which are the raison d'ĂȘtre for the license) requires the licensor to argue that, if you legally obtained a copy from them without ever seeing the license, you are not permitted to run the software. That is a user-hostile, bait-and-switch approach to software licensing, regardless of whether the courts ultimately uphold it.
I suspect it may never be tested, because most organizations that adopt these licenses do so as a matter of reputational risk mitigation, not with actual intent to enforce these provisions -- which would be often be very impractical for other reasons (identifying the bad actor, proving that the model was used, etc.).
Warmly, Erik