On Sun, Apr 2, 2023 at 5:40 PM Erik Moeller <eloquence(a)gmail.com> wrote:
I can't comment on the hardware requirements, but I would note that in
addition to the llama.cpp repository
(https://github.com/ggerganov/llama.cpp), which currently focuses on
LLaMA/Alpaca, there are other efforts to reduce the computational
requirements for running LLMs.
https://github.com/NolanoOrg/cformers looks promising and supports
many of the open models. Fabrice Bellard of FFmpeg fame was one of the
first implementers of a highly optimized LLM at
https://textsynth.com/; sadly, much of that work is proprietary.
At this point I guess I would recommend adding five or so
g2.cores8.ram36.disk20 flavor VPSs to WMCS, with between one and three
RTX A6000 GPUs each, plus a 1TB SSD each, which should cost under
$60k. That should allow for very widely multilingual models with
performance somewhere between GPT-3.5 and GPT-4, given current
training rates.
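To make the "under $60k" figure concrete, here is a back-of-envelope sketch. All unit prices below are hypothetical placeholders I chose for illustration, not quotes or figures from this thread; only the node count and the 1-3 GPU range come from the proposal above.

```python
# Rough cost sketch for the proposed cluster. Every price below is a
# hypothetical placeholder, not a quote; the original estimate was
# simply "under $60k".
nodes = 5
gpus_per_node = 2       # midpoint of the 1-3 RTX A6000 range (assumption)
gpu_price = 4_500       # hypothetical per-GPU price, USD
ssd_price = 100         # hypothetical 1 TB SSD price, USD
chassis_price = 1_500   # hypothetical per-node chassis/server cost, USD

total = nodes * (gpus_per_node * gpu_price + ssd_price + chassis_price)
print(f"estimated total: ${total:,}")  # with these placeholders: $53,000
```

With two GPUs per node the placeholder numbers land comfortably under $60k; at three GPUs per node the GPU line dominates and the estimate gets tighter.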
https://textsynth.com/playground.html remains one of the most
accessible ways to explore the performance of the open models, with
only a rate limit and no requirement to purchase credits.
There is a free Alpaca-30b demo for comparison at
https://github.com/deep-diver/Alpaca-LoRA-Serve
and a free Alpaca-7b demo online at
https://chatllama.baseten.co/
These models can be quantized into int4 weights which run on cell
phones:
https://github.com/rupeshs/alpaca.cpp/tree/linux-android-build-support
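The int4 trick is simple at its core: store each weight as a 4-bit integer plus a shared per-group scale, cutting memory roughly 4x versus fp16. Here is a minimal sketch of symmetric group-wise int4 quantization; it illustrates the idea only and is not llama.cpp's actual format, which packs two 4-bit values per byte and uses its own block layouts.

```python
import numpy as np

def quantize_int4(w, group_size=32):
    """Symmetric int4 quantization with one scale per group of weights.
    Illustrative sketch only, not the llama.cpp on-disk format."""
    w = w.reshape(-1, group_size)
    scale = np.abs(w).max(axis=1, keepdims=True) / 7.0  # int4 range is -8..7
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize_int4(q, scale):
    return q * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(4, 32)).astype(np.float32)
q, scale = quantize_int4(w)
w_hat = dequantize_int4(q, scale).reshape(w.shape)
print("max abs error:", np.abs(w - w_hat).max())
```

The reconstruction error per weight is bounded by half the group scale, which is why the quality loss at int4 is modest for large models while the memory footprint drops enough to fit on a phone.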
It seems inevitable that we will someday bundle such LLMs with
Internet-in-a-Box, and why not also with the primary mobile apps, so
we don't have to give away server CPU time?
There is a proposal to allow apps over 4GB in WASM:
https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Over…
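For context on where the 4GB ceiling comes from: current wasm32 linear memory is addressed with 32-bit offsets, so an app can address at most 2^32 bytes; memory64 widens the addresses to 64 bits.

```python
# wasm32 linear memory uses 32-bit addresses, so the maximum
# addressable memory is 2**32 bytes = 4 GiB. The memory64 proposal
# lifts this by using 64-bit addresses.
wasm32_limit_bytes = 2**32
print(wasm32_limit_bytes // 2**30, "GiB")  # prints: 4 GiB
```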
At the rate things are improving, maybe that won't even be needed to
make a reasonable static web app someday.
-LW