On Mon, Sep 25, 2023 at 10:37 AM Chris Albon <calbon@wikimedia.org> wrote:

Hey SJ!

Intuitively, one model per wiki has a lot of merit.

< if we continued down that path, Lift Wing would eventually be hosting 3000+ models (i.e. 330 models per new feature) pretty quickly

< But with language agnostic models we can make that model available to all communities.

Thanks! Agreed that having a model available to all communities is good for equity :) In the automoderator case, is it that the multilingual model incorporates the language-agnostic model, but not vice versa? Is there a way to have the inverse: a generalized multilingual model that may be fine-tuned for different communities, but does its best with input in less-known languages or variants? [Perhaps with context cues that help users estimate how far out of distribution the input is.]

I like the idea of a general model that can be tuned, since I can imagine community groups maintaining datasets for fine-tuning more easily than maintaining their own entire models.

Warmly, SJ