

Interesting, thanks for doing the research!
As an extreme non-expert, I would say “deliberate removal of a part of a model in order to study the structure of that model” is a somewhat different concept to “intrinsic and inexorable averaging of language by LLM tools as they currently exist”. They may well involve similar mechanisms, though, and that may be what the OP is referencing; I don’t know enough of the technical side to say.
That paper looks pretty interesting in itself; other issues aside, LLMs are really fascinating in the way they build (statistical) representations of language.
Is it really not as easy for them as saying “hey btw don’t use this distro if you’re in California” and fully expecting nobody to comply? I’m not sure whether Ubuntu is based in Cali; if it is, I can see it being more difficult.
Also this “age bracket” thing seems to have an obvious flaw in that any application that’s running semi-regularly can just poll the API every day and find out the user’s DOB by checking when they roll into the next bracket. It’s actually leaking more data about children than about adults in that case. Brilliant.
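To make the polling idea concrete, here is a rough Python sketch. Everything in it is made up for illustration: the bracket boundaries (13/16/18), the `bracket_for` stand-in for the OS call, and the assumption that the reported bracket flips exactly on the user's birthday. I don't know how any real implementation behaves, so treat this as a thought experiment rather than a description of an actual API.

```python
from datetime import date, timedelta

# Hypothetical bracket boundaries in years; not taken from any real API.
BRACKETS = [13, 16, 18]


def age_on(dob: date, today: date) -> int:
    """Whole years elapsed between dob and today."""
    years = today.year - dob.year
    if (today.month, today.day) < (dob.month, dob.day):
        years -= 1  # birthday has not happened yet this year
    return years


def bracket_for(dob: date, today: date) -> int:
    """Stand-in for the age-bracket API: 0 = under 13, ..., 3 = 18 or over."""
    age = age_on(dob, today)
    return sum(age >= boundary for boundary in BRACKETS)


def infer_dob(query, start: date, max_days: int = 3 * 366):
    """Poll once per simulated day; return the inferred DOB, or None.

    `query(day)` returns the bracket index the API would report on `day`.
    """
    previous = query(start)
    for offset in range(1, max_days + 1):
        day = start + timedelta(days=offset)
        current = query(day)
        if current != previous:
            # The user just reached the lower boundary of the new bracket,
            # so the rollover day is their birthday.
            age_reached = BRACKETS[current - 1]
            return day.replace(year=day.year - age_reached)
        previous = current
    return None


child = date(2010, 5, 17)
adult = date(1980, 3, 2)
print(infer_dob(lambda d: bracket_for(child, d), date(2025, 1, 1)))
print(infer_dob(lambda d: bracket_for(adult, d), date(2025, 1, 1)))
```

Under those assumptions the child's full date of birth falls out the first time they cross a boundary, while the adult is already in the top bracket, never rolls over, and so the poller learns nothing more about them.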