I'd get confused if I were an LLM and you put my entire prompt in a text file attachment. I'd be like, "is this the user or is this a prompt injection??"
If you paste a long enough prompt into either GPT or Claude, it gets turned into an attachment, so this can happen. I think the conversion is invisible to the model, but somehow not to the summarizer.
Errors compounding is a meme. In iterated as well as verifiable domains, errors dilute rather than compound, because the LLM has repeated chances to notice its failure.
Well, there can't be direct evidence: it's a private corporation and we don't know how big the model is. But you can look on OpenRouter for hosts that offer open models with known sizes, where there's no brand and so no incentive to subsidize, and their prices don't look wildly bigger than OpenAI/Anthropic API prices.
edit: example: GLM 5.1, a 751B model, is offered for $0.60/M in, $4.43/M out. Scuttlebutt (i.e. I asked Google's AI) seems to think that Opus 4 is a 1T/5T MoE model, so you can treat it (with some effort) as a 1T model for pricing purposes. Its API pricing is $1.55/M in, $25/M out, i.e. 2x to 5x more than GLM. Idk what to say other than this sounds about right, probably with a healthy margin.
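To check the "2x to 5x" claim, here's a trivial sketch of the arithmetic using the per-million-token prices quoted above (a rough comparison; the Opus figures are scuttlebutt, not confirmed):

```python
# Prices in $ per million tokens, as quoted in the comment above.
glm_in, glm_out = 0.60, 4.43    # GLM 5.1 via an OpenRouter host
opus_in, opus_out = 1.55, 25.0  # rumored Opus figures (unverified)

ratio_in = opus_in / glm_in     # ratio on input tokens
ratio_out = opus_out / glm_out  # ratio on output tokens

print(f"input: {ratio_in:.1f}x, output: {ratio_out:.1f}x")
# → input: 2.6x, output: 5.6x
```

So the "2x to 5x" range roughly holds on input, and output lands just past the top of it.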
I always avoided Ollama because it smelled like a project that was trying so desperately to own the entire workflow. I guess I dodged a bigger bullet than I knew.
This makes sense if Anthropic think they're the best positioned to make safe AI. However, if you are already looking at an AI company, there's obviously some selection effect happening.