That's like saying 'cars were better made in the 1950s because they used tons of steel'. Like they were 'heavier and more robust' - but that doesn't mean better.
Foundations are way better, more robust, especially weatherized. Windows today are like magic compared to windows 100 years ago.
What we do more poorly now is that we don't use solid wood everywhere (e.g. doors), and certain kinds of workmanship are rarer - winding staircases, mouldings - but you can easily have that if you want to pay for it. That's a choice.
AI is power and leverage, it will make better things as long as it's directed by skilled operators.
I read that as "it's not worth the negative PR of being associated with AI firing minimum wage employees" compared to just paying them for a year or two.
Each European country has various orgs and types of license. It is easier to start with the SEC to integrate that data into my site; also, you have to test a bit what users care about.
Wondering aloud -- this is clearly PII, but it's public information. The site would be subject to GDPR, and other rules from the EU, and folks may want to have their data hidden or removed. What would be the exposure for sourcing EU data?
People have tried to run Qwen3-235B-A22B-Thinking-2507 on 4x $600 used Nvidia 3090s with 24 GB of VRAM each (96 GB total), and while it runs, it is too slow for production-grade use (<8 tokens/second). So we're already at $2,400 before you've purchased system memory and a CPU, and it is still too slow for a "Sonnet equivalent" setup...
You can quantize it, of course, but if the idea is "as close to Sonnet as possible," then while quantized models are objectively more efficient, they sacrifice precision for it.
So the next step is to up that speed: 4x $1,300 Nvidia 5090s with 32 GB of VRAM each (128 GB total), or $5,200 before RAM/CPU/etc. All of this additional cost just to increase your tokens/second without lobotomizing the model. And it still may not be enough.
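To make the cost/VRAM arithmetic above explicit, here's a trivial sketch (card prices are the used-market/street figures quoted above, not guarantees):

```python
# GPU-only cost and pooled VRAM for the two 4-card rigs discussed above.
def rig(cards, price_usd, vram_gb):
    """Return (total price, total VRAM) for a multi-GPU setup."""
    return cards * price_usd, cards * vram_gb

rig_3090 = rig(4, 600, 24)   # used 3090s
rig_5090 = rig(4, 1300, 32)  # 5090s

print(rig_3090)  # (2400, 96)  -> $2,400 for 96 GB of VRAM
print(rig_5090)  # (5200, 128) -> $5,200 for 128 GB of VRAM
```

And that's before system RAM, CPU, PSU, and the rest of the build.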
I guess my point is: You see this conversation a LOT online. "Qwen3 can be near Sonnet!" but then when asked how, instead of giving you an answer for the true "near Sonnet" model per benchmarks, they suddenly start talking about a substantially inferior Qwen3 model that is cheap to run at home (e.g. 27B/30B quantized down to Q4/Q5).
The local models absolutely DO exist that are "near Sonnet." The hardware to actually run them is the bottleneck, and it is a HUGE financial/practical bottleneck. A $10K all-in budget isn't actually insane for this class of model, and the sky really is the limit (again, to reduce quantization and/or increase tokens/second).
PS - And electricity costs are non-trivial for 4x 3090s or 4x 5090s.
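For a rough sense of what "non-trivial" means, a back-of-the-envelope sketch (the wattage, duty cycle, and electricity price here are all assumptions, not measurements):

```python
# Hypothetical figures: ~350 W per 3090 under load, 8 h/day, $0.15/kWh.
cards, watts_per_card = 4, 350
hours_per_day, usd_per_kwh = 8, 0.15

kwh_per_day = cards * watts_per_card * hours_per_day / 1000  # 11.2 kWh/day
usd_per_month = kwh_per_day * usd_per_kwh * 30               # ~$50/month

print(f"{kwh_per_day:.1f} kWh/day, ~${usd_per_month:.0f}/month")
```

5090s draw more per card, and a rig serving requests around the clock would multiply this further.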
Qwen3.5-35B-A3B is reported to perform slightly better than the model you mentioned.
It runs fine but non-optimally on a single 3090, even with 131,072 tokens of context, and due to the hybrid attention architecture, the memory usage and compute scale much less drastically than ctx^2. I've had friends with smaller cards still getting work out of it. Generation is around 20 tokens/sec on that 3090 (without doing anything special yet). You'll need enough DRAM to hold the parts of the model that don't fit in VRAM. Nothing to write home about, but genuinely usable in a pinch or for tasks that don't need immediate interactivity.
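The "less drastically than ctx^2" point is mostly about the KV cache, which for plain attention grows linearly with context length (the quadratic part is compute, not memory for weights). A minimal sketch with made-up architecture numbers (the layer/head counts below are placeholders, not Qwen3.5's actual config):

```python
def kv_cache_bytes(ctx_len, n_layers=48, n_kv_heads=4, head_dim=128, bytes_per=2):
    """Bytes for an fp16 K+V cache; all architecture numbers are hypothetical."""
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per

# Linear in ctx_len: doubling the context doubles the cache size.
assert kv_cache_bytes(131072) == 2 * kv_cache_bytes(65536)
print(kv_cache_bytes(131072) / 2**30)  # 12.0 GiB with these placeholder numbers
```

Hybrid attention layers (sliding-window/linear) shrink this further, which is part of why long contexts fit on a 24 GB card at all.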
It's the first local model that passes my personal kimbench usability benchmark at least. Just be aware that it is extremely verbose in thinking mode. Seems to be a qwen thing.
(edit: On rechecking my numbers; I now realize I can possibly optimize this a lot better)
With respect, this isn't "new data"; it is an anecdote. And it kind of represents exactly the problem I was talking about above:
- Qwen is near Sonnet 4.5!
- How do I run that?
- [Starts talking about something inferior that isn't near Sonnet 4.5].
It is this strange bait/switch discussion that happens over and over. Not least because Sonnet has a 200K context window, and most of these anecdotes aren't for anywhere near that context size.
You're not wrong; but... imho it's closer to Sonnet 4.0 [1] on my personal benchmark [2]. And I HAVE run it at just over 200K tokens of context: it works, it's just a bit slow at that size. It's not great, but... usable to me? I used Sonnet 4.0 over the API for half a year or so before, after all.
The only way to know whether your own criteria are matched - or not yet - is to test it for yourself with your own benchmark or what have you.
And it does show a promising direction going forward: usable (to some) local models becoming efficient enough to run on consumer hardware.
[1] released mid-2025
[2] take with salt - only tests personal usability
+ Note that some benchmarks do show Qwen3.5-35B-A3B matching Sonnet 4.5 (released later last year); but I treat those with the same skepticism you do, clearly ;)
> The hardware to actually run them is the bottleneck, and it is a HUGE financial/practical bottleneck.
That's unsurprising, seeing as inference for agentic coding is extremely context- and token-intensive compared to general chat. Especially if you want it to be fast enough for a real-time response, as opposed to just running coding tasks overnight in a batch and checking the results as they arrive. Maybe we should go back to viewing "coding" as a batch task, where you submit a "job" to be queued for the big iron and wait for the results.
A machine with 128GB of unified system RAM will run reasonable-fidelity quantizations (4-bit or more).
If you ever want to answer this type of question yourself, you can look at the size of the model files. Loading a model usually uses an amount of RAM around the size it occupies on disk, plus a few gigabytes for the context window.
Qwen3.5-122B-A10B is 120GB. Quantized to 4 bits it is ~70GB. You can run a 70GB model in 80GB of VRAM or 128GB of unified normal RAM.
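The rule of thumb from the parent comment (RAM needed ≈ size on disk, and bits-per-parameter drive the size on disk) can be written down directly; the ~10% overhead factor for quantization scales/metadata is my own rough assumption:

```python
def quantized_size_gb(n_params_billion, bits, overhead=1.10):
    """Approximate on-disk size: params * bits/8, plus ~10% for scales/metadata."""
    return n_params_billion * bits / 8 * overhead

print(round(quantized_size_gb(122, 4)))  # ~67 GB -- close to the ~70 GB quoted
print(round(quantized_size_gb(122, 8)))  # ~134 GB at 8 bits
```

Add a few GB on top for the context window, per the rule above, and you land right around the 80 GB VRAM / 128 GB unified RAM figures.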
Systems with that capability cost a few thousand USD to purchase new.
If you are willing to sacrifice some performance, you can take advantage of the model being a mixture-of-experts and use disk space to get by with less RAM/VRAM, but inference speed will suffer.
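Why the speed suffers: an MoE only touches its active parameters per token (the "A10B" in the name), but if those bytes have to come off disk instead of RAM, read bandwidth caps your token rate. A worst-case sketch (the SSD bandwidth and cache behavior are assumptions):

```python
active_params_billion = 10     # "A10B": ~10B active parameters per token
bits = 4                       # 4-bit quantization
ssd_gb_per_s = 2.0             # hypothetical sustained SSD read speed

gb_touched_per_token = active_params_billion * bits / 8  # ~5 GB per token
worst_case_s_per_token = gb_touched_per_token / ssd_gb_per_s

print(worst_case_s_per_token)  # 2.5 s/token if every expert misses the cache
```

In practice frequently-used experts stay cached in RAM, so real throughput sits well above this floor, but it shows why disk offload is a last resort.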