aray07 · 2026-04-18T17:51:55 1776534715

Came to a similar conclusion after running a bunch of tests on the new tokenizer.

It was on the higher end of Anthropic's range - closer to 30-40% more tokens.

https://www.claudecodecamp.com/p/i-measured-claude-4-7-s-new...
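For anyone who wants to run the same kind of comparison on their own prompts, here is a rough sketch of the arithmetic. The token counts are made up for illustration and `token_inflation` is a hypothetical helper, not anything from the linked post; you would plug in counts obtained by tokenizing the same texts under the old and new tokenizer.

```python
from statistics import mean


def token_inflation(old_counts, new_counts):
    """Return (aggregate, mean per-sample) relative increase in token count.

    old_counts[i] and new_counts[i] must be counts for the same text.
    """
    if not old_counts or len(old_counts) != len(new_counts):
        raise ValueError("need two non-empty lists of equal length")
    # Aggregate: weights long samples more heavily, which matches how billing works.
    aggregate = sum(new_counts) / sum(old_counts) - 1
    # Per-sample mean: every sample counts equally, regardless of length.
    per_sample = mean(n / o for o, n in zip(old_counts, new_counts)) - 1
    return aggregate, per_sample


# Made-up counts for three sample texts (NOT measured data).
old = [1000, 2000, 4000]
new = [1300, 2800, 5400]

aggregate, per_sample = token_inflation(old, new)
print(f"aggregate: {aggregate:.1%}, per-sample mean: {per_sample:.1%}")
```

The two numbers can differ when long and short inputs inflate by different amounts, so it is worth reporting which one you mean.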

aray07 · 2026-04-17T17:17:38 1776446258

Yeah, that's the part that is unclear to me as well - whether our usage capacity is now going to run out faster.

AndyNemmity · 2026-04-17T18:48:03 1776451683

The same thing I've been doing all along has now used up a third of my weekly limit in one day on max20.

So yes, for the same tasks, usage runs out faster (currently).

aray07 · 2026-04-17T17:16:11 1776446171

I'm running some experiments on this, but based on what I have seen in my own personal data, I don't think this is true:

"given that Opus 4.7 on Low thinking is strictly better than Opus 4.6 on Medium, etc., etc."

Opus 4.7 in general is more expensive for similar usage. Now, we can argue that it provides better performance, all else being equal, but I haven't been able to see that.

aray07 · 2026-04-17T17:05:08 1776445508

Effort level is separate from tokenization. Tokenization impacts you the same regardless of which effort level you pick.

I find 5 thinking levels to be super confusing - I don't really get why they went from 3 to 5.

aray07 · 2026-04-17T17:04:02 1776445442

I think the new Qwen models are supposed to be good, based on some of the articles that I read.

aray07 · 2026-04-17T16:51:41 1776444701

Anthropic's pricing is all based on token usage:

https://platform.claude.com/docs/en/about-claude/pricing

So if you are generating more tokens, you are eating up your usage faster.
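To make the arithmetic concrete, here is a minimal sketch. The price and token counts are hypothetical placeholders, not Anthropic's actual rates, and the 35% figure is just the midpoint of the 30-40% range mentioned elsewhere in this thread. It assumes the per-token price stays the same between versions.

```python
def request_cost(tokens: int, usd_per_million_tokens: float) -> float:
    """Cost of a workload billed purely by token count."""
    return tokens / 1_000_000 * usd_per_million_tokens


# Hypothetical numbers, for illustration only.
PRICE = 10.0          # USD per million tokens, assumed unchanged between versions
old_tokens = 100_000  # tokens the old tokenizer produces for some workload
inflation = 0.35      # assumed: the same text now yields 35% more tokens

new_tokens = round(old_tokens * (1 + inflation))
old_cost = request_cost(old_tokens, PRICE)
new_cost = request_cost(new_tokens, PRICE)

print(f"old: ${old_cost:.2f}, new: ${new_cost:.2f}, "
      f"increase: {new_cost / old_cost - 1:.0%}")
```

With a flat per-token price, the bill grows by exactly the token inflation, so a usage cap denominated in tokens shrinks by the same factor.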

aray07 · 2026-04-17T16:50:39 1776444639

Are you okay with paying more for your services without any perceived improvement in the service itself?

schmookeeg · 2026-04-17T16:55:04 1776444904

That's been a constant for my entire adult life.

aray07 · 2026-04-17T16:49:45 1776444585

Yeah, that is my biggest issue - I'm okay with paying 20-30% more, but what is the ROI? I don't see an equivalent improvement in performance. Anthropic hasn't published any data on what these improvements are - just some vague "better instruction following".

Bridged7756 · 2026-04-17T18:32:31 1776450751

It's enshittificating real fast. They'll just keep releasing model after model, each more expensive than the last, with marginal gains, but touted as "the next thing". Evangelists will say that they're afraid, it's the future, in 6 months it's all over. Anthropic will keep astroturfing on Reddit. CEOs will make even more outlandish claims.

You raised a good point: what's a good metric for LLM performance? There are surely all the benchmarks out there, but aren't they one and done, usually at release? What keeps checking the performance of those models afterward? At this point it's just by feel. People say models have been dumbed down, and that's it.

I think the actual future is open source models. Problem is, they don't have the huge marketing budgets Anthropic or OpenAI do.

conductr · 2026-04-17T21:37:42 1776461862

This is the most likely trajectory, I fear. It reminds me a lot of Oracle, where they rebrand and reskin products just to change pricing/marketing without adding anything.

skydhash · 2026-04-17T22:14:41 1776464081

Win 10, Win 11, all the recent macOS releases… could have been released as features and not new products.

margorczynski · 2026-04-17T17:58:04 1776448684

The other thing is that most people don't really care about price per token or whatever, but about how much it will cost to (successfully) execute a task they want.

It doesn't matter that a model is e.g. 30% cheaper to use than another (token-wise) if I need to burn 2x more tokens to get the same acceptable result.
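A quick sketch of that comparison, with hypothetical prices and token counts. The `success_rate` parameter is my own addition: it assumes failed attempts are retried independently, so the expected number of attempts is 1 / success_rate.

```python
def cost_per_successful_task(usd_per_million_tokens: float,
                             tokens_per_attempt: int,
                             success_rate: float = 1.0) -> float:
    """Expected spend to get one acceptable result.

    Assumes independent retries, so expected attempts = 1 / success_rate.
    """
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    cost_per_attempt = tokens_per_attempt / 1_000_000 * usd_per_million_tokens
    return cost_per_attempt / success_rate


# Hypothetical models: B is the baseline, A is 30% cheaper per token
# but needs twice the tokens for the same acceptable result.
model_b = cost_per_successful_task(10.0, 50_000)
model_a = cost_per_successful_task(7.0, 100_000)

print(f"B: ${model_b:.2f} per task, A: ${model_a:.2f} per task, "
      f"A/B = {model_a / model_b:.2f}")
```

Under these made-up numbers the "cheaper" model ends up costing 1.4x as much per task (0.7 x 2), which is the point: per-token price alone says little without tokens per task and how often the result is acceptable.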

aray07 · 2026-04-17T16:35:50 1776443750

Isn't caveman a joke? Why would you use it for real work?

aray07 · 2026-04-17T16:29:26 1776443366

Yeah, Opus 4.7 feels a lot more verbose - I think they changed the system prompt and removed the instructions to be terse in its responses.