Hacker Newsnew | past | comments | ask | show | jobs | submit | brunooliv's commentslogin

I still haven’t seen any other models be as complete as Claude inside Claude Code. I bet Anthropic knows this and they turn the knobs and see people’s reactions… I have been planning with Qwen3.6 Max inside opencode, absolutely game changer. Opus can then follow the plan quite detailed and like this I can make progress on my toy apps on Pro plan at 20/mo.

For work, unlimited usage via Bedrock.

Yes I’d like to get more usage out of my personal sub, but at 20/mo no complains


Also my experience


I’ve been using Opus 4.6 extensively inside Claude Code via AWS Bedrock with max effort for a few months now (since release). I’ve found a good “personal harness” and way of working with it in such a way that I can easily complete self contained tasks in my Java codebase with ease.

Now idk if it’s just me or anything else changed, but, in the last 4/5 days, the quality of the output of Opus 4.6 with max effort has been ON ANOTHER LEVEL. ABSOLUTELY AMAZING! It seems to reason deeper, verifies the work with tests more often, and I even think that it compacted the conversations more effectively and often. Somehow even the quality of the English “text” in the output felt definitely superior. More crisp, using diagrams and analogies to explain things in a way that it completely blew me away. I can’t explain it but this was absolutely real for me.

I’d say that I can measure it quite accurately because I’ve kept my harness and scope of tasks and way of prompting exactly the same, so something TRULY shifted.

I wish I could get some empirical evidence of this from others or a confirmation from Boris…. But ISTG these last few days felt absolutely incredible.


This thread is very confusing. Everyone is saying diametrically opposed things. But I think this may be a clue: AWS bedrock means api billing, no? I’m guessing those complaining about the recently lowered quality of Claude are on subscriptions. And those who are still loving Claude are on work accounts.


Maybe… but I can say I saw a real shift in these last few days, why or if it’s real, I can’t fully say but definitely something changed


Unusable if not Opus 4.6 on max effort sadly. Price is quite steep too! I still remember when Sonnet was an absolute beast…


Couldn’t read the entire comments but, my experience has been overwhelmingly positive so far. I think what helps me be effective is a combination of factors: I work only in a modern, well-documented and well-architected Java codebase with over 80% test coverage.

I only use Claude Code with Opus 4.6 on High Effort.

I always, ALWAYS treat my “new job” as writing a detailed ticket for whatever it is I need to do.

I give the model access to a DB replica of my prod DB that I create manually.

I do NOT waste time with custom agents, Claude.md files or any of that stuff.

When I put ALL of the above together, the results ARE THE PROMISED LAND: I simply haven’t written a single line of code manually in the last 3 months.


I find this pretty interesting. I am curious though: Did you dislike coding? You sound genuinely excited to not be doing it anymore.

For me I have been a coder since a very young age and I am nearing the end of my career now. I still love writing code to problem solve just as much as the first day I learnt to code. The thought of something taking that task away from me doesn't fill me with glee.

A parallel for me is if I enjoyed puzzle pages and those brought me with joy and satisfaction employing my grey matter to solve, I just wouldn't find it interesting to have an agent complete the forms to me, with me simply guiding the agent to clues.


Replying once again for future reference to make my position clear: I firmly believe that one MUST experience programming on its own first. No LLMs, no crutches. One MUST feel the abstractions melting away and things clicking in the brain first.

The design becoming obvious. Being able to remove that extra if statement after clarifying requirements with a customer face to face.

A design pattern fitting a scenario like a glove, etc, etc.

You need REAL experience that only comes with time and effort. Years or decades, different businesses, different companies, etc.

But once you have crossed that chasm and that rite of passage, using LLMs becomes a true multiplier and my experience quite fun.

Using them blindly or without experience is a very different thing I can imagine.


I like problem solving and building useful things for our customers. Coding for me was always more of a “means to an end” than pure craft on its own. Obviously some standard, good and clean code pops up when you’re working in things to be extended or maintained by others, but, truth be told, ego battling in code reviews gets boring very fast and additionally, no matter how much I like experimenting with things, if I have an hypothesis, I can now validate it in 2 days instead of 1 week, which means I can validate double the hypothesis.

I am extremely excited about that! Coding in itself as the act of manually typing things? Absolutely not


What rock?

C'mon let's be real here, there's either "testing AI skills" versus "using AI agents like you would on the daily".

The signal got from leetcode is already dubious to assert profeciency and it's mostly used as a filter for "Are you willing to cram useless knowledge and write code under pressure to get the job?" just like system design is. You won't be doing any system design for "scale" anywhere in any big tech because you have architects for that nor do you need to "know" anything, it's mostly gatekeeping but the truth is, LLMs democratized both leetcode and system design anyway. Anyone with the right prompting skills can now get to an output that's good for 99% of the cases and the other 1% are reserved for architecs/staff engineers to "design" for you.

The crux of the matter is, companies do not want to shift how they approach interviews for the new era because we have collectively believed that the current process is good enough as-is. Again, I'd argue this is questionable given how sometimes these services break with every new product launch or "under load" (where YO SYSTEM DESIGN SKILLZ AT).


I really love Anthropic's models, but, every single product/feature I've used other than the Claude Code CLI has been terrible... The CLI just "sticked" for me and I've never needed (or arguably looked in depth) any other features. This for my professional dayjob.

For personal use, where I have a Pro subscription and adventure into exploring all the other features/products they have... I mean, the experience outside of Claude Code and the terminal has been... bad.


> every single product/feature I've used other than the Claude Code CLI has been terrible

yeah they're shipping too fast and everything is buggy as shit

- fork conversation button doesn't even work anymore in vscode extension

- sometimes when I reconnect to my remote SSH in VSCode, previously loaded chats become inaccessible. The chats are still there in the .jsonl files but for some reason the CC extension becomes incapable of reading them.


I tend to agree here. Today, I tried to get the claude chat to give me a list of Jira tickets from one board (link provided) and then upload it to notion with some additional context. It glitched out after trying the prompt over again 4x. I eventually gave up and went back to the terminal.


Yes. This is my experience as well. The software quality is generally horrible. It surely has improved a lot over the last couple of months, but it is still pretty horrible.

It is quite normal for me to have to force-close Claude Desktop.


Would be really curious to see


Mickens is the best!


I built the running app I always wanted: https://runcoach.fly.dev

You get tailored running schedules and also some body weight strength workouts and healthy meals all in one!


Error Invalid input: 1 validation error for PlanRequest current_km Input should be greater than 0 [type=greater_than, input_value=0.0, input_type=float] For further information visit https://errors.pydantic.dev/2.5/v/greater_than


Yes it assumes a minimum running base, will improve thx


It should be fixed now!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: