More

iknowstuff · 2026-04-17T22:06:49 1776463609

No. Their L3 was a scam, never sold and not actually planned to be sold anymore.

kube-system · 2026-04-17T22:10:21 1776463821

So, the same as Tesla's L3.

iknowstuff · 2026-04-18T00:44:20 1776473060

You can go to austin and take an unsupervised robotaxi right now

vel0city · 2026-04-18T01:21:37 1776475297

Last I heard they still have a car following. Has that actually changed?

iknowstuff · 2026-04-17T22:05:56 1776463556

literally every single competitor will do it because they don't control for them at all. Most of them don't even control for traffic lights or stop signs. Also https://www.youtube.com/watch?v=9lc-HUZUiQg

iknowstuff · 2026-04-17T22:04:17 1776463457

not anymore as long as you're looking at the road.

iknowstuff · 2026-04-17T20:55:21 1776459321

style="color:red" is back in fashion?

iknowstuff · 2026-04-17T16:12:51 1776442371

Interesting because I already felt like current models spit out too much garbage verbose code that a human would write in a far more terse, beautiful and grokable way

aray07 · 2026-04-17T16:29:26 1776443366

yeah opus 4.7 feels a lot more verbose - i think they changed the system prompt and removed instructions to be terse in its responses

QuercusMax · 2026-04-17T18:10:32 1776449432

I had a case yesterday where Claude wrote me a series of if/elses in python. I asked it if it could use some newer constructs instead, and it told me that I was on a new enough python version that I could use match/case. Great!

And then it proceeded to rewrite the block with a dict lookup plus if-elses, instead of using match/case. I had to nag it to actually rewrite the code the way it said it would!

iknowstuff · 2026-04-17T01:57:19 1776391039

Imagine where we’d be if the restrictive iOS model was dominant in all computing. We’d never get anything like this

iknowstuff · 2026-04-13T19:02:09 1776106929

Gemma 31B scoring below 26B-A4B?

gertlabs · 2026-04-13T19:10:24 1776107424

In one shot coding, surprisingly, yes, by a decent amount. And it isn't a sample size issue. In agentic, no: https://gertlabs.com/?agentic=agentic

My early takeaway is that Gemma 26B-A4B is the best tuned out of the bunch, but being small and with few active params, it's severely constrained by context (large inputs and tasks with large required outputs tank Gemma 26B's performance). We're working on a clean visualization for this; the data is there.

It's not uncommon for a sub-release of a model to show improvements across the board on its model card, but actually have mixed real performance compared to its predecessor (sometimes even being worse on average).

adrian_b · 2026-04-14T05:53:48 1776146028

In early tests the performance of gemma-4-31B was affected by tokenizer bugs in many of the existing backends, like llama.cpp, which were later corrected by their maintainers.

Moreover, tool invocation had problems that were later corrected by Google in an updated chat template.

So any early benchmarks that have shown the dense model as inferior to the MoE model are likely to be flawed and they must be repeated after updating both the inference backend and the model.

All benchmarks that I have seen after the bugs were fixed have shown the dense model as clearly superior in quality, even if much slower.

gertlabs · 2026-04-14T06:57:54 1776149874

We add samples every week, so I'm curious if the numbers will move.

They did a similar re-release during the Gemini 3.1 Pro Preview rollout, and released a custom-tools version with its own slug, which performs MUCH better on custom harnesses (mostly because the original release could not figure out tool call formatting at all).

iknowstuff · 2026-04-13T00:04:58 1776038698

Yeah but approving every purchase from a merchant I trust, like Amazon, would be annoying. Gotta allow for one tap to purchase, like eg apple pay does

snicky · 2026-04-13T03:18:22 1776050302

IIRC BLIK asks you if you want to skip the verification next time you buy from the same merchant.

iknowstuff · 2026-04-12T06:52:17 1775976737

And they gave this guy life in prison! Unlucky/stupid to do it after turning 18.

https://kotaku.com/gta-6-hacker-sentenced-prison-life-185111...

def curious to hear your story if you’re willing to share

joshmn · 2026-04-12T13:42:06 1776001326

I didn’t know about this case. When you said it I thought it must have been a US federal case. Nope.

https://news.ycombinator.com/item?id=45844197

iknowstuff · 2026-04-10T18:08:51 1775844531

Not really. A portion of users will randomly tap that just to get rid of the question. They don’t read.

The easiest way to experience that yourself is to set your device to a language you barely understand. You’ll find yourself dismissing dialogs just like all those illiterate normies.