More

wordpad · 2026-05-03T14:06:14 1777817174

Planetary Annihilation did it and wrote and gave talks about it.

wordpad · 2026-04-24T03:55:35 1777002935

The players barely ever change. People don't have problems following sports, you shouldn't struggle so much with this once you accept top spot changes.

gbnwl · 2026-04-24T05:03:52 1777007032

I didn't express this well but my interest isn't "who is in the top spot", and is more _why and _how various labs get the results they do. This is also magnified by the fact that I'm not only interested in hosted providers of inference but local models as well. What's your take on the best model to run for coding on 24GB of VRAM locally after the last few weeks of releases? Which harness do you prefer? What quants do you think are best? To use your sports metaphor it's more than following the national leagues but also following college and even high school leagues as well. And the real interest isn't even who's doing well but WHY, at each level.

yorwba · 2026-04-24T08:03:36 1777017816

The technical report discussing the why and how is here: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main...

renticulous · 2026-04-24T06:08:43 1777010923

Follow the AI newsletters. They bundle the news along with their Op-Ed and summarize it better.

stef25 · 2026-04-24T09:37:24 1777023444

Tips on what newsletters are worth signing up for ?

anonymousDan · 2026-04-24T08:03:27 1777017807

Can you suggest some good ones?

namnnumbr · 2026-04-24T12:39:39 1777034379

I really like latent.space and simonwillison.com.

Also (shameless self-promo) I publish a 2x weekly blog just to force myself to keep up: https://aimlbling-about.ninerealmlabs.com/treadmill/

yorwba · 2026-04-24T08:04:23 1777017863

https://jack-clark.net/

ayewo · 2026-04-25T09:11:24 1777108284

Thanks for this!

Link to direct newsletter subscription: https://importai.substack.com/

ehnto · 2026-04-24T04:41:27 1777005687

It is funny seeing people ping pong between Anthropic and ChatGPT, with similar rhetoric in both directions.

At this point I would just pick the one who's "ethics" and user experience you prefer. The difference in performance between these releases has had no impact on the meaningful work one can do with them, unless perhaps they are on the fringes in some domain.

Personally I am trying out the open models cloud hosted, since I am not interested in being rug pulled by the big two providers. They have come a long way, and for all the work I actually trust to an LLM they seem to be sufficient.

dannyw · 2026-04-24T12:23:51 1777033431

Their financial projections that to a big part their valuation and investor story is built on involves actually making money, and lots of money, at some point. That money has to come from somewhere.

DiscourseFan · 2026-04-24T04:46:27 1777005987

I find ChatGPT annoying mostly

awakeasleep · 2026-04-24T04:49:40 1777006180

Open settings > personalization. Set it to efficient base style. Turn off enthusiasm and warmth. You’re welcome

2ndorderthought · 2026-04-24T11:02:30 1777028550

Yea but even then it's still annoying. "It's not about the enthusiasm and warmth but the general tone"

layer8 · 2026-04-24T13:58:43 1777039123

Setting “base style and tone” to “efficient” works fine for me.

wordpad · 2026-04-22T14:56:41 1776869801

That's way more than 10, around 50

wordpad · 2026-04-19T23:32:32 1776641552

>Capitalists claim that this is optimal.

It's more optimal than planned economies until we have AI planned economies with realtime feedback, I guess.

Consumers get cheap goods during oversupply and most inefficient companies get elliminated during bust while consolidation leads to economies of scale.

whatever1 · 2026-04-19T23:55:56 1776642956

No this is literally a sign of an unstable system with too high of a gain K.

There is an alternative where legislation dampens this behavior but the short term profits will be lower. Hence the hawks don’t like it.

wordpad · 2026-04-20T00:56:15 1776646575

>legislation dampens this behavior

Potentially. Well meaning and thought out legislation still distorts the markets, possibly making things objectively worse.

Panzer04 · 2026-04-20T00:38:52 1776645532

This is a wild take.

BenFranklin100 · 2026-04-20T03:29:10 1776655750

Sophomoric take more precisely.

AngryData · 2026-04-20T09:30:00 1776677400

Why is the opposite of capitalist markets automatically assumed to be a command economy? Co-op style businesses aren't really capitalist orientated but are also not reliant on government action.

wordpad · 2026-04-10T21:36:34 1775856994

How does this compare to Jules from Google?

danoandco · 2026-04-10T21:54:40 1775858080

Jules is similar to Twill with the following differences:

- Twill is CLI-agnostic, meaning you can use Claude Code, Codex or Gemini. Jules only works with Gemini.

- We focus on the delegation experience: Twill has native integrations with your typical stack like Slack or Linear. The PRs comes back with proofs of work, such as screenshots or videos.

wordpad · 2026-04-13T17:42:26 1776102146

That's very interesting, thank you!

wordpad · 2026-04-02T17:32:20 1775151140

Do you think it's just part of their training set now?

alexeiz · 2026-04-02T18:35:17 1775154917

It's time to do "frog on a skateboard" now.

Wyverald · 2026-04-04T01:16:17 1775265377

In case you haven't seen this: https://x.com/JeffDean/status/2024525132266688757

lysace · 2026-04-02T19:24:43 1775157883

Seems very likely, even if Google has behaved ethically.

Simon and YC/HN has published/boosted these gradual improvements and evaluations for quite some time now.

There is a https://simonwillison.net/robots.txt but it allows pretty much everything, AI-wise.

simonw · 2026-04-02T17:53:36 1775152416

If it's part of their training set why do the 2B and 4B models produce such terrible SVGs?

vessenes · 2026-04-02T18:23:41 1775154221

We were promised full SVG zoos, Simon. I want to see SVG pangolins please

nickpsecurity · 2026-04-02T22:51:05 1775170265

Larger models better understand and reproduce what's in their training set.

For example, I used to get verbatim quotes and answers from copyrighted works when I used GPT-3.5. That's what clued me in to the copyright problem. Whereas, the smallest models often produced nonsense about the same topics. Because small models often produce nonsense.

You might need to do a new test each time to avoid your old ones being scraped into the training sets. Maybe a new one for each model produced after your last one. Totally unrelated to the last one, too.

wolttam · 2026-04-02T20:39:03 1775162343

Because it is in their training set but it's unrealistic to expect a 2B or 4B model to be able to perfectly reproduce everything it's seen before.

The training no doubt contributed to their ability to (very) loosely approximate an SVG of pelican on a bicycle, though.

Frankly I'm impressed

retinaros · 2026-04-02T20:05:27 1775160327

because generating nice looking svg requires handling code, shapes, long context, reasoning and at 2b you most likely will break the syntax of the file 9 times out of 10 if you train for that. or you will need to go for simpler pelicans. might not be worth to ft on a 2b. but on their top tier open model it is definitly worth it. even not directly but just crawling a github would make it train on your pelicans.

wordpad · 2026-03-25T13:50:00 1774446600

They are not doing random rotation, simplification here means they are aligning the outliers. If you threw a bunch of shapes on the ground they are picking up one that rolled away and putting it with the others.

>How can a boolean value preserve all of the relational and positional information between data points?

They aren't reducing entire vector to a bollean only each of its dimensions.

wordpad · 2026-03-17T19:31:54 1773775914

> AI capability problem is mostly solved; the distribution and trust problem isn't.

SaaS opportunity? Maybe, some sort of marketplace of AI-written applications and services with discovery features?

wordpad · 2026-03-12T00:28:32 1773275312

I have a junior position open and got 1,300 applicants in 1 week before we took it down. Many of the candidates with strong resumes are just lying and doing so well enough to pass HR screens.

I doubt any sort of AI screen would help though as many of the lying candidates are already using AI assist tools making it just a cat and mouse race...

I don't know a good solution to give everyone a fair chance.

eloisant · 2026-03-12T15:51:19 1773330679

You can't give everyone a fair chance, but at least don't waste their time with a stupid AI interview.

Also, at the end of the day, in your 1,300 applicants maybe you have 200 who are a perfect fit and as equally good. But you just have one position. So even with a perfect system that gives you complete information, you'll still have to reject 199 strong candidates.

wordpad · 2026-02-26T16:58:37 1772125117

It's not just for politics but fairness. You can't just one day up and decide to make something illegal that others depending on for livelyhood. It's good enough that it limits growth of the banned thing.

ryandrake · 2026-02-26T17:20:44 1772126444

Sure you can. It just takes backbone, which is rarely found in the political class.

If I, as a voter, voted for a politician who promised to ban dumping mercury in the local river, I don't expect them to say "Oh, but any company already dumping mercury in the river can keep doing so, because we don't want to hurt people's livelihood." That's not what I voted for.

MoltenMan · 2026-02-26T18:27:45 1772130465

Ok, but if you are investing capital in some sort of production line or industrialization you are not going to want to do that in an area where you might just lose your entire investment instantly; instead, you're just going to invest it in Texas or China. Of course with more extreme examples like yours you do have to put some cost on the existing companies to get it fixed, but it would be something with a smaller cost like having to dispose of the mercury properly (whereas in this article's examples they just flat out ban these things, which you can't do to existing factories).

ryandrake · 2026-02-26T18:55:50 1772132150

For sure there would be a disincentive to "invest" in the area where you might lose the investment. That would be intentional. As a voter, I specifically don't want companies to be making those kinds of "investments" in my region. Go "invest" your dirty industry in China. If California's reputation for harshly regulating these things prevents these kinds of businesses from opening here in the first place, I consider that Working As Intended. We could make that reputation even stronger by not grandfathering things.