More

andoando · 2026-04-23T08:17:19 1776932239

Yes, with a lot of reviewing what its doing/asking questions, 100%

andoando · 2026-04-22T19:00:57 1776884457

Because its SO much faster not to have to do all that. I think 10x is no joke, and if you're doing MVP, its just not worth the mental effort.

pron · 2026-04-22T20:53:13 1776891193

POC, sure (although 10x-ing a POC doesn't actually get you 10x velocity). MVP, though? No way. Today's frontier models are nowhere near smart enough to write a non-trivial product (i.e. something that others are meant to use), minimal or otherwise, without careful supervision. Anthropic weren't able to get agents to write even a usable C compiler (not a huge deal to begin with), even with a practically infeasible amount of preparatory work (write a full spec and a reference implementation, train the model on them as well as on relevant textbooks, write thousands of tests). The agents just make too many critical architectural mistakes that pretty much guarantee you won't be able to evolve the product for long, with or without their help. The software they write has an evolution horizon between zero days and about a year, after which the codebase is effectively bricked.

andoando · 2026-04-22T21:41:54 1776894114

There is a million things in between a C compiler and a non-trivial product. They do make a ton of horrible architectural decisions, but I only need to review the output/ask questions to guide that, not review every diff.

pron · 2026-04-22T22:24:34 1776896674

A C compiler is a 10-50KLOC job, which the agents bricked in 0 days despite a full spec and thousands of hand-written tests, tests that the software passed until it collapsed beyond saving. Yes, smaller products will survive longer, but how would you know about the time bombs that agents like hiding in their code without looking? When I review the diffs I see things that, if had let in, the codebase would have died in 6-18 months.

BTW, one tip is to look at the size of the codebase. When you see 100KLOC for a first draft of a C compiler, you know something has gone horribly wrong. I would suggest that you at least compare the number of lines the agent produced to what you think the project should take. If it's more than double, the code is in serious, serious trouble. If it's in the <1.5x range, there's a chance it could be saved.

Asking the agent questions is good - as an aid to a review, not as a substitute. The agents lie with a high enough frequency to be a serious problem.

The models don't yet write code anywhere near human quality, so they require much closer supervision than a human programmer.

sarchertech · 2026-04-22T22:20:54 1776896454

A C compiler with an existing C compiler as oracle, existing C compilers in the training set, and a formal spec, is already the easiest possible non-trivial product an agent could build without human review.

You could have it build something that takes fewer lines of code, but you aren’t gonna to find much with that level of specification and guardrails.

andoando · 2026-04-18T08:05:31 1776499531

I don't either, but I really think Im just burnt out. The simplest things piss me off.

andoando · 2026-04-16T19:01:22 1776366082

Totally agree, AI interfaces will become the norm.

Even all the websites, desktop/mobile apps will become obsolete.

donnisnoni · 2026-04-17T08:36:50 1776415010

AI won't kill apps, it will just change who 'clicks' the buttons. Even the most powerful AI needs a source of truth and a structured environment to pull data from. A world without websites is a world where AI has nothing to read and nowhere to execute. We aren’t deleting the UI. We’re just building the backends that feed the agents.

andoando · 2026-04-16T18:59:17 1776365957

I want it yes. I already feel like Im the one doing the dumb work for the AI of manually clicking windows and typing in a command here or there it cant do.

Ive also been getting increasingly annoyed with how tedious it is to do the same repetitive actions for simple tasks.

andoando · 2026-04-15T20:46:36 1776285996

Most books have so much nonsense details that I cant help but skip most of it.

On the other hand technical books can be so overwhelmingly difficult that you need to go outside and do hours of learning to understand one tidbit of it

andoando · 2026-04-13T18:26:04 1776104764

The big thing is here is more training and that comes in two flavors:

1. Using AI helps as part of the training process.

2. All the prompts going to openai/claude is a gold mine.

andoando · 2026-04-07T00:30:06 1775521806

Isnt the codebase in the context window?

frog437 · 2026-04-07T01:17:24 1775524644

depending on how large your codebase is, hopefully not. At this point use something like the IX plugin to ingest codebase and track context, rather than from the LLM itself.

frog437 · 2026-04-07T04:04:14 1775534654

This is crazy..

tokensSaved = naiveTokens - actualTokens

  - naiveTokens = 19.4M — what ix estimates it would have cost to answer your queries without graph intelligence (i.e., dumping full files/directories into context)                                    
  - actualTokens = 4.7M — what ix's targeted, graph-aware responses actually used
  - tokensSaved = 14.7M — the difference

andoando · 2026-04-09T04:14:14 1775708054

I mean whatever part of the code that is read by the AI has to be in the content window at some point or another nSprewd throughout your sessions Id think even with a huge codebase, 90% of it is going to be there

andoando · 2026-04-06T16:24:24 1775492664

Ive been noticing something similar recently. If somethings not working out itll be like "Ok this isnt working out, lets just switch to doing this other thing instead you explicitly said not to do".

For example I wanted to get VNC working with PopOS Cosmic and itll be like ah its ok well just install sway and thatll work!

albert_e · 2026-04-06T17:52:55 1775497975

Experienced this -- was repeatedly directing CC to use Claude in Chrome extension to interact with a webpage and it was repeatedly invoking Playwright MCP instead.

RALaBarge · 2026-04-07T12:05:03 1775563503

I actually submitted an upstream patch for Cosmic-Comp thanks to Claude on Saturday. I wanted to play Guild Wars remake and something was going on with the mouse and moving the camera. We had it fixed in no time and now shit is working great.

robotswantdata · 2026-04-06T18:12:30 1775499150

It’s as if it gives up, I respond keep going with original plan, you can do it champ!

rootnod3 · 2026-04-06T16:47:55 1775494075

[flagged]

andoando · 2026-04-06T16:57:18 1775494638

satvikpendem · 2026-04-06T18:40:48 1775500848

They're saying just do it yourself instead of trying to herd an unpredictable animal to your bidding like an LLM.

andoando · 2026-04-06T06:36:15 1775457375

Id highly disagree with that. Were all living in the same shared universe, and underlying every intelligence must be precisely an understanding of events happening in this space-time.

vixen99 · 2026-04-06T13:18:57 1775481537

What does 'precisely' mean? Everyone has the same understanding of events - a precise one?

andoando · 2026-04-06T16:14:14 1775492054

No I am saying the basis of intelligence must be shared, not that we have the same exact mental model.

I might for example say a human entered a building, a bat might on the other hand think "some big block with two sticks moved through a hole", but both are experiencing a shared physical observation, and there is some mapping between the two.

Its like when people say, if there are aliens they would find the same mathematical constants thet we do