Yes. The Cray supercomputers from the '80s were crazy good matmul machines in particular. The quad-CPU Cray X-MP (1984) could sustain 800 MFLOPS to 1 GFLOPS, and with a 1 GB SSD it had enough compute power and bandwidth to train a 7-10M-parameter language model in about six months, and to infer at 18-25 tok/sec.
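A quick sanity check on the six-month figure, using the standard ~6 × params × tokens FLOP estimate for transformer training. All the numbers below are my assumptions (mid-range of the figures above), not measurements from real X-MP runs:

```python
# Back-of-the-envelope: how many tokens could a sustained-800-MFLOPS
# machine train on in six months, using the rough 6*N*D FLOP rule?
# Every constant here is an assumption, not a benchmark.

params = 8e6            # mid-range of the 7-10M figure above
flops_per_sec = 0.8e9   # sustained 800 MFLOPS
seconds = 182 * 86400   # roughly six months

budget = flops_per_sec * seconds
tokens = budget / (6 * params)
print(f"{tokens:.2e} tokens trainable")  # a few hundred million
```

A few hundred million tokens is a plausible corpus for a model that small, so the claim at least passes the arithmetic test.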
A mid-90s Cray T3E could have handled GPT-2 124M, 24 years before OpenAI.
I also had a punch-card computer from 1965 learn XOR with backpropagation.
The hardware was never the bottleneck, the ideas were.
Post-quantum crypto is a good example of this. Lattice-based schemes were theorized in the 90s, but they took decades to actually reach production. The math existed, the hardware existed, but the ideas for making it work in practice were just not there yet.
I am a bit surprised, but I guess everything eventually wears out.
In the 1980s I worked as a field engineer who supported a lot of PDP-11s. They were very reliable for the time; tape drives and disks were the #1 maintenance items. Actually having to open up the processor and change a board was not a regular activity.
Other machines of that era, like those from Gould, Perkin-Elmer, or DG, gave regular practice in the art of repairing processors.
Guess I expect them to work forever. Like a Toyota.
I encounter two main failure modes. First, the bipolar PROMs degrade at the atomic level: the metal ions in the fuses tend to migrate or 'regrow' over decades, causing bit rot.
Second, the backplanes suffer from mechanical fatigue. After forty years of thermal expansion and structural flexing, especially when inserting boards, the traces and solder joints develop stress cracks. Both are a pain to repair.
XENIX's second target processor was an 11/34 with a Programmer's Workbench. That nightmare took 3~4 years... Microsoft years, while they used the PDP-11/70 for development.
Thanks for reposting! I'm the author of ATTN-11. Happy to answer any questions about the fixed-point arithmetic, the PDP-11 hardware, or the training process.
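For readers wondering what "fixed-point arithmetic" means in this context, here is a generic sketch of the technique in Python. To be clear, this is my illustration of Q-format arithmetic in general; the actual format, rounding, and saturation choices in ATTN-11 may differ:

```python
# Illustrative 16-bit Q4.12 fixed-point arithmetic (4 integer bits,
# 12 fractional bits). This is a generic sketch, NOT the ATTN-11 code;
# the author's actual format and rounding may differ.

FRAC_BITS = 12
SCALE = 1 << FRAC_BITS

def to_fix(x: float) -> int:
    """Encode a float as a two's-complement Q4.12 value, saturating to int16."""
    v = int(round(x * SCALE))
    return max(-32768, min(32767, v))

def fix_mul(a: int, b: int) -> int:
    """Multiply two Q4.12 values: take the 32-bit product (as EIS MUL
    yields a 32-bit result on the PDP-11), round, shift back, saturate."""
    prod = a * b
    return max(-32768, min(32767, (prod + (1 << (FRAC_BITS - 1))) >> FRAC_BITS))

def to_float(v: int) -> float:
    return v / SCALE

a, b = to_fix(1.5), to_fix(-0.25)
print(to_float(fix_mul(a, b)))  # -0.375
```

The interesting engineering questions are all in the details this sketch glosses over: where the intermediate activations overflow, and how much precision the softmax needs.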
Incredible work! Fitting a transformer into 32KB of RAM is crazy.
For those reading about this project who aren't familiar with the PDP-11, it can be hard to appreciate just how difficult working within these memory limits is.
Here is a visual guide to the PDP-11 architecture: https://vectree.io/c/pdp-11-hardware-architecture
The PDP-11 was the most fun minicomputer of the late 1970s, in my opinion. Growing up in NH, about an hour north of Digital's HQ, all sorts of schools from primary to secondary, as well as museums, had PDP-8, PDP-10, PDP-11, and later VAX machines.
The PDP-11 had a timesharing OS called RSTS/E which could give maybe 10 people a BASIC programming experience a little better than an Apple ][. If you were messing with 8-bit microcomputers in 1981 you might think a 16-bit future would look like the PDP-11, but the 1970 design was long in the tooth by 1980 -- like the 8-bit micros, it was limited to a 64 KB logical address space. Virtual memory let it offer 64 KB environments to more users, but it couldn't give a single user a bigger environment.
Fun stuff! At one point I wondered about building something similar. But I lack the AI chops, and have too many other projects going on anyway.
I'm curious as to the type of memory in the 11/34. I also have a working PDP-11, an 11/05 with 32KW of actual core. I wonder what performance would be like with EIS emulation grafted in. Stunningly slow, I imagine.
I also have a working design for a small Transformer on the original Game Boy. It has around 4000 parameters fitting in the 8 KB cartridge SRAM, where the "saved game" is the trained model. A TI-82 with its 32 KB of RAM would be even more comfortable.
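The parameter budget works out neatly at 16-bit weights. The layer sizes below are my guesses for illustration, not the actual Game Boy design; the point is just that roughly 4000 int16 parameters is exactly an 8 KB save file:

```python
# Back-of-the-envelope parameter budget for a cartridge-SRAM transformer.
# These dimensions are hypothetical, chosen only to show the arithmetic;
# n_heads is unused because the head dim is folded into the projections.

def transformer_params(d_model, n_heads, d_ff, n_layers, vocab, ctx):
    embed = vocab * d_model + ctx * d_model  # token + position tables
    attn = 4 * d_model * d_model             # Q, K, V, output projections
    ffn = 2 * d_model * d_ff                 # up and down projections
    return embed + n_layers * (attn + ffn)

n = transformer_params(d_model=16, n_heads=2, d_ff=16, n_layers=2,
                       vocab=32, ctx=32)
print(n, "params =", n * 2, "bytes as int16")  # 4096 params = 8192 bytes
```

With these made-up dimensions the model lands on 4096 parameters, filling the 8 KB SRAM exactly at two bytes per weight.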
Around the same time (1984), there was also another very cool piece of technology that often gets overlooked: the CMU WARP. It wasn't as flashy as the Crays and the Connection Machine, but it was the first systolic array accelerator (the ancestor of what we'd now call TPUs). It packed as many MFLOPS as a Cray-1.
It's also the computer that powered the Chevrolet Navlab self-driving car in 1986.
I've been building a functional language for differentiable programming that compiles to JAX. The core idea is homoiconicity applied to ML: models are data structures that can inspect and transform themselves.
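The flavor of "models as data structures" can be sketched in a few lines of plain Python. This is a hypothetical toy to illustrate the idea, not my actual language or its JAX backend: the model is a nested tuple, ordinary code walks and rewrites it, and a tiny interpreter stands in for the compiler:

```python
# Toy illustration of homoiconic models (hypothetical, not the actual
# language): the model IS a data structure, so any code can inspect it,
# transform it, and then evaluate it. A real system would emit JAX here.

import math

model = ("chain",
         ("scale", 2.0),
         ("activation", "relu"))

def transform(expr, old, new):
    """Rewrite the model tree, e.g. swapping one activation for another."""
    if isinstance(expr, tuple):
        return tuple(transform(e, old, new) for e in expr)
    return new if expr == old else expr

def run(expr, x):
    """Tiny interpreter standing in for compilation to JAX."""
    op = expr[0]
    if op == "chain":
        for sub in expr[1:]:
            x = run(sub, x)
        return x
    if op == "scale":
        return x * expr[1]
    if op == "activation":
        return max(0.0, x) if expr[1] == "relu" else math.tanh(x)
    raise ValueError(op)

tanh_model = transform(model, "relu", "tanh")  # the model rewrote itself
print(run(model, -1.5), run(tanh_model, -1.5))
```

Because the model and the code that manipulates it share one representation, passes like quantization or operator fusion become ordinary tree rewrites rather than framework-specific graph surgery.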
For those interested, this guy is revamping the Emacs widget library with something more modern and platform agnostic, based on SDL: https://appetrosyan.github.io/posts/
Interesting, thanks for sharing! I've had thoughts about making vui.el backend-agnostic so it could target different widget implementations (like xwidgets or even native-GUI). An SDL-based widget library could potentially be one of those backends. Need to dig into appetrosyan's work before I can say anything intelligent about it though. And of course, it was an idea and I am unlikely to dive deep without practical need (time is limited, sadly).
My only complaint regarding the Zed editor is the inability to display two panes of the sidebar one below the other. Not only is it impossible to display them together, but switching between them requires clicking a tiny button in the status bar. To make matters worse, performing a search hides the symbols and the tree view.
> I think all of ML being in Python is a colossal mistake that we'll pay for for years.
Market pressure. Early ML frameworks were in Lisp, then eventually Lua with Torch, but demand dictated the choice of Python because "it's simple" even if the result is cobbled together.
Lisp is arguably still the most suitable language for neural networks for a lot of reasons beyond the scope of this post, but the tooling is missing. I’m developing such a framework right now, though I have no illusions that many will adopt it. Python may not be elegant or efficient, but it's simple, and that's what people want.
Gee, I wonder why the tooling for ML in Lisp is missing even though the early ML frameworks were in Lisp. Perhaps there is something about the language that stifles truly wide collaboration?
I doubt it considering there are massive Clojure codebases with large teams collaborating on them every day. The lack of Lisp tooling and the prevalence of Python are more a result of inertia, low barrier to entry and ecosystem lock-in.