More

mdavid626 · 2026-04-03T12:35:54 1775219754

AV1 lacks hw support…

out_of_protocol · 2026-04-03T14:23:05 1775226185

VP9 works well too and more supported (default YouTube codec)

breve · 2026-04-03T12:38:16 1775219896

My laptop has hardware AV1 encoding and decoding. My TV has AV1 decoding. My phone has AV1 decoding. None of these devices are particularly new.

And don't underestimate dav1d (https://www.videolan.org/projects/dav1d.html). You can comfortably play AV1 video in software on your phone. Try it with VLC.

petcat · 2026-04-03T12:40:52 1775220052

> You can comfortably play AV1 video in software on your phone

Maybe for about 15 minutes before your battery is drained to 20%. I'm not aware of any software video decoder at all that won't unacceptably heat up your phone and kill your battery.

daneel_w · 2026-04-03T12:50:17 1775220617

You really are underestimating just how far e.g. Apple's mobile CPUs have come in terms of raw performance and power-efficiency.

breve · 2026-04-03T12:41:35 1775220095

Why say maybe? Why not simply try it for yourself?

petcat · 2026-04-03T12:45:22 1775220322

I don't need to try it myself to know that software video decoding on the CPU is not a viable solution on mobile phones.

breve · 2026-04-03T13:02:17 1775221337

Of course it is. Even the iPhone 7 from 2016 can play 1080p AV1 video.

Why did you spend all that money on your phone if you're not going to exercise the hardware?

smilespray · 2026-04-03T13:58:03 1775224683

Not OP, but 2hr battery time for video playback vs 20hr would be the entire point of this thread. HW decoding is an order of magnitude more efficient.

breve · 2026-04-03T20:23:17 1775247797

That isn't the benchmark set by petcat. petcat's goal post is 15 minutes to 20% battery with AV1 playback.

It's baffling that petcat won't simply try it. Maybe he has awkwardly discovered he does have hardware AV1 support after all.

I've still got my old iPhone 7. I'll dust it off and do the experiment. I think it'll do at least 90 minutes in VLC.

petcat · 2026-04-03T20:52:05 1775249525

Software video decoding on a CPU is woefully inefficient and will drain your battery. Full stop.

> I've still got my old iPhone 7. I'll dust it off and do the experiment. I think it'll do at least 90 minutes in VLC.

Your "old iPhone 7" probably wont even boot up now, let alone play 1080p AV1 video for more than 5 minutes. Go ahead, try it.

breve · 2026-04-03T23:48:29 1775260109

> Your "old iPhone 7" probably wont even boot up now, let alone play 1080p AV1 video for more than 5 minutes.

None of your predictions came true. In fact, you were more than 24 times wrong about it.

My 10-year-old iPhone 7 with its 10-year-old battery and a small crack in the screen, hardware which was released before AV1 was released, did boot up. I charged it to 99%.

I downloaded a 4 minute music video. It's 1080p25 1609kbps AV1 video, 48khz 122kbps stereo Opus audio in an MP4 container.

Using VLC 3.7.2 (which uses dav1d for decoding AV1), I played the video continuously on repeat. It took 122 minutes for the battery to go from 99% to 20%. At 20% the phone switched to low power mode and kept playing the video.

I should have put it in low power mode the whole time. I'll try that next to see if it can go longer.

In the meantime, what we can conclude is that the iPhone 7 is mighty.

dav1d, most especially, is mighty.

petcat isn't.

breve · 2026-04-04T07:44:09 1775288649

Low power mode did indeed make a difference.

I charged to 100%, put the phone in low power mode, and ran the same test.

This time it took 200 minutes to go from 100% to 20% battery.

TheMiddleMan · 2026-04-03T12:39:42 1775219982

It's coming along nicely https://en.wikipedia.org/wiki/AV1#Hardware_encoding_and_deco...

Also decoding on a reasonably powerful (non-accelerated) cpu is fast enough for 1080p, not ideal for battery life but still.

mdavid626 · 2026-04-01T18:34:08 1775068448

I trust sandbox-exec more, or Docker on Linux. Those come from the OS, well tested and known.

MITM proxy is nice idea to avoid leaking secrets. Isn’t it very brittle though? Anthropic changes some URL-s and it’ll break.

afshinmeh · 2026-04-01T18:36:14 1775068574

Thanks for sharing that. Zerobox _does_ use the native OS sandboxing mechanisms (e.g. seatbelt) under the hood. I'm not trying to reinvent the wheel when it comes to sandboxing.

Re the URLs, I agree, that's why I added wildcard support, e.g. `*.openai.com` for secret injection as well as network call filtering.

mdavid626 · 2026-04-01T19:04:36 1775070276

You know, the thing is, that it is super easy to create such tools with AI nowadays. …and if you create your own, you can avoid these unnecessary abstractions. You get exactly what you want.

mdavid626 · 2026-04-01T18:47:55 1775069275

How do you intercept network traffic on mac os? How do you fake certificates?

afshinmeh · 2026-04-01T19:04:03 1775070243

Zerobox creates a cert in `~/.zerobox/cert` on the first proxy run and reuses that. The MTIM process uses that cert to make the calls, inject certs, etc. This is actually done by the underlying Codex crate.

mdavid626 · 2026-04-01T19:06:05 1775070365

Yeah, but how does the sandboxed process “know” that it has to go through the proxy? How does it trust your certificate? Is the proxy fully transparent?

afshinmeh · 2026-04-01T19:09:37 1775070577

Oh I see. It inject HTTP_PROXY/HTTPS_PROXY/etc. env vars into the process so that all sandboxed subprocesses go through the proxy.

blanched · 2026-04-01T19:24:04 1775071444

What if the program doesn’t respect those env vars? Can Zerobox still block network calls in that case?

afshinmeh · 2026-04-01T19:50:39 1775073039

Great question! On Linux, yes, network namespaces enforce that and all net traffic goes through the proxy. Direct connections are blocked at the kernel level even if the program ignores proxy env vars, but I will test this case a bit more (unsure how to though, most network calls would respect HTTPS_PROXY and other similar env vars).

That being said, the default behaviour is no network, so nothing will be routed if it's not allowed regardless of whether the sandboxed process respects env vars or not.

solarkraft · 2026-04-02T18:50:10 1775155810

Does this work inside of Podman containers?

simonw · 2026-04-01T20:04:04 1775073844

How about on macOS?

afshinmeh · 2026-04-01T20:08:42 1775074122

On macOS, the proxy is best effort. Programs that ignore HTTPS_PROXY/HTTP_PROXY can connect directly. This is a platform limitation (macOS Seatbelt doesn't support forced proxy routing).

BUT, the default behaviour (no net) is fully enforced at the kernel level. Domain filtering relies on the program respecting proxy env vars.

simonw · 2026-04-01T20:12:46 1775074366

I thought seatbelt-exec had mechanisms for that?

  (allow network-outbound
    (remote tcp "127.0.0.1:8080"))

afshinmeh · 2026-04-01T20:21:32 1775074892

It does but because I'm inheriting the seatbelt settings from Codex, I'm not resetting it in Zerobox (I thought it's a safer option). Let me look into this, there should be a way to take Codex' profile and safely combine/modify it.

mdavid626 · 2026-04-01T05:51:53 1775022713

How the hell is it 500k lines?

twsted · 2026-04-01T06:00:10 1775023210

It is vibe coded.

dankobgd · 2026-04-01T11:16:24 1775042184

it's just bunch of useless junk

mdavid626 · 2026-03-31T18:45:43 1774982743

It’s time to run development in sandboxes. Docker or sandbox-exec (for Mac).

mdavid626 · 2026-03-31T12:14:55 1774959295

Oh boy, you couldn't be more wrong. If something, LLM-s need MORE readable code, not less. Do you want to burn all your money in tokens?

jen20 · 2026-03-31T17:35:09 1774978509

I very much doubt Anthropic devs are metered, somehow.

mdavid626 · 2026-03-26T07:39:46 1774510786

There is no such thing as agentic codebase. If humans don’t understand it, nothing really does. Agents give zero fuck about anything. If they burn 100 or million tokens to add a feature, they don’t care. It’s the developers responsibility to keep it under control.

drzaiusx11 · 2026-03-26T12:59:31 1774529971

100% this. With these new tools it's tempting to one-shot massive changesets crossing multiple concerns in preexisting, stable codebases.

The key is to keep any changes to code small enough to fit in your own "context window." Exceed that at your own risk. Constantly exceeding your capacity for understanding the changes being made leads to either burnout or indifference to the fires you're inevitably starting.

Be proactive with these tools w.r.t. risk mitigation, not reactive. Don't yolo out unverified shit at scales beyond basic human comprehension limits. Sure, you can now randomly generate entirely (unverified) new software into being, but 95% of the time that's a really, really bad idea. It is just gambling and likely some part of our lizard brains finds it enticing, but in order to prevent the slopification of everything, we need to apply some basic fucking discipline.

As you point out, it's our responsibility as human engineers to manage the risk reward tradeoffs with the output of these new tools. Anecdotally, I can tell you, we're doing a fucking bad job of it rn.

codyb · 2026-03-26T13:19:22 1774531162

The big AI projects I've seen at work are...

- A Kafka topic visualization dashboard

and

- A chrome extension the original "developer" can no longer work on cause the bots will wreck something else on every new feature he tries to add or bug he tries to fix

I think we're a ways out from truly complex code bases that only agents understand.

I've seen a bunch of hype video where people spend lord knows how much money in order to have a bunch of these things run around and I guess... use Facebook, and make reports to distribute amongst themselves, and then the human comes in and spends all their time tweaking this system. And then apparently one day it's going to produce _something_ but two years and counting and much like bitcoin, I've yet to see much of this _something_ materialize in the form of actual, working, quality software that I want to use.

My buddy made a thing that tells him how many people are at the gym by scraping their API and pushing it into a small app package... I guess that's kind of nice.

mdavid626 · 2026-03-24T06:50:01 1774335001

Isn’t that good advice? Rewriting it in C can prove that the problem is not in your programmin language.

Outdated OS can be the problem as well.

What kind of advice did you expect?

Panzerschrek · 2026-03-24T07:17:58 1774336678

> Rewriting it in C can prove that the problem is not in your programmin language

It's a lot of work of rewriting it in C. It's doable, but impractical. And such rewrite may introduce new bugs.

Proving that the problem isn't in my language wasn't necessary - there were no room for it to introduce such kind of bug. Language bugs are usually manifest themselves in a way different way, (like broken binary code leading to crashes or accepting invalid code). That's why I have created my question in a first place - while I was sure, that it wasn't a language bug.

> Outdated OS can be the problem as well.

Outdated OS can't be a problem. Basic socket functionality was implemented eons ago and all known bugs should be fixed even in pretty outdated versions of the kernel.

> What kind of advice did you expect?

Eventually I have found myself, that in my test program I create too much connections for a TCP server and its internal connection queue overflows. But it took some time to find it, way longer compared to what could be achieved if my SO was answered. The problem was not so hard to spot considering that I have described what exactly I do. Even providing code examples wasn't necessary - textual description was enough.

7bit · 2026-03-24T07:45:24 1774338324

Your arguments fall flat.

Could very well be that your language was the issue and writing a small proof of concept for the specific use case that's problematic in a battle tested language other people know is a standard trouble shooting step. Especially if it was a rare limiting error, that sounds like a trivial thing to be implemented in C with a simple for loop and some dummy data perhaps.

Same with the OS. Only because socket functionality is decades old doesn't mean that you can't hit a recently introduced bug, that could have been fixed in a new version. That also is a standard troubleshooting step.

It doesn't make for bad advice only because you're too lazy to do it.

mdavid626 · 2026-03-22T08:48:03 1774169283

I’d be happier if this could run as one binary without Docker. Java is so much harder to setup.

jeroenhd · 2026-03-22T09:43:38 1774172618

Getting Java to run is a base requirement for running most software written in Java.

However, there is a dedicated Dockerfile for creating a native image (Java words for "binary") that shouldn't require a JVM. I haven't tested running the binary myself so it's possible there are dependencies I'm not aware of, but I'm pretty sure you can just grab the binary out of the container image and run there locally if you want to.

It'll produce a Linux image of course, if you're on macOS or Windows you'd have to create a native image for those platforms manually.

mdavid626 · 2026-03-22T14:22:57 1774189377

Yeah, compare this with downloading single binary approach.

Downloading JDK, setting up the correct env variables, or running Docker, all this is just pain, compared to single binary approach.

drzaiusx11 · 2026-03-22T14:54:50 1774191290

Isn't a docker image basically a universal binary at this point? It's a way to ship a reproducible environment with little more config than setting an ENV var or two. All my local stuff runs under a docker compose stack so I have a container for the db, a container for redis, LocalStack, etc

mdavid626 · 2026-03-22T15:35:33 1774193733

Is it though?

On my Mac Docker runs Linux virtualized. It’s a resource hog.

Compare that with simple native binary.

drzaiusx11 · 2026-03-22T17:49:23 1774201763

I'm not saying it's ideal, just saying that's what we've shifted to for repeatable programs. Your Linux "universal" binary certainly won't work on your Mac directly either...

mdavid626 · 2026-03-19T19:03:37 1773947017

Firefox doing everything, but being a good browser.

mdavid626 · 2026-03-19T06:57:27 1773903447

Sigh… that’s not how it works.

qsera · 2026-03-19T08:02:23 1773907343

How what works?