elijahwright_'s comments

elijahwright_ · 2025-07-28T23:22:51 1753744971

there's a good reason why this isn't done this way. Git is very open ended but there are .gitignore templates out there. ideally anyone submitting a pull request should be looking at what they're adding. but the real issue here is this:

> Luckily, you realize that you can turn the blacklist of files (the gitignore) into a whitelist, by just ignoring everything and manually un-ignoring desired files.

this will not work and tbh it's fitting a square peg in a round hole. it technically might fit if you shave off the square enough but large repos don't do this, they use a round peg: https://github.com/github/gitignore

elijahwright_ · 2025-07-01T23:35:39 1751412939

this article doesn't make any sense. the bill has a lot of em dashes because that's how bills are expressed and it's a large bill. bills in Congress aren't written with em dashes because it can be confusing with the bill syntax and there's not a reason to do it that way

anakaine · 2025-07-02T00:08:56 1751414936

The author compares it to the average bill going through congress, where you expect 0.1 emdash per page, where this bill has 10. So 100x the historic average.

elijahwright_ · 2025-07-02T01:03:46 1751418226

well, for one, it's more more than 0.1 em dashes per page. the SHARE IT Act has 10 on each page[0]. I don't know how many the 2017 tax cut bill had but it's more than 1,000 and that was over 185 pages[1], and obviously that was before LLMs like ChatGPT. so I don't really know why this is the measure of AI or not, especially because bills have always had a lot of em dashes to start. if you're not analyzing the text of the bill then it's just not going to be accurate

[0] https://www.congress.gov/118/plaws/publ187/PLAW-118publ187.p...

[1] https://www.congress.gov/115/statute/STATUTE-131/STATUTE-131...

rooftopzen · 2025-07-03T10:14:25 1751537665

I'm the author and updated this post - after looking into this, the larger bills contain entire pages with only headings that contain emdashes - removed the headings from analysis so that the emdashes per page are only from the legislative text itself. For the baseline, over 50% of bills found on congress.gov are 1-2 pgs, after reading a few I decided some rationale could exist to remove them from the baseline - even after all these adjustments, we're still looking at a 30% increase from a decent baseline of similar bill size. It's evident when reading the text below headings (as a human!)

rooftopzen · 2025-07-02T04:38:33 1751431113

Share IT is from 2024, but the 2017 tax cut bill is interesting (lots of emdashes there that deviate from the avg) - you’re correct on the additional need for text analysis in this case. Bills I’d found from earlier in 2024 that are publicly available do not have emdashes outside of the table of contents, which is built into the average - curious how/why they are used so much in this bill from 2017, now wondering how they got into any potential templates (or not), and adds the confound of how much this is AI or template (or requirements, or something else) Thx!

rooftopzen · 2025-07-03T10:18:13 1751537893

Not following exactly, so apologies if I'm misinterpreting, but I'm the author and updated this post (transparently) with nuance I'd recently learned about that explains this (somewhat) - the larger bills contain entire pages with only headings that contain emdashes - removed the headings from analysis so that the emdashes per page are only from the legislative text itself. For the conservatively / minimal difference, we're still looking at a 30% increase from a decent baseline.

elijahwright_ · 2025-07-01T20:00:09 1751400009

sorry I changed the name, here's the correct URL - https://docs.openfile.tax/en/latest/reference.html

elijahwright_ · 2025-07-01T19:59:27 1751399967

the code they released was everything that is necessary to run Direct File in development, but they removed code relating to MeF (the IRS's online submission API) and SADI (the IRS's auth system which is integrated with ID.me). most of the code is the backend, Fact Graph (which is a very complex rules engine), the client app (both the form and the screener when you go to directfile.irs.gov), and the state tax API

1oooqooq · 2025-07-02T10:39:31 1751452771

so it is the mostly useless system that only works for a single w2 :(

99.999% of people here would never be able to use it anyway.

elijahwright_ · 2025-07-02T19:57:03 1751486223

it works with multiple W-2s I'm pretty sure. the goal is to make it work for everyone which is pretty difficult but I'm motivated to at least work on getting it to work with IRAs because I have one and I want to use this for next year

1oooqooq · 2025-07-04T11:13:37 1751627617

is this official but incomplete code better than the always open source but mostly complete open tax solver project?

elijahwright_ · 2025-07-08T02:16:47 1751941007

I wouldn't know but at least this code has a good foundation to add onto

elijahwright_ · 2025-07-01T07:25:36 1751354736

I'll fix the docs today, thanks

McAlpine5892 · 2025-07-01T07:44:20 1751355860

Thank you for your work