frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

We do not think Anthropic should be designated as a supply chain risk

https://twitter.com/OpenAI/status/2027846016423321831
205•golfer•4h ago•77 comments

The Windows 95 user interface: A case study in usability engineering (1996)

https://dl.acm.org/doi/fullHtml/10.1145/238386.238611
126•ksec•3h ago•63 comments

MinIO Is Dead, Long Live MinIO

https://blog.vonng.com/en/db/minio-resurrect/
193•zufallsheld•4h ago•68 comments

Obsidian Sync now has a headless client

https://help.obsidian.md/sync/headless
380•adilmoujahid•9h ago•134 comments

The happiest I've ever been

https://ben-mini.com/2026/the-happiest-ive-ever-been
313•bewal416•2d ago•144 comments

Show HN: Xmloxide – an agent made rust replacement for libxml2

https://github.com/jonwiggins/xmloxide
19•jawiggins•1h ago•6 comments

Block the “Upgrade to Tahoe” Alerts

https://robservatory.com/block-the-upgrade-to-tahoe-alerts-and-system-settings-indicator/
120•todsacerdoti•6h ago•54 comments

Addressing Antigravity Bans and Reinstating Access

https://github.com/google-gemini/gemini-cli/discussions/20632
199•RyanShook•11h ago•167 comments

Building a Minimal Transformer for 10-digit Addition

https://alexlitzenberger.com/blog/post.html?post=/building_a_minimal_transformer_for_10_digit_add...
32•kelseyfrog•3h ago•4 comments

Woxi: Wolfram Mathematica Reimplementation in Rust

https://github.com/ad-si/Woxi
242•adamnemecek•3d ago•103 comments

Verified Spec-Driven Development (VSDD)

https://gist.github.com/dollspace-gay/d8d3bc3ecf4188df049d7a4726bb2a00
142•todsacerdoti•8h ago•68 comments

Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers

https://venturebeat.com/technology/alibabas-new-open-source-qwen3-5-medium-models-offer-sonnet-4-...
205•lostmsu•5h ago•129 comments

Deterministic Programming with LLMs

https://www.mcherm.com/deterministic-programming-with-llms.html
17•todsacerdoti•3d ago•6 comments

Werner Herzog Between Fact and Fiction

https://www.thenation.com/article/culture/werner-herzog-future-truth/
63•Hooke•1d ago•14 comments

New evidence that Cantor plagiarized Dedekind?

https://www.quantamagazine.org/the-man-who-stole-infinity-20260225/
108•rbanffy•3d ago•68 comments

Show HN: Now I Get It – Translate scientific papers into interactive webpages

https://nowigetit.us
183•jbdamask•12h ago•99 comments

The whole thing was a scam

https://garymarcus.substack.com/p/the-whole-thing-was-scam
591•guilamu•8h ago•162 comments

The archivist preserving decaying floppy disks

https://www.popsci.com/technology/floppy-disk-archivist-project/
45•Brajeshwar•3d ago•4 comments

MCP server that reduces Claude Code context consumption by 98%

https://mksg.lu/blog/context-mode
224•mksglu•15h ago•54 comments

747s and Coding Agents

https://carlkolon.com/2026/02/27/engineering-747-coding-agents/
123•cckolon•1d ago•58 comments

Ghosts'n Goblins – “Worse danger is ahead”

https://superchartisland.com/ghostsn-goblins/
61•elvis70•3d ago•24 comments

From Noise to Image – interactive guide to diffusion

https://lighthousesoftware.co.uk/projects/from-noise-to-image/
103•simedw•2d ago•15 comments

The Eternal Promise: A History of Attempts to Eliminate Programmers

https://www.ivanturkovic.com/2026/01/22/history-software-simplification-cobol-ai-hype/
231•dinvlad•3d ago•165 comments

Pentagon chief blocks officers from Ivy League schools and top universities

https://fortune.com/2026/02/28/pentagon-officer-education-ivy-league-schools-universities-partner...
45•geox•1h ago•13 comments

Unsloth Dynamic 2.0 GGUFs

https://unsloth.ai/docs/basics/unsloth-dynamic-2.0-ggufs
197•tosh•16h ago•52 comments

The Future of AI

https://lucijagregov.com/2026/02/26/the-future-of-ai/
116•BerislavLopac•14h ago•93 comments

Our Agreement with the Department of War

https://openai.com/index/our-agreement-with-the-department-of-war
210•surprisetalk•5h ago•189 comments

What I learned while trying to build a production-ready nearest neighbor system

https://github.com/thatipamula-jashwanth/smart-knn
17•Jashwanth01•3d ago•10 comments

The United States and Israel have launched a major attack on Iran

https://www.cnn.com/2026/02/28/middleeast/israel-attack-iran-intl-hnk
1035•lavp•19h ago•2269 comments

'Play like a dog biting God's feet': Steven Isserlis on György Kurtág at 100

https://www.theguardian.com/music/2026/feb/26/steven-isserlis-on-the-formidable-gyorgy-kurtag-at-100
21•mitchbob•2d ago•0 comments
Open in hackernews

The war against PDFs is heating up

https://www.economist.com/business/2026/02/24/the-war-against-pdfs-is-heating-up
20•pseudolus•2h ago

Comments

pseudolus•2h ago
https://archive.ph/aCleq
dhosek•2h ago
Well, that was a nonsense article. Badly written software has trouble with PDFs, accessibility is an afterthought (which, sadly, is true of most things) and some small group thinks they can invent a better wheel, ignoring the fact that they’d have to do a lot of work to overcome the first mover advantages of HTML and PDF and this comment now has more information than the original article thanks to that clause beginning with “ignoring”.
pavel_lishin•2h ago
> Yet Duff Johnson, head of the PDF Association, protector of the format, argues that the fault lies not in the file type but in ourselves. He contends that there is no reason developers cannot build bots that are able to use PDFs. The AI assistant embedded in Acrobat, Adobe’s PDF reader, is designed to do precisely that, notes Leonard Rosenthol, the software firm’s PDF guru.

Designed to, but does it do it well without the problems noted earlier in the article?

ssl-3•1h ago
Strictly anecdotally, I've had no trouble feeding PDFs to OpenAI's bot.

The searchable PDFs get searched, and the just-pictures-of-words ones get fed through their (quite good, IMHO) OCR.

I use it all the time. It's remarkably good for locating the details I need in the poorly-organized ~1,200 page factory manual for my Honda.

(Well, it's not necessarily organized poorly. It's just designed with the clear intent that it is mostly to serve as a set of repair instructions, and sometimes I don't want repair instructions. Sometimes I want to know how a thing works for my own cognitive benefit instead of how diagnose and R&R it as a series of steps.)

cyberax•1h ago
I'm using paperless-ngx for personal document management, and Claude Desktop was able to read and OCR all the PDFs there just fine (through an MCP connector).

It also was able to parse my tax forms in 3 languages.

barrister•2h ago
Seems to be a weak pitch for an Israeli startup called Factify. Their new document type is also closed sourced which seems like an obvious showstopper for a ubiquitous global document replacement, especially in today's extremely heated and untrustworthy environment.

No strong argument imo for replacing the pdf.

g947o•32m ago
It weirdly reminds me of SynthID/C2PA. At the end of the day, they matter very little. People are going to do what they want to do.

If people want to manage version/access etc, they are going to do it right the first time with existing document format and permission control mechanism, ranging from "making rhe document only accessible to certain users" to "have someone read a document in a specific room", which has worked reasonably well.

cratermoon•1h ago
There are PDF files and there are PDF files. Many (most?) PDFs I run into are generated from Microsoft Word or some other MS product with no structure at all. The majority of people use MS products don't understand or care about structure. The WYSIWYG imperative means lots of markup to describe font size, color, and decoration, to make every section heading look the same without ever designating the text as a section head. The same happens with paragraphs, page breaks, and column flow. The resulting document looks correct enough to the creator. Other people who have a different version of Word, different fonts, and a thousand other little differences, won't see it correctly. That leads our author to generate a PDF, probably with embedded fonts, to ensure uniform appearance across these thousand little exceptions.

The result is a document with the content mixed up so incomprehensibly with appearance controls as to be both unreadable and without any residue of the underlying intended structure of the document's sections, headers, figures, paragraphs, captions, footnotes, or anything.

And then there's PDF files which are nothing more than a series of images of pages of text. If you're lucky and the scans are clean a good OCR might be able to recover most of the content.

What I'm saying is, it doesn't matter the tool, if authors don't encode structure and formatting in semantically meaningful ways.

tpm•1h ago
So what you are actually saying is that there is a market for a tool that will recreate the PDF with a structure based on how the original PDF looks?
cratermoon•1h ago
The market has been needing a tool like that for 30 years. A PDF document of the type I describe is like a broken egg. Information is lost between the authoring and rendering, to the extent that it's not clear recreating the original is even possible.
pessimizer•1h ago
A typesetter could recreate the document through looking at it, doing some font research, and playing with the kerning for a while. Saying it's not possible to recreate a typeset document that is readable is absurd, no matter how twisted and insane the actual postscript is.
ur-whale•1h ago
https://archive.is/aCleq
Gualdrapo•1h ago
Makes me remember of this, which was posted a few days ago here in HN:

https://scottlocklin.wordpress.com/2023/05/31/djvu-and-its-c...

pessimizer•1h ago
The war against pdfs is based on AI being too stupid to read them? That's a condemnation of AI, not pdfs. I, a natural intelligence, can easily read pdfs.
Cheyana•46m ago
Perfect response.
lsbehe•1h ago
I'll miss getting documentation as a pile of pictures in a PDF.
maxloh•1h ago
For context, here is the startup's website: https://www.factify.com/. The site consists of only two main pages: the landing page and a "careers" section.

Based on the site, the service appears to be little more than a document hosting platform with tracking features, such as monitoring who copied the document and the specific paragraphs they selected. They’ve intentionally omitted a download feature to prevent access to outdated versions, but otherwise, the experience seems no different from an ordinary PDF reader.

There is no mention of a "new standard" on their front page. I suspect they don't actually convert the documents. They likely just convert pages to encrypted images and use client-side rendering for text elements to allow for selection and copying.

sghaz•57m ago
This looks like an sponsored article. Very poor quality.
g947o•37m ago
My biggest gripes:

* you cannot easily view a PDF in dark mode. Solutions do exist, but there are always some limitations

* poor experience reading on mobile device (mentioned in the article). You can use "Reflow" features provided by Acrobat or similar tools, but they often don't work offline, not to mention Acrobat is bloated and filled with dark patterns that trick you into buying a subscription