frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Ask HN: Why are PDFs so hard to edit?

5•superconduct123•5h ago
What is it about the underlying format that makes it so difficult to edit a PDF

Comments

k310•4h ago
There's a pretty decent explanation here:

https://mailmergic.com/blog/why-pdf-are-hard-to-edit/

The most compelling tidbit I found was this:

> The Technical Architecture of PDF: A Labyrinth of Objects

> Beneath the surface, PDF files are complex compositions made up of objects: text blocks, images, vectors, fonts, metadata, and instructions for rendering. These elements are often stored in fragmented sequences that are optimized for viewing rather than editing. The text is not always stored in logical reading order, and words may be divided into separate character objects placed precisely on the page based on coordinates.

Lots more there. No more spoilers.

PaulHoule•4h ago
Maybe 10 years ago I was a student of file formats and I actually liked PDF as it had a clear theory of how you serialize a graph of objects. It's more like the old Microsoft Word format or the current DOCX and much better than the atrocious PSD format. PDF is a good format for one developed in the 1990s for what it was intended to do.
necovek•4h ago
Because it was designed as a graphical output format, not an editable format.

Some of the "compression" tricks it allows one to use (eg. font subsetting, even remapping characters to use fewer bits to encode text) may make the data only keep the same appearance, and semantic encoding would be gone (for example, "A" may stand for "#").

It's actually quite similar in nature to TeX's DVI format (boxes and their positions), though obviously not a bitmap format but a vector one with all the deps embedded.

This means that, for instance, using non-default kerning and whitespace will lead to all text becoming box-per-character thrown around the page.

superconduct123•3h ago
I see, so its like a lower level format than say a word doc or markdown
fuzzfactor•4h ago
>Why are PDFs so hard to edit?

This is by design.

IIRC the original objective was to require a costly proprietary program from Adobe called "Acrobat" to create the file to begin with, and it was intended not to be edited. Rather it was supposed to be readable and printable with good consistency between PCs and Macs.

"Acrobat Reader" has always been free, to help popularize the format and make sure that anybody could open and read the file. But no editing for you the user. And the "publishers" who routinely generated the early PDFs using the full Acrobat suite wanted to distribute documents for people to trust that they had not been edited from the source. At least not as easily as a Word DOC file could be edited.

Introduction to the A* Algorithm

https://www.redblobgames.com/pathfinding/a-star/introduction.html
1•auraham•1m ago•0 comments

10-HarmonyOS5-TextProcessingEntity-Case

1•zhousg•5m ago•0 comments

09-TextProcessing-WordSegment-Case

1•zhousg•6m ago•0 comments

Show HN: Open-Source Emoji Economy with Multi-Species Governance for Co-Creation

https://github.com/BP-H/emoji/blob/main/emoji_engine.py
1•supernova_2177•6m ago•0 comments

Python removes "experimental" tag from the "nogil" free-threaded Python

https://discuss.python.org/t/pep-779-criteria-for-supported-status-for-free-threaded-python/84319?page=7
1•heinrichhartman•14m ago•0 comments

Short Ruby Newsletter – edition 140

https://newsletter.shortruby.com/p/edition-140
1•amalinovic•18m ago•0 comments

IonQ's Accelerated Roadmap: Turning Quantum Ambition into Reality

https://ionq.com/blog/ionqs-accelerated-roadmap-turning-quantum-ambition-into-reality
1•donutloop•18m ago•0 comments

Show HN: I built a tool to help you sell Digital Downloads via Stripe

https://downloadpage.io/
2•ben_makes_stuff•19m ago•0 comments

SeedancePro AI: Effortlessly Convert Text and Images to Video

https://www.seedancepro.net
1•cy2025421•20m ago•1 comments

How bot detection misfires on non-mainstream browsers and privacy tools

https://blog.castle.io/how-bot-detection-misfires-on-non-mainstream-browsers-and-privacy-tools/
2•avastel•21m ago•1 comments

Telegram Messenger's Ties to Russia's FSB Revealed in New Report

https://www.newsweek.com/telegram-messenger-russia-fsb-ties-report-2083491
1•bobek•21m ago•0 comments

Show HN: Paygo.network – no subscription AI tools

https://paygo.network
1•robbsch•26m ago•0 comments

End-to-End Encryption: Architecturally Necessary

https://labs.ripe.net/author/flindeberg/end-to-end-encryption-architecturally-necessary/
2•Bogdanp•26m ago•0 comments

AI Is Changing Work–and Indie Devs Feel It First

https://plumshell.com/2025/06/17/ai-is-changing-work-and-indie-devs-feel-it-first/
1•NonUmemoto•26m ago•0 comments

Gemini Flash 2.5, Imagen 4 and Veo 2 Chaining for Multi-modal Characters

https://musings-mr.net/post/chaining-for-consistency-prompt-structured-data-image-video
1•mrkiouak•28m ago•1 comments

Years of inactivity in "Pay or OK" cases: noyb sues German DPAs

https://noyb.eu/en/years-inactivity-pay-or-ok-cases-noyb-sues-german-dpas
2•ZacnyLos•28m ago•0 comments

Air India: Is There a Safest Seat in a Plane Crash?

https://www.nytimes.com/2025/06/13/travel/safest-seat-airplane.html
1•sizzle•29m ago•1 comments

Nodepass: Secure, efficient TCP/UDP tunneling solution

https://github.com/yosebyte/nodepass
1•thunderbong•29m ago•0 comments

East German Uprising of 1953

https://en.wikipedia.org/wiki/East_German_uprising_of_1953
1•mkesper•29m ago•0 comments

Show HN: I created a Chrome Extension that improves X/Twitter

https://www.getreplyguy.com/
1•Tjerkienator•32m ago•0 comments

Show HN: Zentra – The AI Travel Planner

https://zentra.hridya.tech
1•hridyaagrawal•34m ago•0 comments

Making Playpen Sans

https://www.type-together.com/making-playpen-sans
2•sandbach•35m ago•0 comments

Show HN: A boilerplate for Kotlin Multiplatform to launch Android and iOS apps

https://www.kmpship.app/
2•VivienMahe•35m ago•0 comments

Pentagon pizza monitor predicted 'busy night' ahead of Israel's attack on Iran

https://www.theguardian.com/world/2025/jun/13/pentagon-pizza-delivery-israel-iran-attack
2•helsinkiandrew•38m ago•1 comments

Phkmalloc

https://phk.freebsd.dk/sagas/phkmalloc/
1•panic•40m ago•0 comments

The Fediverse: A New Era of Social Media

https://www.youtube.com/watch?v=YRJHIJy5Nno
2•doener•45m ago•0 comments

The 'Superlative' Injunction: India's Pirate Site Blockades Go Next Level

https://torrentfreak.com/the-superlative-injunction-indias-pirate-site-blockades-go-next-level/
1•gslin•46m ago•0 comments

Show HN: Taiyaki – AI-generated 3D jewelry from photos and sketches

https://taiyaki.ai/
2•eleelenawa•46m ago•1 comments

Show HN: AI Text to Music Generator

https://text2music.net
2•tomh88•47m ago•0 comments

John Carmack at Upper Bound 2025 [video]

https://www.youtube.com/watch?v=rQ-An5bhkrs
3•tosh•49m ago•0 comments