frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Nano PDF – A CLI Tool to Edit PDFs with Gemini's Nano Banana

https://github.com/gavrielc/Nano-PDF
32•GavCo•1h ago
The new Gemini 3 Pro Image model (aka Nano Banana) is incredible at generating slides, so I thought it would be fun to build a CLI tool that lets you edit PDF presentations using plain English. The tool converts the page you want to edit into an image, sends it to the model API together with your prompt to generate an edited image, then converts the updated image back and stitches into the original document.

Examples:

- `nano-pdf edit deck.pdf 5 "Update the revenue chart to show Q3 at $2.5M"`

- `nano-pdf add deck.pdf 15 "Create an executive summary slide with 5 bullet points"`

Features:

- Edit multiple pages in parallel

- Add entirely new slides that match your deck's style

- Google Search enabled by default so the model can look up current data

- Preserves text layer for copy/paste and search

It can work with any kind of PDF but I expect it would be most useful for a quick edit to a deck or something similar.

GitHub: https://github.com/gavrielc/Nano-PDF

Comments

lxe•51m ago
This is nuts and I absolutely love this. So you convert the PDF into image, edit the image, then convert the image back into a PDF.
treetalker•41m ago
I'd love to see clearer examples: a video, or original pdf / command / result pdf. Very cool!
sultson•37m ago
How cool! It's frustrating how tedious many PDF workflows still are. I've been building something similar in this space[0], but web-based where you visually specify the area to edit. The biggest issue for now is the cost per edit as the Pro version amounts to roughly $0.15/image. However, with some finessing, the original Nano Banana seems to do a great job as well. Have you explored UI-based approaches yourself by any chance?

[0]https://docusera.com/

itsmevictor•27m ago
Very nice! I wonder whether that could be used to get LLMs to annotate pdfs. Say an "agentic" CLI like Claude Code or Gemini-cli reviews a pdf and finds typos, could it use this to annotate the pdf like underlining them in red or something of that sort? That could be nice.
mentalgear•27m ago
Nice - but consider adding an animated screengrap like: https://github.com/pythops/oryx
tecoholic•22m ago
> Converts an image to a single-page PDF with a hidden text layer using Tesseract. This is the 'State Preservation' step.

Does this mean the text only pdf page is transformed into an image that covers the full page, but the text is still under there. So, any machine based extraction would still get the text, but would probably loose all the bounding box information and regular users cannot just use their mouse to select text anymore?

Show HN: Nano PDF – A CLI Tool to Edit PDFs with Gemini's Nano Banana

https://github.com/gavrielc/Nano-PDF
32•GavCo•1h ago•6 comments

Show HN: Network Monitor – a GUI to spot anomalous connections on your Linux

58•grigio•5d ago•18 comments

Show HN: Explore what the browser exposes about you

https://neberej.github.io/exposedbydefault/
198•coffeecoders•5d ago•69 comments

Show HN: Zero-power photonic language model–code

https://zenodo.org/records/17764289
5•damir00•3h ago•3 comments

Show HN: No Environment Setups Anymore

https://www.gitarsenal.dev/
2•rohan2003•1h ago•1 comments

Show HN: Mu – The Micro Network

https://github.com/asim/mu
54•asim•5d ago•37 comments

Show HN: Choose your own adventure style Presentation

https://github.com/Skarlso/adventure-voter
53•skarlso•1w ago•9 comments

Show HN: I built Magiclip – an all-in-one AI studio

https://magiclip.io/
37•kokau•9h ago•18 comments

Show HN: Glasses to detect smart-glasses that have cameras

https://github.com/NullPxl/banrays
491•nullpxl•1d ago•186 comments

Show HN: An LLM-Powered Tool to Catch PCB Schematic Mistakes

https://netlist.io/
48•wafflesfreak•1d ago•27 comments

Show HN: Auth Agent – Let AI Agents Log In Without Human Credentials

https://github.com/auth-agent/auth-agent
3•hkpatel•5h ago•0 comments

Show HN: Lifetime Black Friday Deals (Mega List)

https://www.blackfridaydeals.directory
4•bfdd•5h ago•0 comments

Show HN: Pulse 2.0 – Live co-listening rooms where anyone can be a DJ

https://473999.net/pulse
81•473999•1d ago•30 comments

Show HN: Claude Artifact: P2P Coin Toss with hashed commitment scheme

https://claude.ai/public/artifacts/ab43ea56-ba14-4c4e-a96b-d644e2535509
2•guiltyf•6h ago•0 comments

Show HN: MacGlow – macOS app to sync brightness across Mac and all Monitors

https://www.lovi.sh/macglow
4•lovish888•7h ago•2 comments

Show HN: KiDoom – Running DOOM on PCB Traces

https://www.mikeayles.com/#kidoom
355•mikeayles•3d ago•49 comments

Show HN: Horizontal Cylinder Chess

https://deanalvero.github.io/horizontal-cylinder-chess/
3•deanalvero•8h ago•0 comments

Show HN: oeis-tui – A TUI to search OEIS integer sequences in the terminal

https://github.com/hako/oeis-tui
2•wesleyhill•8h ago•0 comments

Show HN: Self-hosted RAG for docs and code (FastAPI, Docling, ChromaDB)

https://github.com/2dogsandanerd/Knowledge-Base-Self-Hosting-Kit
3•2dogsanerd•9h ago•1 comments

Show HN: I built an interactive HN Simulator

https://news.ysimulator.run/news
533•johnsillings•5d ago•214 comments

Show HN: I built 19 AI agents because one wasn't enough to coach my workouts

https://arvo.guru
2•danielepelleri•12h ago•2 comments

Show HN: Spikelog – A simple metrics service for scripts, cron jobs, and MVPs

https://spikelog.com
34•dsmurrell•2d ago•17 comments

Show HN: DB Pro – A Modern Desktop Client for Postgres, MySQL, SQLite and LibSQL

https://www.dbpro.app/
28•upmostly•1d ago•10 comments

Show HN: MkSlides – Markdown to slides with a similar workflow to MkDocs

https://github.com/MartenBE/mkslides
75•MartenBE•2d ago•15 comments

Show HN: TimeTracker – Self-hosted time tracking with invoicing (120 features)

https://timetracker.drytrix.com/
4•DRYTRIX•13h ago•1 comments

Show HN: Runprompt – run .prompt files from the command line

https://github.com/chr15m/runprompt
129•chr15m•2d ago•46 comments

Show HN: Era – Open-source local sandbox for AI agents

https://github.com/BinSquare/ERA
61•gregTurri•2d ago•18 comments

Show HN: SyncKit – Offline-first sync engine (Rust/WASM and TypeScript)

https://github.com/Dancode-188/synckit
86•danbitengo•2d ago•36 comments

Show HN: I turned algae into a bio-altimeter and put it on a weather balloon

https://radi8.dev/blog/stratospore/
140•radeeyate•1w ago•13 comments

Show HN: Safe-NPM – only install packages that are +90 days old

https://github.com/kevinslin/safe-npm
89•kevinslin•5d ago•64 comments