Hello HN,

I’m the founder of PDFClear (https://www.pdfclear.com). It’s a suite of PDF tools (merge, split, compress, etc.) that runs entirely in the browser. I built this because I was tired of Googling "merge pdf" and landing on sites that require me to upload sensitive bank statements or contracts to an unknown server. I wanted a tool where the file never leaves the device.

The Tech Stack:

The app is built with React and Vite, but the heavy lifting is done via WebAssembly and Web Workers to keep the UI thread responsive.

- PDF Manipulation: I’m using pdf-lib for standard operations (merge, split, rotate).

- Compression & Encryption: For heavier tasks like compressing streams or handling encryption/decryption, I compiled QPDF to WebAssembly (qpdf-wasm).

- OCR: Scanned documents are processed client-side using Tesseract.js.

Local AI (The New Part):

I recently added Semantic Search and Summarization without relying on OpenAI/Anthropic APIs.

- It uses Transformers.js to run ONNX models directly in the browser.

- Search: Embeddings come from one of several models (including nomic-ai/nomic-embed-text-v1.5 and Xenova/GIST-small-Embedding-v0). The app chunks the text, stores the vectors in IndexedDB (via idb-keyval), and ranks chunks against the query by cosine similarity, all locally.

- Summarization: Uses onnx-community/text_summarization-ONNX (quantized) running in a Web Worker.
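
For anyone curious what the search step looks like once the embeddings exist, here is a minimal sketch of chunking plus cosine-similarity ranking. It is not the code PDFClear ships: the `StoredChunk` shape, the function names, and the chunk and overlap sizes are all assumptions made for illustration, and the vectors are taken as already computed (by Transformers.js or anything else).

```typescript
// Hypothetical record shape; the post does not describe the real IndexedDB schema.
interface StoredChunk {
  text: string;
  vector: number[];
}

// Split text into overlapping windows of words.
// The default sizes are illustrative assumptions, not values from the post.
function chunkText(text: string, wordsPerChunk = 200, overlap = 40): string[] {
  if (overlap >= wordsPerChunk) {
    throw new Error("overlap must be smaller than wordsPerChunk");
  }
  const words = text.split(/\s+/).filter((w) => w.length > 0);
  const chunks: string[] = [];
  const step = wordsPerChunk - overlap;
  for (let start = 0; start < words.length; start += step) {
    chunks.push(words.slice(start, start + wordsPerChunk).join(" "));
    // Stop once the current window has reached the end of the text.
    if (start + wordsPerChunk >= words.length) break;
  }
  return chunks;
}

// Cosine similarity: dot(a, b) / (|a| * |b|). Returns 0 for a zero vector.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("vector length mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom;
}

// Score every stored chunk against the query vector and keep the best k.
function topK(
  query: number[],
  chunks: StoredChunk[],
  k = 3
): { text: string; score: number }[] {
  return chunks
    .map((c) => ({ text: c.text, score: cosineSimilarity(query, c.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}
```

A brute-force scan like this is linear in the number of chunks, which is usually fine for a single document; a larger corpus would likely want an index instead.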

Privacy:

Because everything runs client-side, no documents are uploaded to my server. You can verify this by inspecting the Network tab. Once the app loads (and the AI models are cached), it works fully offline.

I’d love your feedback on the performance of the local AI models, specifically on older devices.

Show HN: PDFClear – Browser-based PDF tools with local AI (WASM+Transformers.js)