frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

The youngest participant in the 2025 FIDE World Cup

https://www.chess.com/news/view/2025-fide-world-cup-round-1-tiebreaks
1•wslh•46s ago•0 comments

Michael Burry is going all in on shorting the bubble

https://old.reddit.com/r/wallstreetbets/comments/1ont093/michael_burry_is_going_all_in_on_shortin...
1•greesil•1m ago•0 comments

Fear of desynchronization: Why doesn't Europe abolish daylight saving time?

https://english.elpais.com/society/2025-10-24/fear-of-desynchronization-why-doesnt-europe-abolish...
2•PaulHoule•2m ago•0 comments

Show HN: Chess960v2 – Stockfish tournament with different starting positions

https://chess960v2.com/en
2•lavren1974•4m ago•1 comments

A Project Is Not a Bundle of Tasks

https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks
2•walterbell•6m ago•0 comments

Giscus: A comments system powered by GitHub Discussions

https://giscus.app/
1•thunderbong•6m ago•0 comments

Five Moments in the History of Chinese Cybernetics

https://thereader.mitpress.mit.edu/five-moments-in-the-history-of-chinese-cybernetics/
4•billybuckwheat•11m ago•0 comments

I vibe coded a simulation of San Francisco

https://www.sfsimulator.com
1•greenfish6•12m ago•0 comments

Info Terkini Dari WhatsApp Yang Resmi

1•jaringan•20m ago•1 comments

Ladwp Says It Will Shift Its Largest Gas Power Plant to Hydrogen

https://www.latimes.com/environment/story/2025-10-29/ladwp-approves-controversial-hydrogen-conver...
1•m463•21m ago•0 comments

The Morals of Chess (1786)

https://americanliterature.com/author/benjamin-franklin/essay/the-morals-of-chess
1•gregsadetsky•22m ago•0 comments

Dictionary.com's 2025 Word of the Year Is "6-7"

https://www.dictionary.com/e/word-of-the-year-2025/
1•surprisetalk•23m ago•0 comments

GLP-1s Reverse Prediabetes in 95% and Aid Major Weight Loss

https://twitter.com/lindayax/status/1985438416364867659
2•dsr12•28m ago•0 comments

Security Vulnerabilities in AI-Powered Web Browsers

https://blog.lukaszolejnik.com/security-vulnerabilities-in-ai-powered-web-browsers/
3•jruohonen•28m ago•0 comments

Crypto Purity Test

https://cryptopuritytest.org/
1•salkahfi•40m ago•0 comments

Playing Around with ARM Assembly

https://blog.nobaralabs.com/posts/playing-with-arm-asm
1•orionfollett•41m ago•0 comments

Carbon Cub UL flies to 37609 ft using Rotax 916 iS engine

https://www.youtube.com/watch?v=2XRBOIjncU0
2•burnt-resistor•45m ago•1 comments

LIGO's Dual Detectors

https://www.ligo.caltech.edu/page/ligo-detectors
2•duck•49m ago•0 comments

Show HN: I gamified passwords to make them easier to remember

https://passwordgame.apps.benthayer.com/
1•benthayer•50m ago•0 comments

Tabsdata: Pub/Sub for Tables – A new foundation for enterprise data [video]

https://www.youtube.com/watch?v=qCZIRC9khmA
1•teleforce•53m ago•0 comments

Why I don't see many advantages in decentralized platforms

1•DeveloperOne•59m ago•1 comments

High-Performance Syntax Highlighting with CSS Highlights API

https://pavi2410.com/blog/high-performance-syntax-highlighting-with-css-highlights-api/
2•OuterVale•1h ago•0 comments

Are microfrontends still worth the complexity for most teams? [video]

https://www.youtube.com/watch?v=RxVuiccZJU8
1•evergreenxx•1h ago•1 comments

Vanilla JavaScript

http://vanilla-js.com/
5•gurjeet•1h ago•2 comments

Show HN: Generate coherent, synthetic data at scale

https://github.com/ds-horizon/datagen
2•darshanime•1h ago•0 comments

PhET Interactive Simulations

https://phet.colorado.edu/en/simulations/browse
1•gregsadetsky•1h ago•0 comments

34123

https://news.ycombinator.com/submit
1•qravitas•1h ago•0 comments

Otto Nemenz, Supplier and Designer of Cameras and Lenses for Hollywood, Dies

https://www.hollywoodreporter.com/movies/movie-news/otto-nemenz-dead-cameras-lenses-hollywood-123...
2•Marshferm•1h ago•0 comments

Is it worrying that 95% of AI enterprise projects fail?

https://www.seangoedecke.com/why-do-ai-enterprise-projects-fail/
2•zdw•1h ago•2 comments

Implementing Soft Deletion in Prisma with Client Extensions

https://matranga.dev/true-soft-deletion-in-prisma-orm/
1•frankmatranga•1h ago•1 comments
Open in hackernews

Vectorless, Vision-Based RAG

https://colab.research.google.com/github/VectifyAI/PageIndex/blob/main/cookbook/vision_RAG_pageindex.ipynb
4•page_index•6h ago

Comments

page_index•6h ago
In modern document question answering (QA) systems, OCR serves an important role by converting PDF pages into text that can be processed by Large Language Models (LLMs). The resulting text can provide contextual input that enables LLMs to perform question answering over document content.

Traditional OCR systems typically use a two-stage process that first detects the layout of a PDF — dividing it into text, tables, and images — and then recognizes and converts these elements into plain text. With the rise of vision-language models (VLMs) (such as Qwen-VL and GPT-4.1), new end-to-end OCR models like DeepSeek-OCR have emerged. These models jointly understand visual and textual information, enabling direct interpretation of PDFs without an explicit layout detection step.

However, this paradigm shift raises an important question:

> If a VLM can already process both the document images and the query to produce an answer directly, do we still need the intermediate OCR step?

We build a practical vectorless, vision-based question-answering implementation for long documents, without relying on OCR. Specifically, we adopt a vectlorless, reasoning-based retrieval layer and the multimodal GPT-4.1 as the VLM for visual reasoning and answer generation.