With the rise of vision-language models (VLMs) such as Qwen-VL and GPT-4.1, new end-to-end OCR models like DeepSeek-OCR have emerged. These models jointly understand visual and textual information, so they can interpret PDFs directly, without an explicit layout-detection step.
However, this paradigm shift raises an important question:
If a VLM can already process both the document images and the query to produce an answer directly, do we still need the intermediate OCR step?
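To make the contrast concrete, here is a minimal sketch of the direct path: render a PDF page to an image and ask the VLM the question in one step, with no OCR in between. It assumes the Hugging Face transformers interface for Qwen2-VL and pdf2image for rendering; the model checkpoint, file name, and query are illustrative placeholders, not a fixed recipe.

```python
import torch
from pdf2image import convert_from_path
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

MODEL_ID = "Qwen/Qwen2-VL-7B-Instruct"  # illustrative; any chat-tuned VLM checkpoint

processor = AutoProcessor.from_pretrained(MODEL_ID)
model = Qwen2VLForConditionalGeneration.from_pretrained(
    MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
)

# Render the first page of the PDF to an image -- no layout detection,
# no intermediate text extraction.
page = convert_from_path("report.pdf", dpi=200)[0]

# The page image and the query go into a single multimodal prompt.
messages = [{
    "role": "user",
    "content": [
        {"type": "image"},
        {"type": "text", "text": "What is the total amount due on this page?"},
    ],
}]
prompt = processor.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

inputs = processor(text=[prompt], images=[page], return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=128)

# Decode only the newly generated tokens, dropping the echoed prompt.
answer = processor.batch_decode(
    output_ids[:, inputs.input_ids.shape[1]:], skip_special_tokens=True
)[0]
print(answer)
```

The classical pipeline would answer the same query in two stages, layout detection plus OCR to extract text, then an LLM over that text, and it is exactly that intermediate stage the question above puts in doubt.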