frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: LaTeX → structured ArXiv data for scientific RAG

https://sciencestack.ai/
2•cjlooi•1d ago

Comments

cjlooi•1d ago
PDF-based pipelines are fundamentally lossy and compute-heavy—whether they rely on OCR, GROBID, or LLM-based parsing. They're simply not good enough for accurate, scientific agents at scale.

To fix this, I'm launching ScienceStack API: a lossless, node-based API for scientific papers with LaTeX source, starting with arXiv.

It currently covers 150k+ arXiv papers, mainly in CS, Math, and Physics.

Every paper also ships with a WYSIWYG interactive reader at sciencestack.ai/paper/{arxivId}. Example: https://www.sciencestack.ai/paper/2512.24601v1

I’m giving away 5× 3-month Pro keys to early commenters who are building in this space (scientific tooling, agents, copilots, RAG etc). I’d love to hear what you’re working on

Nvidia Brute-Force Bubble: Why 90% of Physics AI Compute Is a Mathematical Waste

https://github.com/isaac-sim/IsaacSim/discussions/394
1•ZuoCen_Liu•58s ago•0 comments

Retirement of Microsoft Lens

https://support.microsoft.com/en-us/topic/retirement-of-microsoft-lens-fc965de7-499d-4d38-aeae-f6...
1•toomuchtodo•2m ago•0 comments

Y2K bug delayed my honeymoon by 17 years

https://www.theregister.com/2026/01/02/on_call/
1•CHB0403085482•2m ago•0 comments

Distinct AI Models Seem to Converge on How They Encode Reality

https://www.quantamagazine.org/distinct-ai-models-seem-to-converge-on-how-they-encode-reality-202...
1•sonabinu•3m ago•0 comments

Deep sequence models memorize atomic facts "geometrically"

https://bsky.app/profile/vaishnavh.bsky.social/post/3mbwt77arv22x
1•neehao•3m ago•0 comments

Code review of vibe coded HTML parser translations

https://felix.dognebula.com/art/html-parsers-in-portland.html
1•nicoburns•4m ago•0 comments

The Berry That Ferments Itself

https://fruitwine.substack.com/p/the-berry-that-ferments-itself
1•djrivard•8m ago•0 comments

Show HN: CallMe – Minimal plugin that lets Claude Code call you on the phone

https://github.com/ZeframLou/call-me
1•zefram_l•8m ago•0 comments

Show HN: Fast (0.5 GB/SEC) dedup utility for the era of LLMs written in C23

https://github.com/ThirdLetterC/corpus-dedup
1•yehors•9m ago•0 comments

Ask HN: Why isn't AI spawning profitable indie games?

1•eveningsun•9m ago•0 comments

Taming the Tart: Malolactic Fermentation Strategies for Superfruit Wines

https://fruitwine.substack.com/p/taming-the-tart-malolactic-fermentation
1•djrivard•9m ago•0 comments

The Tailwind Debacle

https://njump.me/naddr1qqtk7m3dw35x2tt5v95kcamfdejz6er9vfskxmr9qgsvhgf6d6s4qykqn9qfykf5tu6vw6smnd...
1•andunie•9m ago•0 comments

Why I Left iNaturalist

https://kueda.net/blog/2026/01/06/why-i-left-inat/
1•erutuon•9m ago•0 comments

Writing an LLM from scratch, part 30 – digging into the LLM-as-a-judge results

https://www.gilesthomas.com/2026/01/20260109-llm-from-scratch-30-digging-into-llm-as-a-judge
1•gpjt•10m ago•0 comments

FFTW: Fastest Fourier Transform in the West

http://fftw.org/
1•Anon84•11m ago•0 comments

Researchers craft new recipe for groundbreaking alcohol studies

https://medicalxpress.com/news/2025-12-lab-rigor-real-life-craft.html
1•PaulHoule•12m ago•0 comments

Show HN: Fzf-navigator, a terminal file system navigator

https://github.com/benward2301/fzf-navigator
1•benward2301•13m ago•0 comments

Myths about Logitech Developer ID certificate expiration

https://lapcatsoftware.com/articles/2026/1/2.html
1•frizlab•15m ago•0 comments

Working memory for Claude Code – persistent context and multi-instance coord

https://github.com/GMaN1911/claude-cognitive
1•bochoh•18m ago•1 comments

Framework Lock: From 10-38 to Revolutionary

https://zenodo.org/records/18179143
1•andreguzzon•22m ago•1 comments

What's on HTTP?

https://whatsonhttp.com/
1•elixx•24m ago•1 comments

Tumblr removed from Apple App Store over abuse images (2018)

https://www.bbc.com/news/technology-46275138
49•dmschulman•29m ago•11 comments

NASA ends space mission early due to astronaut medical condition

https://www.bbc.com/news/articles/cd9e2y7nkv8o
1•DarkContinent•33m ago•0 comments

Jane Street's Ron Minsky on the Future of Programming (2023)

https://signalsandthreads.com/future-of-programming/
2•weinzierl•36m ago•0 comments

Iran Goes Dark as Government Cuts Itself Off from Internet

https://www.kentik.com/analysis/iran-goes-dark-as-government-cuts-itself-off-from-internet/
1•m-hodges•37m ago•1 comments

Scientists Create Robots Smaller Than a Grain of Sand

https://www.wsj.com/science/scientists-create-robots-smaller-than-a-grain-of-sand-c3081fd0
1•Bostonian•37m ago•1 comments

Securely sending query parameters in HTTP headers

https://github.com/dickhardt/redirect-headers
1•mooreds•38m ago•0 comments

Waymo getting a ticket. It drove off with the ticket on the windshield

https://old.reddit.com/r/Austin/comments/1q7t4e4/waymo_getting_a_ticket_while_i_was_inside_it/
2•m-hodges•41m ago•0 comments

iOS 26 Shows Unusually Slow Adoption Months After Release

https://www.macrumors.com/2026/01/08/ios-26-shows-unusually-slow-adoption/
3•latexr•45m ago•5 comments

Study casts doubt on potential for life on Europa

https://www.reuters.com/science/study-casts-doubt-potential-life-jupiters-moon-europa-2026-01-06/
3•paulpauper•46m ago•0 comments