frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Use Claude Code to Query 600 GB Indexes over Hacker News, ArXiv, etc.

https://exopriors.com/scry
21•Xyra•1h ago
Paste in my prompt to Claude Code with an embedded API key for accessing my public readonly SQL+vector database, and you have a state-of-the-art research tool over Hacker News, arXiv, LessWrong, and dozens of other high-quality public commons sites. Claude whips up the monster SQL queries that safely run on my machine, to answer your most nuanced questions.

There's also an Alerts functionality, where you can just ask Claude to submit a SQL query as an alert, and you'll be emailed when the ultra nuanced criteria is met (and the output changes). Like I want to know when somebody posts about "estrogen" in a psychoactive context, or enough biology metaphors when talking about building infrastructure.

Currently have embedded: posts: 1.4M / 4.6M comments: 15.6M / 38M That's with Voyage-3.5-lite. And you can do amazing compositional vector search, like search @FTX_crisis - (@guilt_tone - @guilt_topic) to find writing that was about the FTX crisis and distinctly without guilty tones, but that can mention "guilt".

I can embed everything and all the other sources for cheap, I just literally don't have the money.

Comments

bugglebeetle•14m ago
Seems very cool, but IMO you’d be better off doing an open source version and then hosted SAAS.
7777777phil•8m ago
Really useful currently working on a autonomous academic research system [1] and thinking about integrating this. Currently using custom prompt + Edison Scientific API. Any plans of making this open source?

[1] https://github.com/giatenica/gia-agentic-short

barishnamazov•6m ago
I like that this relies on generating SQL rather than just being a black-box chat bot. It feels like the right way to use LLMs for research: as a translator from natural language to a rigid query language, rather than as the database itself. Very cool project!

Hopefully your API doesn't get exploited and you are doing timeouts/sandboxing -- it'd be easy to do a massive join on this.

I also have a question mostly stemming from me being not knowledgeable in the area -- have you noticed any semantic bleeding when research is done between your datasets? e.g., "optimization" probably means different things under ArXiv, LessWrong, and HN. Wondering if vector searches account for this given a more specific question.

nineteen999•5m ago
That's just not a good use of my Claude plan. If you can make it so a self-hosted Lllama or Qwen 7B can query it, then that's something.
mentalgear•4m ago
Nice, but would you consider open-sourcing it? I (and I assume others) are not keen on sharing my API keys with a 3rd party.

Disney will pay $10M to settle children's data privacy lawsuit

https://www.bleepingcomputer.com/news/security/disney-will-pay-10m-to-settle-claims-of-childrens-...
1•fleahunter•2m ago•0 comments

Unexpected Surprise: Windows 11 Outperforming Linux on an Intel Arrow Lake H

https://www.phoronix.com/review/windows-beats-linux-arl-h
1•oshanz•3m ago•0 comments

Several ways is which software can be surprisingly slow

https://gregoryszorc.com/blog/2021/04/06/surprisingly-slow/
1•fanf2•3m ago•0 comments

What is the best comment in source code you have ever encountered? (2011)

https://stackoverflow.com/questions/184618/what-is-the-best-comment-in-source-code-you-have-ever-...
2•chistev•14m ago•1 comments

Why isn't Google showing my site's logo in search results?

https://www.google.com/search?q=site%3Avect.pro&oq=&gs_lcrp=EgZjaHJvbWUqCQgAECMYJxjqAjIJCAAQIxgnG...
2•WoWSaaS•16m ago•3 comments

Would you still celebrate New Year's Day, if it wasn't a day off?

2•olek•19m ago•0 comments

Making end-to-end encrypted AI chat feel like logging in

https://confer.to/blog/2025/12/passkey-encryption/
1•Vinnl•21m ago•0 comments

Execute Code in a Remote Python Process

https://rawomb.at/posts/python-remote-exec/
2•lumpa•24m ago•0 comments

GitHub – tomasf/Cadova: Swift DSL for parametric 3D modeling

https://github.com/tomasf/Cadova
2•bdcravens•28m ago•0 comments

2026 budget reset: I Converted official tax/fee schedules into wallet-level math

https://www.thepricer.org/how-much-will-new-2026-laws-cost-you/
2•jimmyhamdriks•31m ago•1 comments

Sora2 API – Sora 2 Video Generation API

https://sora2-api.com
4•paidx•32m ago•1 comments

The rise of industrial software

https://chrisloy.dev/post/2025/12/30/the-rise-of-industrial-software
23•chrisloy•36m ago•3 comments

Can two Amazons survive? Invisible e-waste is poisoning the world

https://news.mongabay.com/2025/12/can-two-amazons-survive-invisible-e-waste-is-poisoning-the-world/
2•PaulHoule•39m ago•0 comments

Is it better to shower in the morning or at night?

https://www.rte.ie/brainstorm/2025/1227/1513861-showering-morning-night-hygiene/
1•austinallegro•40m ago•0 comments

Yonaguni, the Japanese Island on the Front Lines of China's Feud with Japan

https://www.nytimes.com/2025/12/30/world/asia/japan-china-island-yonaguni.html
1•thm•41m ago•0 comments

Image Pixelator – Pixelate Images Online, Fast and Private

https://imagepixelator.org/
2•wu1064442747•47m ago•1 comments

Squad In Sync – Kill the group chat "Where are we going?" loop

1•alexcloudstar•47m ago•0 comments

AI vs. Real

https://ai-vs-real.com/
2•dsego•47m ago•0 comments

When Vibe Scammers Met Vibe Hackers: Pwning PhaaS with Their Own Weapons [video]

https://media.ccc.de/v/39c3-when-vibe-scammers-met-vibe-hackers-pwning-phaas-with-their-own-weapons
1•Klaster_1•50m ago•0 comments

Torch.ts – building PyTorch in TypeScript from scratch to learn

https://github.com/13point5/torch.ts
3•13point5•56m ago•1 comments

Sophia: A Persistent Agent Framework of Artificial Life

https://arxiv.org/abs/2512.18202
1•mpweiher•58m ago•0 comments

Brew by Weight? Brew by AI – Archestra Blog – Archestra

https://archestra.ai/blog/brew-by-ai
1•scoring-wade-6c•1h ago•0 comments

How Much Does a Horse Cost?

https://horseguidehub.com/how-much-does-a-horse-cost
1•onSmallMessage•1h ago•3 comments

What can I do if ChatGPT gets increasingly laggy after a long conversation?

1•ZHUDAN509•1h ago•3 comments

100k-Watt Iron Beam laser becomes first to be operationally deployed

https://www.tomshardware.com/tech-industry/100kw-iron-beam-laser-becomes-worlds-first-drone-defen...
6•tomerbd•1h ago•0 comments

Show HN: Proving 67M ZK rows on a laptop in 28s (Winterfell OOMs)

1•y00zzeek•1h ago•1 comments

Show HN: GPU-Zombie-Hunter – Find GPU Processes Wasting $2,880/Month

https://github.com/ecl-runtime/gpu-zombie-hunter
1•gpuzombiehunter•1h ago•0 comments

Show HN: How SQL Parsers Work

https://nishchith.com/sql-parsers/
1•inishchith•1h ago•0 comments

A Simple Reason Snow Globe Glass Cups Feel More "Special" Than Regular Ones

1•tumblers•1h ago•0 comments

AI as an Attributable Representation Channel: An AI-Mediated Governance Failure

https://zenodo.org/records/18105273
1•businessmate•1h ago•1 comments