Show HN: ContextPacker – code context API for your agent without vector databases

https://contextpacker.com/
2•rozetyp•1h ago
Every time I wanted to give an LLM context from a codebase, I ended up doing this:

- Set up a vector DB
- Write chunking logic
- Run an indexer
- Keep it in sync with git on any change

For a simple "just answer this question about this repo" request, that felt like too much. So I built a small API instead: you send a repo and a question, and it sends back the files an LLM actually needs (not sure how novel this is).

What it does:

You call an HTTP endpoint with a GitHub repo URL and a natural-language question. Specific questions work best, but broad ones also work: "How does auth work?", "What validates webhook signatures?", etc.

The API returns JSON with 1–10 ranked files:

- `path`, `language`, `size`, full `content`
- plus a small `stats` object (token estimate, rough cost savings)

You plug those files into your own LLM / agent / tool. There are no embeddings, no vector DB, and no background indexing job. It works on the very first request.
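A rough sketch of what a client call could look like in Python. Only the per-file fields (`path`, `language`, `size`, `content`) and the `stats` object come from the description above. The endpoint path, the request field names (`repo_url`, `question`), the `files` key, and the bearer-token header are assumptions made up for illustration, not the documented API.

```python
import json
import urllib.request

# Hypothetical endpoint: the real path is not stated in this post.
API_URL = "https://contextpacker.com/api/pack"


def build_request(repo_url: str, question: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying repo URL + question."""
    # Field names "repo_url" and "question" are assumed, not documented.
    body = json.dumps({"repo_url": repo_url, "question": question}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
    )


def files_to_prompt(response: dict, question: str) -> str:
    """Turn the returned files into one prompt string for your own LLM."""
    parts = []
    for f in response.get("files", []):  # the "files" key is an assumption
        header = f"### {f['path']} ({f['language']}, size={f['size']})"
        parts.append(f"{header}\n{f['content']}")
    return "\n\n".join(parts) + f"\n\nQuestion: {question}"


def ask(repo_url: str, question: str, api_key: str) -> str:
    """Send the request and return a prompt built from the JSON response."""
    req = build_request(repo_url, question, api_key)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return files_to_prompt(json.load(resp), question)
```

The string returned by `ask(...)` can be passed to whatever model or agent you already use. The service itself only chooses and returns files.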

Why I built this:

I just wanted to ask this repo a question without:

- Standing up Pinecone/Weaviate/Chroma
- Picking chunk sizes and overlap
- Running an indexer for every repo
- Dealing with sync jobs when code changes

This API skips all of that. It's meant for:

- One-off questions on random repos
- Agents / tools that hop across many repos
- Internal tools where you don't want more infra

Does it work at all?

On a small internal eval (177 questions across 14 repos, mix of Python, TS, monorepos + private ones):

- A cross-model LLM judge rated answers roughly on par with a standard embeddings + vector DB setup
- Latency is about 2–4 seconds on the first request per repo (shallow cloning + scanning), then faster from cache
- No indexing step: new repos work immediately

Numbers are from our own eval, so treat them as directional, not a paper. Happy to share the setup if anyone wants to dig in.

How it works:

1. On first request, it shallow clones the repo and builds a lightweight index: file paths, sizes, languages, and top-level symbols where possible.

2. It gives an LLM the file tree + question and asks it to pick the most relevant files.

3. It ranks, dedupes, and returns a pack of files that fits in a reasonable context window.
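To make step 1 concrete, here is a minimal sketch of a shallow clone plus a lightweight index in Python. It is a guess at the general shape, not the service's actual code: the extension map, the function names, and the choice to extract symbols only for Python files (via the standard `ast` module) are all assumptions.

```python
import ast
import os
import subprocess

# Extension-to-language map; a tiny illustrative subset (assumption).
LANGUAGES = {".py": "python", ".ts": "typescript", ".js": "javascript", ".md": "markdown"}


def shallow_clone(repo_url: str, dest: str) -> None:
    """Fetch only the latest commit, which keeps the first request cheap."""
    subprocess.run(["git", "clone", "--depth", "1", repo_url, dest], check=True)


def top_level_symbols(path: str) -> list[str]:
    """Return top-level function/class names for a Python file, else []."""
    if not path.endswith(".py"):
        return []
    try:
        with open(path, encoding="utf-8") as fh:
            tree = ast.parse(fh.read())
    except (SyntaxError, ValueError, UnicodeDecodeError, OSError):
        return []  # unparsable files simply get no symbols
    kinds = (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)
    return [node.name for node in tree.body if isinstance(node, kinds)]


def build_index(repo_dir: str) -> list[dict]:
    """Walk a checked-out repo and record path, size, language, symbols."""
    index = []
    for root, dirs, files in os.walk(repo_dir):
        dirs[:] = sorted(d for d in dirs if d != ".git")  # skip git metadata
        for name in sorted(files):
            full = os.path.join(root, name)
            ext = os.path.splitext(name)[1].lower()
            index.append({
                "path": os.path.relpath(full, repo_dir).replace(os.sep, "/"),
                "size": os.path.getsize(full),  # bytes on disk
                "language": LANGUAGES.get(ext, "unknown"),
                "symbols": top_level_symbols(full),
            })
    return index
```

Nothing here reads file contents beyond parsing Python sources for names, so the index stays small enough to hand to a model as a file tree.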

Basically: let an LLM read the file tree and pick files, instead of cosine-searching over chunks.
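The "pick, then pack" idea can be sketched in a few lines of Python. Everything below is illustrative: the prompt wording, the JSON-array reply format, the function names, and the roughly 4 characters per token estimate are assumptions, and the actual LLM call is left out so the example stays self-contained.

```python
import json


def build_selection_prompt(index: list[dict], question: str, max_files: int = 10) -> str:
    """Render the file tree plus the question as a prompt for the picker LLM."""
    tree = "\n".join(f"{e['path']} ({e['language']}, {e['size']} bytes)" for e in index)
    return (
        f"Question: {question}\n\nFiles:\n{tree}\n\n"
        f"Reply with a JSON array of up to {max_files} paths, most relevant first."
    )


def parse_picks(llm_reply: str, index: list[dict]) -> list[str]:
    """Keep only known paths, preserving the model's order and dropping repeats."""
    known = {e["path"] for e in index}
    picks, seen = [], set()
    for path in json.loads(llm_reply):  # assumes the reply is a JSON array
        if path in known and path not in seen:
            seen.add(path)
            picks.append(path)
    return picks


def pack(picks: list[str], contents: dict[str, str], token_budget: int) -> list[dict]:
    """Add files in ranked order, skipping any that would exceed the budget."""
    packed, used = [], 0
    for path in picks:
        tokens = len(contents[path]) // 4  # crude ~4 chars/token estimate
        if used + tokens > token_budget:
            continue  # too big for the remaining budget; try the next pick
        used += tokens
        packed.append({"path": path, "content": contents[path], "tokens": tokens})
    return packed
```

Filtering the model's reply against the known paths matters in practice, since a model can return paths that do not exist in the tree.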

Limitations:

- Eval is relatively small (177 questions / 14 repos), all hand-written – directional, not research-grade
- Works best on repos with sane structure and filenames
- First request per repo pays the clone cost (cached after)

Try it:

- Live demo: https://contextpacker.com
- DM me for an API key – keeping it free while I validate the idea.

If you're building code agents, "explain this repo" tools, or internal AI helpers over your company's repos – I'd love to hear how you'd want to integrate something like this (or where you think it will fall over). Very open to feedback and harsh benchmarks.
