Semantic – Reducing LLM "Agent Loops" by 27.78% via AST Logic Graphs

https://github.com/concensure/Semantic
31•concensure•1h ago

Comments

concensure•1h ago
The Problem: Most RAG-based coding tools treat code as unstructured text, relying on probabilistic vector search that often misses critical functional dependencies. This leads to the "Edit-Fail-Retry" loop, where the LLM consumes more time and money through repeated failures.

The Solution: Semantic uses a local AST (Abstract Syntax Tree) parser to build a Logical Node Graph of the codebase. Instead of guessing what is relevant, it deterministically retrieves the specific functional skeletons and call-site signatures required for a task.

The Shift: From "Token Savings" to "Step Savings"

Earlier versions of this project focused on minimizing tokens per call. However, our latest benchmarks show that investing more tokens into high-precision context leads to significantly fewer developer intervention steps.

Latest A/B Benchmark (2026-03-27)

    Provider: OpenAI (gpt-4o / o1)

    Suite: 11-task core suite (atomic coding tasks)

    Configuration: autoroute_first=true, single_file_fast_path=false

    Run Variant            Token Delta (per call)   Step Savings (vs Baseline)   Task Success
    Baseline (2026-03-13)  -18.62%                  —                            11/11
    Hardened A             +8.07%                   —                            11/11
    Enhanced (2026-03-27)  -6.73%                   +27.78%                      11/11

Key Takeaways:

    The ROI of Precision: While the "Enhanced" run used roughly 6.73% more tokens than the baseline per request, it required 27.78% fewer steps to reach a successful solution.

    Deterministic Accuracy: By feeding the LLM a "Logical Skeleton" rather than fuzzy similarity-search chunks, we eliminate the "lost in the middle" effect. The agent understands the consequences of an edit before it writes a single line.

    Context Density: We are effectively trading cheap input tokens for expensive developer time and agent compute cycles.
Detailed breakdowns of the task suite and methodology are available in docs/AB_TEST_DEV_RESULTS.md.
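
To make the "functional skeleton plus call-site" idea above concrete, here is a minimal sketch. It is not Semantic's implementation: it swaps in Python's standard `ast` module for whatever parser the project actually uses, and the names `build_graph`, `context_for`, and the `SOURCE` sample are hypothetical, chosen only for illustration. It indexes top-level functions, records which function calls which, and then assembles the context for one target function deterministically rather than by similarity search.

```python
import ast
from collections import defaultdict

# Hypothetical sample module, used only for this illustration.
SOURCE = '''
def load(path):
    """Read a file."""
    return open(path).read()

def parse(text, strict=False):
    return text.split()

def main(path):
    data = load(path)
    return parse(data, strict=True)
'''


def build_graph(source):
    """Index top-level functions: signature skeletons and direct call sites."""
    tree = ast.parse(source)
    skeletons = {}
    callers = defaultdict(list)
    for node in tree.body:
        if not isinstance(node, ast.FunctionDef):
            continue
        # Skeleton keeps positional parameter names only (defaults are dropped).
        args = ", ".join(a.arg for a in node.args.args)
        skeletons[node.name] = f"def {node.name}({args}): ..."
        # Record every plain-name call made inside this function body.
        for inner in ast.walk(node):
            if isinstance(inner, ast.Call) and isinstance(inner.func, ast.Name):
                callers[inner.func.id].append((node.name, ast.unparse(inner)))
    return skeletons, dict(callers)


def context_for(name, skeletons, callers):
    """Assemble the target's skeleton plus each caller's skeleton and call site."""
    lines = [skeletons[name]]
    for caller, call_text in callers.get(name, []):
        lines.append(skeletons[caller])
        lines.append(f"    # call site: {call_text}")
    return "\n".join(lines)


skeletons, callers = build_graph(SOURCE)
print(context_for("parse", skeletons, callers))
```

The sketch needs Python 3.9 or newer for `ast.unparse`. It only follows direct calls by bare name within one file; method calls, imports, and cross-file resolution are left out, and those are presumably where a real tool does most of its work.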
rs545837•44m ago
Cool to see more people building on tree-sitter + logic graphs for agents. We've been going deeper on this direction with sem
gbnwl•42m ago
OK, I'll take the opportunity to be the first non-self-promotional comment on this thread now that concensure and rohan2003 have done their ads.

Based on this post's current position on the front page, it kind of seems to fall in line with a pattern we've all been seeing the past few months: HN is finally majority onboard with believing in the usefulness of coding agents and is celebrating this by rediscovering each and every personal "I improved CC by doing [blank] thing" from-scratch project.

That's all whatever. Fine. But what I'm really curious about is does the HN community actually look at the random LLM-generated statistic-vomit text posted by creators like this and find themselves convinced?

I ask because if you're new to random stat vomit you're going to find yourself having to deal with it all the time soon, and I've yet to find good meta discussions about how we find signal in this noise. I used to use HN or selected reddit community upvotes as a first-pass "possibly important" signal, but it's been getting worse and worse, illustrated by posts like this getting upvoted to the top without any genuine discussion.

vova_hn2•7m ago
> random LLM-generated statistic-vomit text

I do not understand why this project in particular has set you off.

Their README looks much better than many I've seen on HN:

- no annoying verbosity, which is so prevalent in AI-generated text
- not too many buzzwords (they're not saying "agentic" every sentence)
- it is very clear what exactly the project is supposed to do and why it can be useful

Personally, I upvoted this because I wanted to do something similar for a long time but never got around to it.

verdverm•50s ago
> Personally, I upvoted this because I wanted to do something similar for a long time but never got around to it.

It's much easier to give your agents the LSP for the language(s) they're working on.

Google's 200M-parameter time-series foundation model with 16k context

https://github.com/google-research/timesfm
1•codepawl•1m ago•0 comments

Sony halts memory card shipments due to NAND shortage

https://www.techzine.eu/news/devices/140058/sony-halts-memory-card-shipments-due-to-nand-shortage/
2•methuselah_in•10m ago•0 comments

Gone (Almost) Phishin'

https://ma.tt/2026/03/gone-almost-phishin/
2•luu•11m ago•0 comments

Scientists say we've been looking in the wrong place for human origins

https://www.sciencedaily.com/releases/2026/03/260327230113.htm
1•DeathArrow•11m ago•0 comments

GitHub backs down, kills Copilot pull-request ads after backlash

https://www.theregister.com/2026/03/30/github_copilot_ads_pull_requests/
2•_____k•11m ago•0 comments

Information Flow Kernel for Claude Code Hooks

https://github.com/coproduct-opensource/nucleus/blob/main/docs/quickstart-hook.md
1•difc•11m ago•1 comments

Vector Databases Explained in 3 Levels of Difficulty

https://machinelearningmastery.com/vector-databases-explained-in-3-levels-of-difficulty/
1•eigenBasis•13m ago•0 comments

A Knowledge Graph

https://tjid3.org/test/kg12.74
1•TimothyMJones•13m ago•1 comments

Would you use a GitHub App that auto-generates changelogs from commit diffs?

1•mandeepsng•14m ago•2 comments

Run virtualized iOS with Private Cloud Compute drivers

https://github.com/wh1te4ever/super-tart-vphone-writeup
1•goranmoomin•16m ago•0 comments

Read-Only vs. Action AI: Why Most Odoo AI Tools Stop at the Report

https://www.odooclaw.ai/blog/read-only-vs-action-ai-why-most-odoo-ai-tools-stop-at-the-report
1•oktra_dev•18m ago•0 comments

Securing Elliptic Curve Cryptocurrencies Against Quantum Vulnerabilities [pdf]

https://quantumai.google/static/site-assets/downloads/cryptocurrency-whitepaper.pdf
2•nstj•21m ago•0 comments

Meta Testing Instagram Plus Subscription with Exclusive Features

https://techlomedia.in/2026/03/meta-testing-instagram-plus-subscription-with-exclusive-features-1...
1•deepanker70•23m ago•1 comments

Distributed builds of LLVM with CMake, recc, and NativeLin

https://reidkleckner.dev/
1•swq115•31m ago•0 comments

Claude Code bug can silently 10-20x API costs

https://old.reddit.com/r/ClaudeCode/comments/1s7mitf/psa_claude_code_has_two_cache_bugs_that_can
2•wg0•32m ago•0 comments

Cognitive profiling from speech, not multiple choice

https://expressivecognition.org/
1•baplantas•37m ago•0 comments

The Philosopher and the Tsar

https://www.the-hinternet.com/p/the-philosopher-and-the-tsar
1•Caiero•38m ago•0 comments

Ask HN: Why have supply chain attacks become a near daily occurrence?

2•dhruv3006•38m ago•0 comments

Mental Health Fashion

https://nosaddays.com/
2•samdreamin•40m ago•0 comments

Show HN: I built an app after nearly missing a passport expiry

https://traveldocumentvault.com/
2•mustafahafeez•42m ago•1 comments

Screenpipe

https://github.com/screenpipe/screenpipe
2•handfuloflight•43m ago•0 comments

You can now run a full Linux operating system inside a 6mb PDF

https://twitter.com/oliviscusAI/status/2038563166431346865
3•matthewsinclair•49m ago•1 comments

Show HN: Provero – Data quality checks in YAML, compiled to single SQL queries

https://github.com/provero-org/provero
2•andreahlert•49m ago•0 comments

Office EU: European-owned cloud based office suite

https://office.eu
3•koenraad•50m ago•1 comments

Tokens Are the New Oil: How China Is Quietly Winning the AI Economy

https://thamizhelango.medium.com/tokens-are-the-new-oil-how-china-is-quietly-winning-the-ai-econo...
3•KnuthIsGod•51m ago•0 comments

The United States has become a rogue state

https://www.washingtonpost.com/ripple/2026/03/26/united-states-trump-rogue-state-iran/
2•hkhn•52m ago•0 comments

Show HN: Tiny Axios Alternative, Fch

https://www.npmjs.com/package/fch
1•franciscop•52m ago•0 comments

Our AI traced the axios NPM attack and found how the payload hid itself

https://app.strix.ai/share/chats/NDIxNzZiMTItZWQ2My00NDY4LWIzYzUtNDEyZDgyMWI1YjYzLm1uZTJldnQ0LkVt...
2•ahmedallam2•52m ago•0 comments

What is 'tokenomics' and how would China gain the edge

https://www.scmp.com/tech/big-tech/article/3347495/how-china-could-dominate-ai-eras-tokenomics-va...
1•KnuthIsGod•52m ago•0 comments

Delve – Fake Compliance as a Service – Part II

https://substack.com/home/post/p-192665132
2•theahura•55m ago•0 comments