frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

RAG accuracy jumped from 10% to 60% when I added outcome scoring

https://roampal.ai/blog-ai-learns.html
11•roampal•1h ago

Comments

mistrial9•1h ago
What is this kind of blog post? It is like advertising only, with urgent "install this code now" talk at the end. Impolite at best.. not great front page material IMHO
roampal•1h ago
Fair point, the install instructions at the end were meant as a "here's how to try it if interested" but I can see how it reads as pushy. The core of the post is about the outcome scoring approach itself. Should've led with more depth on the methodology. Thanks for the feedback.
udfalkso•48m ago
It’s not pushy at all imo
ramenlover•1h ago
How did you measure the 60% improvement rate?
roampal•1h ago
Ran a 4-way comparison test across 200 query-memory pairs:

- Baseline RAG (embedding similarity only): 10%

- RAG + reranker: 20%

- Outcomes only (no reranker): 60%

- RAG + outcome scoring (mature memories with 20+ uses): 60%

"Accuracy" = correct memory ranked #1 for the query. The outcome scoring uses Wilson score lower bound - memories that consistently get positive feedback from the "user" get boosted, ones that fail get demoted.

Test methodology: https://github.com/roampal-ai/roampal/blob/main/dev/benchmar...

realaleris149•24m ago
> When I say "thanks, that worked," that memory gets promoted. When I say "no, that's wrong," it gets demoted. … > No manual tagging.

I think this is also a kind of tagging.

roampal•20m ago
You're right, it is a form of tagging technically. The difference is you're already saying "thanks that worked" or "nah that's wrong" anyway. No extra step, it just listens.

Show HN: Google Drive sync erased my data, so I built a proper backup tool

https://www.cloudchute.co.uk/
1•s-h-x•2m ago•1 comments

Sydney Uni data goes walkabout after criminals raid code repo

https://www.theregister.com/2025/12/19/sydney_uni_breach/
1•Bender•2m ago•0 comments

Show HN: Crunch – A Message Definition and Serdes Tool for Getting Things Right

https://github.com/sam-w-yellin/crunch
1•volatileint•2m ago•0 comments

Where Do New Ideas Come From?

https://himanshusinghbisht.substack.com/p/where-do-new-ideas-come-from
1•gilfoyle_7•3m ago•0 comments

Linux 6.19 Lands Fix for Seagate Barracuda HDD Taking Down the SATA Bus

https://www.phoronix.com/news/Linux-6.19-Seagate-HDD-Fix
1•Bender•3m ago•0 comments

Dismantling Defenses: Trump 2.0 Cyber Year in Review

https://krebsonsecurity.com/2025/12/dismantling-defenses-trump-2-0-cyber-year-in-review/
1•Bender•4m ago•0 comments

European Rail in Numbers (2025)

https://chuuchuu.com/2025wrapped
1•Gigacore•4m ago•0 comments

Dell says Win 11 transition is far slower than Win 10, yet PC sales have stalled

https://www.theregister.com/2025/11/26/dell_q3_2026/
1•PaulHoule•7m ago•0 comments

Show HN: tmpo – Minimal CLI time tracker with auto-detection for developers

https://github.com/DylanDevelops/tmpo
1•dylandevelops•7m ago•1 comments

Lofoten Islands Hiking

https://www.switchbacktravel.com/norway/lofoten-islands/hiking
1•mooreds•8m ago•0 comments

TunnelBear dropping custom server selection and split tunnel from its free tier

https://www.techradar.com/vpn/vpn-services/tunnelbear-reshapes-its-free-vpn-model-amid-rising-inf...
1•catlikesshrimp•8m ago•1 comments

Mountain home near Aspen, built for monks, sold to Palantir CEO for $120M

https://coloradosun.com/2025/12/19/monastery-sells-palantir-ceo/
1•mooreds•8m ago•0 comments

Ymery – Build Dear ImGui apps with YAML instead of code

https://github.com/zokrezyl/ymery
1•zokrezyl•9m ago•0 comments

When to Turn Off Your Lights

https://www.energy.gov/energysaver/when-turn-your-lights
1•mooreds•9m ago•0 comments

I got tired of losing track of job applications in spreadsheets

https://trackmyjobs.fyi/
1•yonikeshagun•10m ago•1 comments

Waymo suspends service in San Francisco as robotaxis stall during blackout

https://techcrunch.com/2025/12/21/waymo-suspends-service-in-san-francisco-as-robotaxis-stall-duri...
3•SilverElfin•11m ago•0 comments

AI Boom vs. Main Street: a data-backed look using public macro indicators

https://baselight.app/u/pjsousa/dashboard/the-ai-boom-vs-main-street
1•pjsousa79•13m ago•1 comments

A Biography of Earth Across the Age of Animals

https://www.quantamagazine.org/climate-extremes-are-a-hallmark-of-the-age-of-animals-20250915/
1•the__alchemist•13m ago•0 comments

Invisible infrared surveillance technology and those caught in its digital cage

https://apnews.com/photo-essay/chinese-surveillance-silicon-valley-tech-photo-essay-2da6d9ae5c29d...
1•NN88•13m ago•0 comments

Inverse Parentheses

https://kellett.im/a/inverse-parentheses
2•todsacerdoti•15m ago•0 comments

AI language models duped by poems

https://www.dw.com/en/ai-language-models-duped-by-poems/a-75180648
1•yladiz•16m ago•0 comments

What the Smartest Minds Think About AI

https://www.wsj.com/tech/ai/ai-conference-neurips-ff6398df
1•ironyman•18m ago•0 comments

How AI can bring on a second Industrial Revolution – Kevin Kelly (2017) [video]

https://www.youtube.com/watch?v=IjbTiRbeNpM
1•danielfalbo•21m ago•0 comments

Generating Data Shapes with Hypothesis

https://nedbatchelder.com/blog/202512/generating_data_shapes_with_hypothesis.html
1•ingve•22m ago•0 comments

Police push back on potential ban on Chinese-made drones

https://www.latimes.com/business/story/2025-12-18/security-concerns-mount-as-police-departments-f...
2•bookofjoe•26m ago•1 comments

Get an AI code review in 10 seconds

https://oldmanrahul.com/2025/12/19/ai-code-review-trick/
1•oldmanrahul•26m ago•0 comments

Global warming could trigger the next Ice Age

https://www.sciencedaily.com/releases/2025/12/251221043231.htm
1•ashishgupta2209•27m ago•1 comments

Idea: AI suggests code changes but execution decides What's wrong with this?

1•adhamghazali•28m ago•0 comments

Skybridge: TypeScript Framework for ChatGPT Apps

https://github.com/alpic-ai/skybridge
1•smurda•31m ago•0 comments

Samsung announces Exynos 2600, the first 2nm mobile chip

https://semiconductor.samsung.com/processor/mobile-processor/exynos-2600/
1•akyuu•35m ago•0 comments