frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Per-instance TSP Solver with No Pre-training (1.66% gap on d1291)

12•jivaprime•8h ago
OP here.

Most Deep Learning approaches for TSP rely on pre-training with large-scale datasets. I wanted to see if a solver could learn "on the fly" for a specific instance without any priors from other problems.

I built a solver using PPO that learns from scratch per instance. It achieved a 1.66% gap on TSPLIB d1291 in about 5.6 hours on a single A100.

The Core Idea: My hypothesis was that while optimal solutions are mostly composed of 'minimum edges' (nearest neighbors), the actual difficulty comes from a small number of 'exception edges' outside of that local scope.

Instead of pre-training, I designed an inductive bias based on the topological/geometric structure of these exception edges. The agent receives guides on which edges are likely promising based on micro/macro structures, and PPO fills in the gaps through trial and error.

It is interesting to see RL reach this level without a dataset. I have open-sourced the code and a Colab notebook for anyone who wants to verify the results or tinker with the 'exception edge' hypothesis.

Code & Colab: https://github.com/jivaprime/TSP_exception-edge

Happy to answer any questions about the geometric priors or the PPO implementation!

Comments

mkl•2h ago
TSP = Travelling Salesman Problem (https://en.wikipedia.org/wiki/Travelling_salesman_problem)

PPO = Proximal Policy Optimisation, a reinforcement learning algorithm (https://en.wikipedia.org/wiki/Proximal_Policy_Optimization)

n8henrie•1h ago
Thanks. Was wondering if this was about my federal thrift savings plan.

Show HN: Aroma: Every TCP Proxy Is Detectable with RTT Fingerprinting

https://github.com/Sakura-sx/Aroma
46•Sakura-sx•4d ago•24 comments

Show HN: Evidex – AI Clinical Search (RAG over PubMed/OpenAlex and SOAP Notes)

https://www.getevidex.com
26•amber_raza•5h ago•12 comments

Show HN: See what readers who loved your favorite book/author also loved to read

https://shepherd.com/bboy/2025
105•bwb•10h ago•24 comments

Show HN: Vibe coding a bookshelf with Claude Code

https://balajmarius.com/writings/vibe-coding-a-bookshelf-with-claude-code/
246•balajmarius•9h ago•186 comments

Show HN: Per-instance TSP Solver with No Pre-training (1.66% gap on d1291)

12•jivaprime•8h ago•2 comments

Show HN: Agtrace – top and tail -f for AI coding agent sessions

https://github.com/lanegrid/agtrace
2•zawakin•1h ago•0 comments

Show HN: Spacelist, a TUI for Aerospace window manager

https://github.com/magicmark/spacelist
33•markl42•2d ago•6 comments

Show HN: My not-for-profit search engine with no ads, no AI, & all DDG bangs

https://nilch.org
178•UnmappedStack•17h ago•67 comments

Show HN: Z80-μLM, a 'Conversational AI' That Fits in 40KB

https://github.com/HarryR/z80ai
460•quesomaster9000•16h ago•101 comments

Show HN: A solar system simulation in the browser

https://luna.watermelonson.com/
5•watermelonson•4h ago•0 comments

Show HN: Zs3 – S3 server in ~1K lines of Zig, 250KB binary, zero dependencies

https://github.com/Lulzx/zs3
15•lulzx•7h ago•0 comments

Show HN: I built an "ilovepdf" for CSV files (and I called it ILoveCSV)

https://ilovecsv.net/
2•vinserello•3h ago•0 comments

Show HN: Superset – Terminal to run 10 parallel coding agents

https://superset.sh/
49•avipeltz•6d ago•42 comments

Show HN: Neko.js, a recreation of the first virtual pet

https://louisabraham.github.io/nekojs/
2•Labo333•4h ago•0 comments

Show HN: UpDown – Simple website uptime monitoring

https://updown.fly.dev/
3•ejncman•4h ago•0 comments

Show HN: MiddleViewer – A native macOS app for technical interview feedbacks

https://middleviewer.in
3•madhav_gaba•6h ago•0 comments

Show HN: Meter – Scrape sites and keep content in sync automatically (no LLM)

https://www.meter.sh/
4•mckinnonr•7h ago•0 comments

Show HN: Mysti – Claude, Codex, and Gemini debate your code, then synthesize

https://github.com/DeepMyst/Mysti
211•bahaAbunojaim•6d ago•176 comments

Show HN: I built a real-time IoT monitor bridging ESP8266, Go, and Next.js

https://synx-alpha.vercel.app
2•dapoadedire•8h ago•0 comments

Show HN: Kuack – Run Kubernetes jobs in visitor browsers

https://github.com/kuack-io/kuack
5•kuack•8h ago•0 comments

Show HN: LoongArch Userspace Emulator

https://github.com/libriscv/libloong
24•fwsgonzo•5d ago•9 comments

Show HN: Matchstick Puzzle Game in the Browser

https://matchmath.pages.dev/
10•prettyblocks•17h ago•2 comments

Show HN: Ez FFmpeg – Video editing in plain English

http://npmjs.com/package/ezff
411•josharsh•2d ago•194 comments

Show HN: Xcc700: Self-hosting mini C compiler for ESP32 (Xtensa) in 700 lines

https://github.com/valdanylchuk/xcc700
152•isitcontent•3d ago•36 comments

Show HN: Phantas – A browser-based binaural strobe engine (Web Audio API)

https://phantas.io
27•AphantaZach•1d ago•8 comments

Show HN: Witr – Explain why a process is running on your Linux system

https://github.com/pranshuparmar/witr
505•pranshuparmar•3d ago•98 comments

Show HN: Golazo – Live soccer updates in your terminal

https://github.com/0xjuanma/golazo
8•rocajuanma•23h ago•1 comments

Show HN: The Lobste.rs invitation tree, visualized

https://alexjacobs08.github.io/lobsters-graph/
4•tacoooooooo•6h ago•1 comments

Show HN: Awaaz – revolutionary public opinion app for understanding society

https://awaaz.app
4•shauryatusk•14h ago•0 comments

Show HN: I Built a Tool to Turn YouTube into Structured Courses

https://www.disclass.com
9•yunbiao•20h ago•0 comments