CPU-only PPO solving TSPLIB lin318 in 20 mins (0.08% gap)

1•jivaprime•37m ago

Hi all,

I’ve put together a repo demonstrating how to train PPO directly on a single TSPLIB instance (lin318) from scratch, with no pre-training and no GPU.

Repo: https://github.com/jivaprime/TSP

1. Experiment Setup

Problem: TSPLIB lin318 (Opt: 42,029) & rd400 (Opt: 15,281)

Hardware: Google Colab (CPU only)

Model: Single-instance PPO policy + Value network. Starts from random initialization.

Local Search: Light 2-opt during training, Numba-accelerated 3-opt for evaluation.

Core Concept: Instead of a "stable average-error minimizer," this policy is designed as a high-variance explorer. The goal isn't to keep the average gap low, but to occasionally "spike" very low-error tours that local search can polish.
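To make the "spike, then polish" idea concrete, here is a minimal sketch in plain Python. It is not the code from the repo: `two_opt`, `best_of_samples`, and `random_tour` are hypothetical names, and random permutations stand in for tours sampled from the PPO policy. It only illustrates how a high-variance sampler and a 2-opt local search can be combined by keeping the best polished candidate.

```python
import math
import random

def tour_length(tour, pts):
    """Total length of the closed tour visiting pts in the given order."""
    n = len(tour)
    return sum(math.dist(pts[tour[i]], pts[tour[(i + 1) % n]]) for i in range(n))

def two_opt(tour, pts):
    """Apply improving 2-opt moves (segment reversals) until none remains."""
    tour = list(tour)
    n = len(tour)
    improved = True
    while improved:
        improved = False
        for i in range(n - 1):
            for j in range(i + 2, n):
                if i == 0 and j == n - 1:
                    continue  # the two edges share a city, so there is no move
                a, b = pts[tour[i]], pts[tour[i + 1]]
                c, d = pts[tour[j]], pts[tour[(j + 1) % n]]
                # Replace edges (a,b) and (c,d) with (a,c) and (b,d).
                delta = (math.dist(a, c) + math.dist(b, d)
                         - math.dist(a, b) - math.dist(c, d))
                if delta < -1e-9:
                    tour[i + 1:j + 1] = reversed(tour[i + 1:j + 1])
                    improved = True
    return tour

def best_of_samples(sample_tour, pts, n_samples=16):
    """Polish several sampled tours and keep the shortest one."""
    best = None
    for _ in range(n_samples):
        cand = two_opt(sample_tour(), pts)
        if best is None or tour_length(cand, pts) < tour_length(best, pts):
            best = cand
    return best

# Demo on random points; random_tour is a stand-in for a policy sample.
random.seed(0)
pts = [(random.random(), random.random()) for _ in range(30)]

def random_tour():
    t = list(range(len(pts)))
    random.shuffle(t)
    return t

best = best_of_samples(random_tour, pts)
print(round(tour_length(best, pts), 3))
```

Because only the best polished tour is kept, the average quality of the samples matters less than how good the occasional outlier is.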

2. Results: lin318

Best Shot: 42,064 (Gap ≈ +0.08%)

Time: Reached within ~20 minutes on Colab CPU.

According to the logs (included in the repo), the sub-0.1% shot appeared around elapsed=0:19:49. While the average error oscillates around 3–4%, the policy successfully locates a deep basin that 3-opt can exploit.
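For reference, the gap figure is the relative excess over the known optimum. A small sketch of the arithmetic, using the numbers quoted above (`gap_percent` is just an illustrative helper name, not something from the repo):

```python
def gap_percent(cost, optimum):
    """Relative excess of a tour cost over the known optimum, in percent."""
    return 100.0 * (cost - optimum) / optimum

# lin318: best shot 42,064 against the optimum 42,029
print(f"{gap_percent(42_064, 42_029):.3f}%")  # 0.083%
```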

3. Extended Experiment: Smart ILS & rd400

I extended the pipeline with "Smart ILS" (Iterated Local Search) post-processing to see if we could hit the exact optimum.

A. lin318 + ILS

Took the PPO-generated tour (0.08% gap) as a seed.

Ran Smart ILS for ~20 mins.

Result: Reached the exact optimum (42,029).

B. rd400 + ILS

PPO Phase: ~2 hours on CPU. Produced tours with ~1.9% gap.

ILS Phase: Used PPO tours as seeds. Ran for ~40 mins.

Result: Reached 0.079% gap (Cost 15,293 vs Opt 15,281).
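For readers unfamiliar with ILS, the sketch below shows the generic textbook loop: perturb the current best tour with a double-bridge kick, re-run local search, and accept the result only if it is shorter. This is an assumption about the general shape, not the "Smart ILS" in the repo, whose details are not described here. All function names are hypothetical, 2-opt is used as the local search for brevity, and the identity permutation stands in for a PPO-generated seed tour.

```python
import math
import random

def length(tour, pts):
    """Closed-tour length; index -1 wraps around to close the cycle."""
    return sum(math.dist(pts[tour[i - 1]], pts[tour[i]]) for i in range(len(tour)))

def two_opt_pass(tour, pts):
    """One in-place sweep of improving 2-opt reversals; True if anything changed."""
    n, changed = len(tour), False
    for i in range(n - 1):
        # For i == 0 the last edge is adjacent to the first, so stop one early.
        for j in range(i + 2, n - 1 if i == 0 else n):
            a, b, c, d = tour[i], tour[i + 1], tour[j], tour[(j + 1) % n]
            old = math.dist(pts[a], pts[b]) + math.dist(pts[c], pts[d])
            new = math.dist(pts[a], pts[c]) + math.dist(pts[b], pts[d])
            if new < old - 1e-9:
                tour[i + 1:j + 1] = reversed(tour[i + 1:j + 1])
                changed = True
    return changed

def local_search(tour, pts):
    """Run 2-opt sweeps on a copy of the tour until it is a local optimum."""
    tour = list(tour)
    while two_opt_pass(tour, pts):
        pass
    return tour

def double_bridge(tour, rng):
    """Cut the tour into four parts A|B|C|D and rejoin them as A|C|B|D."""
    i, j, k = sorted(rng.sample(range(1, len(tour)), 3))
    return tour[:i] + tour[j:k] + tour[i:j] + tour[k:]

def iterated_local_search(seed_tour, pts, iters=200, rng=None):
    """Kick, re-optimize, and keep the candidate only if it improves."""
    rng = rng or random.Random(0)
    best = local_search(seed_tour, pts)
    best_len = length(best, pts)
    for _ in range(iters):
        cand = local_search(double_bridge(best, rng), pts)
        cand_len = length(cand, pts)
        if cand_len < best_len - 1e-9:
            best, best_len = cand, cand_len
    return best, best_len

# Demo on random points; `seed` is a stand-in for a PPO-generated tour.
rng = random.Random(1)
pts = [(rng.random(), rng.random()) for _ in range(40)]
seed = list(range(40))
best, best_len = iterated_local_search(seed, pts, iters=100, rng=rng)
print(f"seed {length(seed, pts):.3f} -> ILS {best_len:.3f}")
```

The double-bridge kick is a common choice because 2-opt alone cannot undo it in a single move, so each restart lands in a nearby but different local optimum.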

Summary

The workflow separates concerns effectively:

PPO: Drives the search into a high-quality basin (1–2% gap).

ILS: Digs deep within that basin to find the optimum.

If you are interested in instance-wise RL, CPU-based optimization, or comparing against ML-TSP baselines (POMO, AM, NeuroLKH), feel free to check out the code.

Constructive feedback is welcome!
