frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Source code graphRAG for Java/Kotlin development based on jQAssistant

https://github.com/2015xli/jqassistant-graph-rag
1•artigent•4m ago•0 comments

Python Only Has One Real Competitor

https://mccue.dev/pages/2-6-26-python-competitor
2•dragandj•5m ago•0 comments

Tmux to Zellij (and Back)

https://www.mauriciopoppe.com/notes/tmux-to-zellij/
1•maurizzzio•6m ago•1 comments

Ask HN: How are you using specialized agents to accelerate your work?

1•otterley•7m ago•0 comments

Passing user_id through 6 services? OTel Baggage fixes this

https://signoz.io/blog/otel-baggage/
1•pranay01•8m ago•0 comments

DavMail Pop/IMAP/SMTP/Caldav/Carddav/LDAP Exchange Gateway

https://davmail.sourceforge.net/
1•todsacerdoti•8m ago•0 comments

Visual data modelling in the browser (open source)

https://github.com/sqlmodel/sqlmodel
1•Sean766•11m ago•0 comments

Show HN: Tharos – CLI to find and autofix security bugs using local LLMs

https://github.com/chinonsochikelue/tharos
1•fluantix•11m ago•0 comments

Oddly Simple GUI Programs

https://simonsafar.com/2024/win32_lights/
1•MaximilianEmel•11m ago•0 comments

The New Playbook for Leaders [pdf]

https://www.ibli.com/IBLI%20OnePagers%20The%20Plays%20Summarized.pdf
1•mooreds•12m ago•0 comments

Interactive Unboxing of J Dilla's Donuts

https://donuts20.vercel.app
1•sngahane•13m ago•0 comments

OneCourt helps blind and low-vision fans to track Super Bowl live

https://www.dezeen.com/2026/02/06/onecourt-tactile-device-super-bowl-blind-low-vision-fans/
1•gaws•15m ago•0 comments

Rudolf Vrba

https://en.wikipedia.org/wiki/Rudolf_Vrba
1•mooreds•15m ago•0 comments

Autism Incidence in Girls and Boys May Be Nearly Equal, Study Suggests

https://www.medpagetoday.com/neurology/autism/119747
1•paulpauper•16m ago•0 comments

Wellness Hotels Discovery Application

https://aurio.place/
1•cherrylinedev•17m ago•1 comments

NASA delays moon rocket launch by a month after fuel leaks during test

https://www.theguardian.com/science/2026/feb/03/nasa-delays-moon-rocket-launch-month-fuel-leaks-a...
1•mooreds•18m ago•0 comments

Sebastian Galiani on the Marginal Revolution

https://marginalrevolution.com/marginalrevolution/2026/02/sebastian-galiani-on-the-marginal-revol...
2•paulpauper•21m ago•0 comments

Ask HN: Are we at the point where software can improve itself?

1•ManuelKiessling•21m ago•1 comments

Binance Gives Trump Family's Crypto Firm a Leg Up

https://www.nytimes.com/2026/02/07/business/binance-trump-crypto.html
1•paulpauper•21m ago•0 comments

Reverse engineering Chinese 'shit-program' for absolute glory: R/ClaudeCode

https://old.reddit.com/r/ClaudeCode/comments/1qy5l0n/reverse_engineering_chinese_shitprogram_for/
1•edward•21m ago•0 comments

Indian Culture

https://indianculture.gov.in/
1•saikatsg•24m ago•0 comments

Show HN: Maravel-Framework 10.61 prevents circular dependency

https://marius-ciclistu.medium.com/maravel-framework-10-61-0-prevents-circular-dependency-cdb5d25...
1•marius-ciclistu•25m ago•0 comments

The age of a treacherous, falling dollar

https://www.economist.com/leaders/2026/02/05/the-age-of-a-treacherous-falling-dollar
2•stopbulying•25m ago•0 comments

Ask HN: AI Generated Diagrams

1•voidhorse•27m ago•0 comments

Microsoft Account bugs locked me out of Notepad – are Thin Clients ruining PCs?

https://www.windowscentral.com/microsoft/windows-11/windows-locked-me-out-of-notepad-is-the-thin-...
6•josephcsible•28m ago•1 comments

Show HN: A delightful Mac app to vibe code beautiful iOS apps

https://milq.ai/hacker-news
6•jdjuwadi•31m ago•1 comments

Show HN: Gemini Station – A local Chrome extension to organize AI chats

https://github.com/rajeshkumarblr/gemini_station
1•rajeshkumar_dev•31m ago•0 comments

Welfare states build financial markets through social policy design

https://theloop.ecpr.eu/its-not-finance-its-your-pensions/
2•kome•34m ago•0 comments

Market orientation and national homicide rates

https://onlinelibrary.wiley.com/doi/10.1111/1745-9125.70023
4•PaulHoule•35m ago•0 comments

California urges people avoid wild mushrooms after 4 deaths, 3 liver transplants

https://www.cbsnews.com/news/california-death-cap-mushrooms-poisonings-liver-transplants/
2•rolph•35m ago•0 comments
Open in hackernews

RL algorithms are less bitter-lesson-pilled than 2015-era deep learning

1•rajap•3mo ago
The real issue isn't reward shaping or curriculum learning - everyone complains about those. The deeper problem is that we're hardcoding the credit assignment timescale into our algorithms.

Discount factors (γ), n-step returns, GAE λ parameters - these are human priors about temporal abstraction baked directly into the learning signal. PPO's GAE(λ) literally tells the algorithm "here's how far into the future you should care about consequences." We're not learning this, we're imposing it. Different domains need different λ values. That's manual feature engineering, RL-style.

Biological learning doesn't have a global discount factor slider. Dopamine and temporal difference learning in the brain operate at multiple timescales simultaneously - the brain learns which timescales matter for which situations. Our algorithms? They get a single γ parameter tuned by grad students.

Even worse: exploration strategies are domain-specific hacks. ε-greedy for Atari, continuous noise processes for robotics, count-based bonuses for sparse rewards. We're essentially doing "exploration engineering" for each domain, like it's 2012 computer vision all over again.

Compare this to supervised learning circa 2015: we stopped engineering features and just scaled transformers. The architecture learned what mattered. RL in 2025? Still tweaking γ, λ, exploration coefficients, entropy bonuses for every new task.

True bitter-lesson compliance would mean learning your own temporal abstractions (dynamic γ), learning how to explore (meta-RL over exploration strategies), and learning credit assignment windows (adaptive eligibility traces). Some promising directions exist - options frameworks, meta-RL, world models with learned abstraction - but they're not mainstream because they're compute-hungry and unstable. We keep returning to human priors because they're cheaper. That's the opposite of the bitter lesson.

The irony is stark: RL researchers talk about "end-to-end learning" while manually tuning the most fundamental learning signal parameters. Imagine if vision researchers were still manually setting feature detector orientations in 2025. That's where RL is.

I predict: The next major RL breakthrough won't come from better policy gradient estimators. It'll come from algorithms that discover their own temporal abstractions and exploration strategies through meta-learning at scale. Only then will RL be bitter-lesson-pilled.