
Is AI alignment repeating the mistakes of "New Coke"?

https://en.wikipedia.org/wiki/New_Coke
2•zaptrem•9mo ago

Comments

zaptrem•9mo ago
"New Coke" is one of the most notable failed product launches in the American food and beverage industry, and I feel like some of its core lessons are becoming increasingly relevant to modern AI developers.

For those <40: In the 1980s, senior executives at Coca-Cola had a problem: Pepsi was gaining ground, partly thanks to the "Pepsi Challenge" - blind sip tests where consumers often preferred Pepsi's sweeter taste. Coke R&D developed a new, sweeter formula that beat both Pepsi and original Coke in these single-sip taste tests involving many thousands of consumers. Based on this data, they launched "New Coke" in 1985.

The result was a legendary disaster. Outrage, protests, hoarding of the original formula. The problem was people didn't just sip Coke; they drank whole cans. They also valued the brand, the history and the familiarity - factors the narrow taste tests completely missed. Within months, "Coca-Cola Classic" was back. New Coke production was quietly scaled back in the early 90s, but stuck around in a few markets until the early 00s.

I think AI practitioners are starting to learn the same lesson. We're tuning our models with RLHF/DPO/other preference methods based on similar one-step blind taste tests. Raters pick the "better" response between two options, often optimizing for immediate helpfulness, agreeableness, or perceived safety in that isolated interaction. I think some of the more extreme recent LLM tuning may also be fueled by taste-test-style benchmarks like LMSYS and the Artificial Analysis image leaderboard.

Examples: ChatGPT's most recent update turned it into an overenthusiastic sycophant. Image models (Apple's Image Playground model is a particularly egregious example you can try right now) are frequently preference-tuned until every generation looks like something out of a Pixar movie. Certain music models are incapable of generating music that doesn't sound like a 2020s top-40 song.

In all cases, it might taste/sound/look good once, but ultimately people will get sick of it. I work on generative models and I think (at least for our modality, music) the most enduring enjoyment of using them is the element of surprise and delight, which is increasingly being ruined by preference tuning which collapses the distribution of possible outputs.
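To make the "collapses the distribution" point concrete, here is a toy sketch, not a description of how any real model is tuned. It assumes the standard KL-regularized objective used in RLHF-style methods, whose optimum has the closed form p*(x) ∝ p0(x)·exp(r(x)/β). The four "styles", their reward scores, and the function names are all made up for illustration.

```python
import math

def tilt(base_probs, rewards, beta):
    """Closed-form optimum of KL-regularized reward maximization:
    p*(x) is proportional to p0(x) * exp(r(x) / beta)."""
    weights = [p * math.exp(r / beta) for p, r in zip(base_probs, rewards)]
    total = sum(weights)
    return [w / total for w in weights]

def entropy(probs):
    """Shannon entropy in nats; lower means less varied outputs."""
    return -sum(p * math.log(p) for p in probs if p > 0)

# Hypothetical: four output "styles" the base model produces equally often.
base = [0.25, 0.25, 0.25, 0.25]
# Hypothetical one-shot rater scores; style 0 wins the taste test.
rewards = [1.0, 0.6, 0.3, 0.0]

# Smaller beta means weaker regularization, i.e. more aggressive tuning.
for beta in (10.0, 1.0, 0.1):
    tuned = tilt(base, rewards, beta)
    print(f"beta={beta}: probs={[round(p, 3) for p in tuned]}, "
          f"entropy={entropy(tuned):.3f}")
```

As beta shrinks, nearly all of the probability mass moves onto the single style raters liked best and the entropy drops toward zero, which is the loss of surprise described above.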

Are we optimizing away the very qualities that make these models interesting, creative, and truthful in the long run, just to win the immediate "preference" taste test and rank higher in benchmarks? IMO we're witnessing the New Coke of AI.

techpineapple•9mo ago
I think your metaphor is a bit convoluted :-) but I think your theory here is important.

There are a lot of concerns I have with AI in this area. For "facts", e.g. who signed the Declaration of Independence, efficient search with one answer is probably fine. But for anything remotely controversial, the idea that we're going to accept the one answer of the AI is really problematic, and I think it will lead to what I've been calling AI-Think (à la groupthink).

This is sort of directly adjacent to what you're describing. Instead of reading perspectives from different bloggers or different sources, you'll be getting all of your perspectives from an identical sort of worldview. Instead of browsing a feed with a whole bunch of different personalities, it's basically a feed of one, and as you're describing, the feed will be definitionally middle-ground milquetoast.

Paul Graham said that he was replacing most of his Google searches with ChatGPT, and I trust that Paul Graham is a smart guy aware of this risk, but it does make me wonder: if you're using AI to do all the research for your writing, how does that affect your writing when your sources become a monoculture? How does it collectively affect all art inspired by AI?

zaptrem•9mo ago
Pushing back against a single set of AI opinions for everyone (which tended to be criticized as woke), in order to better reflect the “vibes” of the user, is also part of what got the most recent 4o release to start enthusiastically agreeing with flat earthers. But if it’s not allowed to state opinions at all, then you get really high refusal rates, which annoys everyone.

techpineapple•9mo ago
Both of these are bad options though. Reflecting the "vibes" of the user is worse than ChatGPT having its own crappy opinions, because at least then you're exposed to something different. But it's not even about the opinion. It's not enough for ChatGPT to say "Some people believe in free markets and others would like a more centrally controlled economy"; it's the diverse aesthetics or patina that are important.