Boogiebench: LLM Music Composition with Strudel

https://www.boogiebench.com/
1•__NSL__•2h ago

Comments

__NSL__•2h ago
How well can language models like Claude Opus and GPT-5.2 write music?

With boogiebench, I ask models to make Strudel compositions (https://strudel.cc/) in response to music prompts ('hyperpop', 'spaghetti-western theme', etc.) and generate Elo rankings based on user votes.

Unlike Suno, LLMs haven't been trained explicitly on this task, making it a nice generalization test (coding, aesthetics, temporal reasoning), akin to the pelican-riding-a-bike test with SVGs.

Models often struggle but are rapidly improving, judging by the performance gap between the strongest and weakest models. (The Anthropic models seem to underperform other model families relative to what we'd expect, for whatever reason.)

Honey's Dieselgate: Detecting and Tricking Testers

https://www.benedelman.org/honey-detecting-testers/
1•Leftium•2m ago•0 comments

Random Quality of Life Improvements That Will Change Your Life

https://twitter.com/thebeautyofsaas/status/2006104228381721052
1•gmays•4m ago•0 comments

A Year with Graphics

https://mropert.github.io/2025/12/30/a_year_with_graphics/
1•ibobev•5m ago•0 comments

Memory Safety Is

https://matklad.github.io/2025/12/30/memory-safety-is.html
1•ibobev•5m ago•0 comments

Updated LLM Benchmark (Gemini 3 Flash)

https://entropicthoughts.com/updated-llm-benchmark
1•ibobev•6m ago•0 comments

We have all we need to make mass surveillance a reality

https://idiallo.com/blog/we-have-all-we-need-for-mass-surveillance
1•firefoxd•8m ago•0 comments

Wildfire Smoke May Be More Dangerous Than Scientists Thought

https://scitechdaily.com/wildfire-smoke-may-be-far-more-dangerous-than-scientists-thought/
1•gmays•8m ago•0 comments

California DROP: Delete the information data brokers have about you coming 2026

https://consumer.drop.privacy.ca.gov/coming-soon.html
1•irsagent•11m ago•1 comments

Turning Dafny Sets into Sequences [video]

https://www.youtube.com/watch?v=-zAhtW8YFKM
2•larrytheliquid•16m ago•1 comments

Petition: Restore Free and Open Access to the ACM Digital Library

https://www.ipetitions.com/petition/restore-fully-free-and-open-access
1•underscoreF•18m ago•0 comments

X Users Have the Power to Edit Any Image Without Permission

https://petapixel.com/2025/12/29/x-users-have-the-power-to-edit-any-image-without-permission/
2•bookofjoe•25m ago•0 comments

We still don't know what Elon Musk's DOGE did

https://www.theguardian.com/technology/2025/dec/30/elon-musk-doge-impact-us-government
3•doener•26m ago•0 comments

Slaughterbots (2019 short film) [video]

https://www.youtube.com/watch?v=9fa9lVwHHqg
1•christianqchung•28m ago•1 comments

Show HN: Vektor – A native PHP vector database using HNSW

https://github.com/centamiv/vektor
1•centamiv•28m ago•0 comments

Waymo sues Santa Monica, and the city sues right back: Court fight ahead

https://www.latimes.com/california/story/2025-12-22/fight-between-waymo-santa-monica-goes-to-court
3•lokar•30m ago•1 comments

Attention Is Bayesian Inference

https://medium.com/@vishalmisra/attention-is-bayesian-inference-578c25db4501
2•samwillis•32m ago•0 comments

Unproven air taxi company is spending $126M to take over an L.A. airport

https://www.latimes.com/business/story/2025-11-24/california-air-taxi
1•PaulHoule•32m ago•0 comments

AI and politics and stagflation = workplace fatigue

https://www.glassdoor.com/blog/glassdoor-worker-fatigue-ai-politics/
3•andrewstetsenko•34m ago•0 comments

The Complete Sega Mark III (Retail) Collection

https://nintendosegajapan.com/2025/12/29/the-complete-sega-mark-iii-retail-collection/
1•msephton•34m ago•0 comments

Project ideas to appreciate the art of programming

https://codecrafters.io/blog/programming-project-ideas
10•vitaelabitur•36m ago•1 comments

Leadership Lab: The Craft of Writing Effectively (2014) [video]

https://www.youtube.com/watch?v=vtIzMaLkCaM
1•rognjen•37m ago•0 comments

Penn and Teller Help Rob Pike and Dennis Ritchie Play a Prank on Arno Penzias [video]

https://www.youtube.com/watch?v=fxMKuv0A6z4
2•susam•40m ago•0 comments

No Longer Burying the Lead: A New Media Culture for the Metacrisis

https://www.whatisemerging.com/opinions/no-longer-burying-the-lead
1•rendx•43m ago•1 comments

Alias Method

https://en.wikipedia.org/wiki/Alias_method
1•usgroup•47m ago•0 comments

I exposed my Homelab through Cloudflare Tunnels

http://ebourgess.dev/posts/exposing-homelab-through-cloudflare-tunnel/
3•ebourgess•48m ago•2 comments

Christmas 500 years ago was a drunken 6-week feast

https://fortune.com/2025/12/25/medieval-peasant-christmas-was-better-than-modern-holidays-histori...
2•Anon84•52m ago•4 comments

MemCachier Status Currently experiencing instability (for some days already)

https://status.memcachier.com
1•salzig•53m ago•0 comments

ReCollab: Retrieval-Augmented LLMs for Cooperative Ad-Hoc Teammate Modeling

https://arxiv.org/abs/2512.22129
1•StatsAreFun•53m ago•0 comments

Coverage.py sleepy snake logo (2019)

https://nedbatchelder.com/blog/201912/sleepy_snake.html
2•myroon5•54m ago•0 comments

New York's Subway, an Interview with Matthew Algeo

https://www.exasperatedinfrastructures.com/p/the-best-book-i-read-all-year
1•samsklar1•54m ago•0 comments