Open music foundation models for full-song generation

https://map-yue.github.io/
55•selvan•3d ago

Comments

lotyrin•2h ago
Very nice. Anyone know of projects that aren't tackling the full-song problem but rather instrument parts/loops/stems/acapellas? I'd like something that's more like an "infinite AI Loopcloud/Splice". In my experience, most of these full-song models don't do well when asked for individual parts (though I will have to try it with this one).
platers•2h ago
https://suno.com/studio-waitlist Just a waitlist so far, but it looks like this is the direction Suno is going.
lotyrin•2h ago
Yeah... I hope this is what their plan is with that, but I'm not entirely certain.
rwmj•2h ago
Also live AI dueting would be interesting, like having a virtual guitarist you could jam/duet with.
lotyrin•2h ago
Yeah. Or like, a loop that plays continuously and has style parameters exposed you can tweak with a controller like a Midi Fighter Twister and get feedback from in real-time. Then you could do something akin to DJ/live production by having two of these going in sync with each other into a mixer. (Tweak params of the cue track until you like it, transition at a phrase point, repeat).
HxokcPwi•1h ago
Like this? https://aistudio.google.com/apps/bundled/promptdj?showPrevie...
HxokcPwi•1h ago
Just saw this today: https://x.com/jesseengel/status/1953496623696556478
HxokcPwi•1h ago
Try https://magenta.withgoogle.com/infinite-crate
vunderba•48m ago
This gets discussed a lot but unfortunately there's just not much out there around this.

The closest thing I've seen is virtual drummers in Logic X which will follow along with the structure of your song and generate a percussive accompaniment. It's no substitute for a real drummer but it's serviceable.

ssalka•2h ago
Something interesting... the first 10 seconds or so of the "Death Growl" example[1] is basically copied verbatim from "Ov Fire And The Void" by Behemoth.

More specifically, I think the part that seems copied is at 2:13 of the original[2], as it leads into a solo-ish bit which in the AI version sounds similar still, but goes on to do its own thing:

[1] https://map-yue.github.io/music/moon.death_metal.mp3

[2] https://youtu.be/vAmnsKKrt9w?t=133

someothherguyy•1h ago
> Additionally, our memorization-effect experiments in Section 11 demonstrate that our design maintains creativity without plagiarizing, even under strong training set conditioning.

https://arxiv.org/html/2503.08638v1#S11

amelius•1h ago
Does Shazam think it is the same?
vorgol•50m ago
The YouTube link is suddenly not available anymore (at least in the UK).
bangaladore•1h ago
What is the use case for music generation models? I see use cases for a lot of the other foundation models like text, image, TTS, STT, but why do I want AI-generated music?
FridgeSeal•1h ago
Now you don’t need to know how to make music! You’re finally free of all those pesky, elitist musicians gate-keeping music!!!!1!
frank_nitti•51m ago
I’ve mostly used them for laughs with my friends. Sometimes generating “custom” songs with funny lyrics, but the most fun so far is editing lyrics of existing songs to say ridiculous things.

No real clue how someone would use them for a more serious endeavor. The only thing I could imagine would be to quickly iterate/prototype song structures on a fixed seed to generate ideas for a real composition. Consider an indie game developer or filmmaker getting some placeholder music to test the experience during early throwaway iterations.

libraryatnight•36m ago
Generating crappy background music for reality TV?
scarecrowbob•40m ago
yeah, but have yall made any progress in a model that can have sex with my partner for me?
