frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

DNA is maybe 60-750MB of data

https://dynomight.net/dna/
32•MattSayar•10h ago

Comments

E_Evan•5h ago
https://github.com/samtools/htslib/blob/develop/sam_internal...
biomcgary•5h ago
Your genome only needs to store the information necessary for the lineage leading to you to survive and compete in the range of environmental variation that actually happened.

Consequently, your DNA has less information about how to survive on Jupiter or in the absence of oxygen.

However, your genome contains a fair amount of data on how to identify a mate that will maximize your reproductive success in an environment similar to the one your lineage experienced, e.g., a preference for symmetrical faces.

Until we can measure the environment of humans accurately, all the algorithmic complexity measures applied to the genome are going to be missing the relevant context.

boshalfoshal•4h ago
The way I think about it, DNA is just a metaprogram. The "programs" this metaprogram create (brain, gut, other organs, etc). can be far more information dense and complex than the actual metaprogram that initially created them.

The real interesting part is _how_ this small metaprogram can generate something like a brain, which is ostensibly multiples more complex than the DNA that produced it, since obviously DNA cannot possibly encode the data for every possible synaptic connection or protein or whatever losslessly. I think this is more of a testament to how complex the human body is, that it has such complex seemingly emergent behavior from a very sparse set of initial conditions.

hulitu•5h ago
> DNA is maybe 60-750MB of data

maybe

jiggawatts•4h ago
For a long time now, a thought has been repeating in my mind: How exactly are high level behaviours like sexual attraction encoded in our genes!?

It such a subtle thing too! We're attracted (or not) to the tiniest differences in physiology. If you doubt this, try this exercise: Pretend you've just met green aliens and have to explain to them how to reliably tell the difference between men and women from appearance alone! Now explain why that particular girl (or boy) is very pretty/handsome, but not that one.

It's one of those topics where the more you know, the more freaky it is.

DNA does not -- to our knowledge -- directly encode the "weights" of our neurons! It can't possibly because there are far more synapses than there bits of information in our genes. Also, most of those genes are dedicated to non-brain parts of the body plan and to the low-level machinery of our cellular biochemistry.

Secondly, DNA has only an indirect effect of our development: it encodes for proteins, which then provide chemical signals such as concentration gradients that guide cell division. It's a bit like playing SimCity, where the players' control is limited to zoning and road topology. The individual Sims are not directly controllable and behave stochastically.

Solving this problem is so freakishly difficult for even the incredible brute force of parallel search of evolution only managed to discover a solution a few times in a billion years.

Our attraction to our partners is a genetic heritage shared with all mammals, going back hundreds of millions of years. That's why Furry is a thing, but not Featherry. Birds are a different class from us mammals and don't share the same "partner attraction wiring" genes. (This is closely related to why all mammal babies are cute to humans, but baby bird chicks are generally repulsive.)

Because this is a hard problem to solve, the few solutions that were discovered had to be reused by entire classes of Animalia. I would hazard a guess that this is precisely what defines a “class” in taxonomy! If there were intelligent birds, their equivalent of Furry would be Featherry, and their crimes of bestiality would be with other non-intelligent birds, not mammals.

With LLMs, we got to see a glimpse into the possible mechanisms of intelligence, and what it might take to design or evolve one.

The LLM equivalent of this kind of encoding would be to design a model architecture that falls in love with a specific, narrowly selected, subset of its users. Keep in mind that I'm not talking about a learned or specifically tuned set of model weights! The architecture is where the attraction is encoded, such as selecting some complex variant or combination of Transformers, Mamba, or CNNs that just "so happen" to result in the model preferentially learning to be attracted to certain styles of conversation, but not others.

Worse still, the direct equivalent to what genes do is that you can't even choose an architecture directly, instead you can only contribute to PyTorch. You have to design its API such that naive developers using it stochastically tend towards the desired architecture of their own accord by simply tab-completing often.

That's essentially what evolution figured out, at least five or six times, but tunable, so that individual species can be attracted to each other but much less to even very closely related species.

And then, evolution found a way to add a "notch filter" such that despite increased attraction to closely related individuals, most animals (including humans) are repulsed sexually by their parents and siblings.

That's mind-blowing to me.

nly•3h ago
> And then, evolution found a way to add a "notch filter" such that despite increased attraction to closely related individuals, most animals (including humans) are repulsed sexually by their parents and siblings.

Not being attracted sexually to close relatives is likely the result of early imprinting when growing up under one roof, and not genetics. Indeed there is some evidence relatives separated at birth etc who meet later are more likely to unwittingly be attracted to one another.

jiggawatts•2h ago
It's obviously imprinting, but the mechanism for that imprinting is implemented in your neurons, which are defined by your genes. Ultimately, it's a "feature" that wouldn't be there if it wasn't for something in your genes encoding for it!

The question is how is that encoded, not just in your genes, but in the final neurons?

There are hundreds of trillions of synapses in the brain that a few megabytes of genes somehow shaped into: "be attracted to faces almost, but not exactly like your parents"... but not actually repulsed, because you have to get along nicely with your parents and not just run away in terror.

motrm•3h ago
DNA makes me think of ASN.1 in that a short sequence of bits can convey rather a lot of information - but it makes no sense without knowing which message those bits represent. Only with that knowledge can you turn those bits into something useful.
refurb•1h ago
This article only touches on it but the data stored is far more than just the base pairs.

DNA is like a computer program that when it runs, it provides feedback for the code and determines which parts of it should run. It can also modify the code (DNA methylation).

Then add on top the external environment - external molecules can interact with the machinery which then impact which code is executed.

If code is self-regulating, the amount of information it encodes is far higher than that defined by its base pairs.

Void: Open-source Cursor alternative

https://github.com/voideditor/void
594•sharjeelsayed•10h ago•250 comments

Fui: C library for interacting with the framebuffer in a TTY context

https://github.com/martinfama/fui
65•Bhulapi•4h ago•21 comments

Reservoir Sampling

https://samwho.dev/reservoir-sampling/
297•chrisdemarco•9h ago•62 comments

Progress toward fusion energy gain as measured against the Lawson criteria

https://www.fusionenergybase.com/articles/continuing-progress-toward-fusion-energy-breakeven-and-gain-as-measured-against-the-lawson-criteria
166•sam•10h ago•72 comments

From: Steve Jobs. "Great idea, thank you."

https://blog.hayman.net/2025/05/06/from-steve-jobs-great-idea.html
725•mattl•8h ago•198 comments

How the US Built 5k Ships in WWII

https://www.construction-physics.com/p/how-the-us-built-5000-ships-in-wwii
60•rbanffy•5h ago•39 comments

Notes on rolling out Cursor and Claude Code

https://ghiculescu.substack.com/p/nobody-codes-here-anymore
161•jermaustin1•10h ago•78 comments

Phoenician culture spread mainly through cultural exchange

https://www.mpg.de/24574685/0422-evan-phoenician-culture-spread-mainly-through-cultural-exchange-150495-x
49•gmays•3d ago•9 comments

Podfox: First Container-Aware Browser

https://val.packett.cool/blog/podfox/
29•pierremenard•4h ago•4 comments

Gorilla study reveals complex pros and cons of friendship

https://www.sciencedaily.com/releases/2025/05/250505170816.htm
19•lentoutcry•2d ago•11 comments

When Abandoned Mines Collapse

https://practical.engineering/blog/2025/5/6/when-abandoned-mines-collapse
140•impish9208•2d ago•41 comments

Show HN: Using eBPF to see through encryption without a proxy

https://github.com/qpoint-io/qtap
210•tylerflint•9h ago•65 comments

How to start a school with your friends

https://prigoose.substack.com/p/how-to-start-a-university
72•geverett•7h ago•27 comments

Stability by Design

https://potetm.com/devtalk/stability-by-design.html
71•potetm•6h ago•15 comments

First American pope elected and will be known as Pope Leo XIV

https://www.cnn.com/world/live-news/new-pope-conclave-day-two-05-08-25
473•saikatsg•10h ago•734 comments

Show HN: OpenRouter Model Price Comparison

https://compare-openrouter-models.pages.dev/
14•pacific01•3d ago•7 comments

Mathematical Problem Solving

https://www.cip.ifi.lmu.de/~grinberg/t/20f/
61•ibobev•3d ago•3 comments

Gender characteristics of service robots can influence customer decisions

https://www.psu.edu/news/health-and-human-development/story/gender-characteristics-service-robots-can-influence-customer
11•gnabgib•3h ago•6 comments

Prepare your apps for Google Play's 16 KB page size compatibility requirement

https://android-developers.googleblog.com/2025/05/prepare-play-apps-for-devices-with-16kb-page-size.html
25•ingve•5h ago•9 comments

Block Diffusion: Interpolating Autoregressive and Diffusion Language Models

https://m-arriola.com/bd3lms/
38•t55•8h ago•8 comments

Static as a Server

https://overreacted.io/static-as-a-server/
80•danabramov•8h ago•57 comments

A Brief History of Cursor's Tab-Completion

https://www.coplay.dev/blog/a-brief-history-of-cursor-s-tab-completion
20•josvdwest•2d ago•2 comments

The Rise and Fall of the Visual Telegraph (2017)

https://parisianfields.com/2017/11/05/the-rise-and-fall-of-the-visual-telegraph/
26•geox•7h ago•6 comments

A flat pricing subscription for Claude Code

https://support.anthropic.com/en/articles/11145838-using-claude-code-with-your-max-plan
96•namukang•5h ago•81 comments

Ciro (YC S22) is hiring a software engineer to build AI agents for sales

https://www.ycombinator.com/companies/ciro/jobs
1•dwiner•9h ago

Egyptologist uncovers hidden messages on Paris’s iconic obelisk

https://news.artnet.com/art-world/hidden-messages-paris-luxor-obelisk-2636508
84•isaacfrond•18h ago•79 comments

How Obama’s BlackBerry got secured (2013)

https://www.electrospaces.net/2013/04/how-obamas-blackberry-got-secured.html
195•lastdong•3d ago•75 comments

Ask HN: What are good high-information density UIs (screenshots, apps, sites)?

398•troupo•13h ago•311 comments

AI focused on brain regions recreates what you're looking at (2024)

https://www.newscientist.com/article/2438107-mind-reading-ai-recreates-what-youre-looking-at-with-amazing-accuracy/
59•openquery•2d ago•31 comments

My stackoverflow question was closed so here's a blog post about CoreWCF

https://richardcocks.github.io/2025-05-08-CoreWCF.html
99•eterm•14h ago•139 comments