frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Writing a Self-Mutating x86_64 C Program (2013)

https://ephemeral.cx/2013/12/writing-a-self-mutating-x86_64-c-program/
118•kepler471•1y ago

Comments

belter•1y ago
I guess in OpenBSD because of W ^ X this would not work?
akdas•1y ago
I was thinking the same thing. Usually, you'd want to write the new code to a page that you mark as read and write, then switch that page to read and execute. This becomes tricky if the code that's doing the modifying is in the same page as the code being modified.
timewizard•1y ago
The way it's coded it wouldn't; however, you can map the same shared memory twice. Once with R|W and a second time with R|X. Then you can write into one region and execute out of it's mirrored mapping.
rkeene2•1y ago
In Linux it also needs mprotect() to change the permissions on the page so it can write it. The OpenBSD man page[0] indicate that it supports this as well, though notes that not all implementations are guaranteed to allow it, but my guess is it would generally work.

[0] https://man.openbsd.org/mprotect.2

Retr0id•1y ago
It's not required on linux, if the ELF headers are set up such that the page is mapped rwx to begin with. (but rwx mappings are generally frowned upon from a security perspective)
mananaysiempre•1y ago
Not as is, but I think OpenBSD permits you to map the same memory twice, once as W and once as X (which would be a reasonable hoop to jump through for JITs etc., except there’s no portable way to do it). ARM64 MacOS doesn’t even permit that, and you need to use OS-specific incantations[1] that essentially prohibit two JITs coexisting in the same process.

[1] https://developer.apple.com/documentation/apple-silicon/port...

saagarjha•1y ago
No, the protection is per-thread. You can run the JITs in different threads
alcover•1y ago
I often think this could maybe allow fantastic runtime optimisations. I realise this would be hardly debuggable but still..
Retr0id•1y ago
It already does, in the form of JIT compilation.
alcover•1y ago
OK but I meant in already native code, like in a C program - no bytecode.
Retr0id•1y ago
I mean that, too.
connicpu•1y ago
LuaJIT has a wonderful dynamic code generation system in the form of the DynASM[1] library. You can use it separately from LuaJIT for dynamic runtime code generation to create machine code optimized for a particular problem.

[1]: https://luajit.org/dynasm.html

lmm•1y ago
If you are generating or modifying code at runtime then how is that different from bytecode? Standardised bytecodes and JITs are just an organised way of doing the same thing.
vbezhenar•1y ago
I used GNU lightning library once for such optimisation. I think it was ICFPC 2006 task. I had to write an interpreter for virtual machine. Naive approach worked but was slow, so I decided to speed it up a bit using JIT. It wasn't a 100% JIT, I think I just implemented it for loops but it was enough to tremendously speed it up.
userbinator•1y ago
Programs from the 80s-90s are likely to have such tricks. I have done something similar to "hardcode" semi-constants like frame sizes and quantisers in critical loops related to audio and video decompression, and the performance gain is indeed measurable.
alcover•1y ago
> "hardcode" semi-constants

You mean you somehow avoided a load. But what if the constant was already placed in a register ? Also how could you pinpoint the reference to your constant in the machine code ? I'm quite profane about all this.

ronsor•1y ago
> Also how could you pinpoint the reference to your constant in the machine code?

Not OP, but often one uses an easily identifiable dummy pattern like 0xC0DECA57 or 0xDEADBEEF which can be substituted without also messing up the machine code.

mananaysiempre•1y ago
If you’re willing to parse object files (a much easier proposition for ELF than for just about anything else), another option is to have the source code mention the constants as addresses of external symbols, then parse the relocations in the compiled object. Unfortunately, I’ve been unable to figure out a reliable recipe to get a C compiler to emit absolute relocations in position-independent code, even after restricting myself to GCC and Clang for x86 Linux; in some configurations it works and in others you (rather pointlessly) get a PC-relative one followed by an add.
userbinator•1y ago
All the registers were already taken.

You use a label.

econ•1y ago
The 80's:

Say you set a value for some reason. Later you have to check IF it is set. If the condition needs to be checked many times you replace it with the code (rather than set a value to check some place). If you need to check if something is still true repeatedly you replace the condition check with no-ops when it isn't true.

Also funny are insanely large loop unrolls with hard coded valued. You could make a kind of rainbow table of those.

barchar•1y ago
It sometimes can, but you then have to balance the time spent optimizing against the time spent actually doing whatever you were optimizing.

Also on modern chips you must wait quite a number of cycles before executing modified code or endure a catastrophic performance hit. This is ok for loops and stuff, but makes a lot of the really clever stuff pointless.

The debuggers software breakpoints _are_ self-modifying code :)

112233•1y ago
Linux kernel had the same idea, and now they have "static keys". It's both impressive and terrifying.
oxcabe•1y ago
It's impressive how well laid out the content in this article is. The spacing, tables, and code segments all look pristine to me, which is especially helpful given how dense and technical the content is.
AStonesThrow•1y ago
It was designed by Elves on Christmas Island where Dwarves run the servers and Hobbits operate the power plant
f1shy•1y ago
I have the suspicion that there is a high correlation between how organized the content is, and how organized and clear the mind of the writer is.
ivanjermakov•1y ago
I had a great experience writing self modified programs is a single instruction programming game SIC-1: https://store.steampowered.com/app/2124440/SIC1/
ycombinatrix•1y ago
Cool recommendation, will give it a try.
Someone•1y ago
Fun article, but the resulting code is extremely brittle:

- assumes x86_64

- makes the invalid assumption that functions get compiled into a contiguous range of bytes (I’m not aware of any compiler that violates that, but especially with profile-guided optimization or compilers that try to minimize program size, that may not be true, and there is nothing in the standard that guarantees it)

- assumes (as the article acknowledges) that “to determine the length of foo(), we added an empty function, bar(), that immediately follows foo(). By subtracting the address of bar() from foo() we can determine the length in bytes of foo().”. Even simple “all functions align at cache lines” slightly violates that, and I can see a compiler or a linker move the otherwise unused bar away from foo for various reasons.

- makes assumptions about the OS it is running on.

- makes assumptions about the instructions that its source code gets compiled into. For example, in the original example, a sufficiently smart compiler could compile

  void foo(void) {
    int i=0;
    i++;
    printf("i: %d\n", i);
  }
as

  void foo(void) {
    printf("1\n");
  }
or maybe even

  void foo(void) {
    puts("1");
  }
Changing compiler flags can already break this program.

Also, why does this example work without flushing the instruction cache after modifying the code?

nekitamo•1y ago
For the mainstream OSes (Windows, OSX, Linux Android) You don't need to flush the instruction cache on most x86 CPUs after modifying the code segment dynamically, but you do on ARM and MIPS.

This has burned me before while writing a binary packer for Android.

saagarjha•1y ago
They check all those assumptions by disassembling the code.
Cloudef•1y ago
> self-modifying code > brittle

I mean that is to be very much expected, unless someone comes up with a programming language that fully embraces the concept.

znpy•1y ago
The author clearly explained that the whole article is more a demonstration for illustrative purposes than anything else.

> Changing compiler flags can already break this program.

That's not the point of the article.

xixixao•1y ago
I’ve been thinking a lot about this topic lately, even studying how executables look on arm macOS. My motivation was exploring truly fast incremental compilation for native code.

The only way to do this now on macOS is remapping whole pages as JIT. This makes it quite a challenge but still it might work…

Cloudef•1y ago
Kaze Emanuar's "Optimizing with Bad Code" video also goes briefly go through self-modifying code https://www.youtube.com/watch?v=4LiP39gJuqE
pfdietz•1y ago
A program that can generate, compile, and execute new code is nothing special in the Common Lisp world. One can build lambda expressions, invoke the compile function on them, and call the resulting compiled functions. One can even assign these functions to the symbol-function slot of symbols, allowing them to be called from pre-existing code that had been making calls to that function named by that symbol.
BenjiWiebe•1y ago
I know that no other language can match Lisp, but many languages can generate and execute new code, if they're interpreted. Compile, too, if they're JITted. They all require quite a bit of runtime support though.
DrZhvago•1y ago
Someone correct me if I am wrong, but self-mutating code is not as uncommon as the author portrays it. I thought the whole idea of hotspot optimization in a compiler is essentially self-mutating code.

Also, I spent a moderately successful internship at Microsoft working on dynamic assemblies. I never got deep enough into that to fully understand when and how customers where actually using it.

https://learn.microsoft.com/en-us/dotnet/fundamentals/reflec...

iamcreasy•11mo ago
Is it possible to mutate the text segment by another process? For example, injecting something malicious instead of exec-ing a shell?

It's time to address the looming crisis in entry-level work

https://www.technologyreview.com/2026/05/26/1137865/its-time-to-address-the-looming-crisis-in-ent...
1•joozio•30s ago•0 comments

RepoRecon – a Claude Code plugin that validates project ideas against GitHub

https://github.com/suleman-dawood/reporecon
1•sulemandawood•45s ago•0 comments

US law enforcement warns of "anti-tech extremism" as AI hatred grows

https://arstechnica.com/ai/2026/05/us-law-enforcement-warns-of-anti-tech-extremism-as-ai-hatred-g...
2•ndsipa_pomu•1m ago•0 comments

Static Forms – open alternative to Formspree for static websites

https://static-forms.com
1•upggr•1m ago•0 comments

Dense vs. Moe Model

https://engineersmeetai.substack.com/p/dense-vs-moe-models-explained
1•Sathya_sns•1m ago•0 comments

Jensen Huang says CEOs who blame AI for layoffs are giving a 'lazy' excuse

https://www.businessinsider.com/nvidia-ceo-jensen-huang-ai-job-cuts-losses-lazy-narrative-2026-5
1•theanonymousone•1m ago•0 comments

Trying Out C++ as a Web Developer

https://hooby.blog/posts/trying-cpp-as-a-webdev/
1•hooby•2m ago•0 comments

Avatar 4.0 – A living AI organism with physics body, emotions, on a GTX 1660 Ti

https://github.com/linga009/Avatar
1•linga009•6m ago•0 comments

Plan Mode Is a Crutch

https://graphcoder.ai/blog/plan-mode-is-a-crutch
3•ramstar3000•7m ago•0 comments

Readable.css

https://readable-css.freedomtowrite.org/
1•birdculture•8m ago•0 comments

GSM, UMTS, LTE and 5G Standard Protocols and Procedures for Lawful Interception [pdf]

https://www.etsi.org/deliver/etsi_ts/133100_133199/133128/16.19.00_60/ts_133128v161900p.pdf
1•azalemeth•9m ago•0 comments

Spanish police raid headquarters of PM Sánchez's Socialist Party

https://apnews.com/article/spain-socialist-headquarters-police-raid-043e048333ea415a6ece0a6bf02fe6da
1•embedding-shape•9m ago•0 comments

A zero-dependency GitHub Issue poller for multi-agent coding teams

https://gist.github.com/atraining/d666e2d20e5abfdf92eb74a0d5f4918d
2•chelm•11m ago•0 comments

Show HN: Legato – a Rust audio graph framework with a minimal DSL

https://legato.gg/docs/getting-started
1•lukeweston1234•11m ago•0 comments

The Black Death (2016)

https://aeon.co/essays/what-caused-the-black-death-and-could-it-strike-again
1•downbad_•12m ago•0 comments

A Guideline of Performing Ibadah at the International Space Station [pdf]

https://theislamicworkplace.com/wp-content/uploads/2007/10/a_guideline_ibadah_at_iss.pdf
1•thunderbong•12m ago•0 comments

Domux

https://github.com/pranav7/domux
1•pranav7•14m ago•1 comments

Paxton's Texas Victory Creates a New Battleground for Senate Control

https://www.nytimes.com/2026/05/27/us/politics/paxton-talarico-texas-senate-race.html
1•Cider9986•14m ago•0 comments

Show HN: Next.js internal tools boilerplate – auth, RBAC, audit logs, jobs

https://coreui.io/product/next-js-boilerplate/
1•mrholek•14m ago•0 comments

Startup API credits for teams testing AI workflows

https://wisgate.ai/startup-credits
1•kevin_wang7•15m ago•0 comments

I'm Tired of Talking to AI

https://orchidfiles.com/im-tired-of-ai-generated-answers/
2•theorchid•18m ago•0 comments

Show HN: Axion – Browser-based guitar amp/effects rig

https://axion.cab/
1•rhysfonixone•19m ago•0 comments

Show HN: Chat Hoarding – Mac app to archive WhatsApp backups locally

https://chathoarding.app/
3•zzeynalov•22m ago•2 comments

The Despair of the Professor in the Age of A.I

https://www.newyorker.com/news/fault-lines/the-despair-of-the-professor-in-the-age-of-ai
1•YeGoblynQueenne•24m ago•0 comments

A file-level tree that lets an LLM reason over a document corpus

https://pageindex.ai/blog/pageindex-filesystem
1•cccaaai•24m ago•0 comments

Modern Web Guidance

https://developer.chrome.com/docs/modern-web-guidance
1•pramodbiligiri•25m ago•0 comments

Show HN: I hand-write 5 daily word puzzles before work

https://www.dailyworder.com/
1•DailyWorder•31m ago•0 comments

Show HN: Generate 54 social media assets in 1 click

https://socialpacks.co/
1•danielkempe•31m ago•0 comments

Tell HN: First commit on Linux Kernel GitHub Page is from 30th of April 2005

https://github.com/torvalds/linux/commit/1ddb8a16aa0e60e7fdc48b1f532cf43e692f8fae
2•theanonymousone•40m ago•1 comments

Mexican President Responds to World Cup Piracy Concerns, Prefers Open Broadcasts

https://torrentfreak.com/mexican-president-responds-to-world-cup-piracy-concerns-prefers-open-bro...
4•Cider9986•44m ago•0 comments