Speed-coding for the 6502 – a simple example

https://www.colino.net/wordpress/en/archives/2025/08/28/speed-coding-for-the-6502-a-simple-example/

13•mmphosis•2h ago

Comments

rbanffy•2h ago

What a delightful short read.

They could go one step further and calculate the table as needed and use it as a cache.

When scaling a single image, that might be a little faster.

spc476•21m ago

If you read the entire article, they do that at the end of the article.

anyfoo•6m ago

Not quite. They build the entire table upfront, whether any individual entry is needed or not.

Making it an on-demand cache instead is a neat next step. Whether it helps or hurts depends on the actual input: if the input image uses every pixel value anyway, the check for whether a table entry has already been computed is pure overhead.

But if a typical image only uses a few pixel values, then the amortized cost of just calculating the few needed table entries may very well be significantly below the cost of the current algorithm.

If images are somewhere in between, or their characteristics are not well known, then simply measuring both approaches with typical input data is the way to go!

Unless you’re perfectly happy with 0.2 seconds, for example because some other part of the program takes so long that it dwarfs those 0.2s. Then why bother.
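A minimal sketch of that trade-off, written in Python rather than 6502 assembly so it can be run directly. The `scale` function is a hypothetical stand-in for the per-pixel computation (assumed here to be floor(3x/4), which matches the table posted further down the thread), and `LazyTable` and the other names are illustrative only.

```python
def scale(x):
    """Hypothetical stand-in for the expensive per-pixel computation.

    Assumption: floor(3 * x / 4), inferred from the table in this thread.
    """
    return (3 * x) // 4


def build_full_table():
    """Upfront approach: compute all 256 entries, needed or not."""
    return [scale(x) for x in range(256)]


class LazyTable:
    """On-demand approach: compute an entry the first time it is requested."""

    def __init__(self):
        self.entries = [None] * 256
        self.computed = 0  # how many entries were actually calculated

    def lookup(self, x):
        # This check runs for every pixel; it is the extra overhead
        # that is wasted when the image uses every pixel value anyway.
        if self.entries[x] is None:
            self.entries[x] = scale(x)
            self.computed += 1
        return self.entries[x]


def scale_image_full(pixels):
    table = build_full_table()
    return [table[p] for p in pixels]


def scale_image_lazy(pixels):
    table = LazyTable()
    out = [table.lookup(p) for p in pixels]
    return out, table.computed
```

For an image with only a few distinct pixel values, `scale_image_lazy` calculates only those entries, at the price of one extra check per pixel. Which variant wins on real hardware would have to be measured.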

Joker_vD•4m ago

I wonder if building this table can be sped up by noticing a recurring pattern?

    x    0  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 ... 254 255
    f(x) 0  0  1  2  3  3  4  5  6  6  7  8  9  9 10 11 12 12 ... 190 191

So something like

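    ; on entry: A = 0, Y = 0, carry clear (A never exceeds 192, so carry stays clear)
    sta table,y     ; f(4k)   = 3k
    iny
    sta table,y     ; f(4k+1) = 3k
    adc #1
    iny
    sta table,y     ; f(4k+2) = 3k+1
    adc #1
    iny
    sta table,y     ; f(4k+3) = 3k+2
    adc #1          ; A = 3(k+1) for the next group of four
    iny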

used as a loop body repeated 64 times should work. Will it take fewer than 6000 cycles in total?
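One way to check both the pattern and the cycle question is a small Python model of the loop. This is a sketch under stated assumptions: each `adc` step is taken to add exactly 1 to the accumulator (which is what the table pattern requires), the loop is closed with a `bne` that is charged 3 cycles with no page crossing, and the target function is assumed to be floor(3x/4). The cycle counts are the standard NMOS 6502 timings for `sta abs,y` (5), `iny` (2), and `adc #imm` (2).

```python
# Assumed per-instruction cycle costs (NMOS 6502).
STA_ABS_Y = 5   # sta table,y (stores always take 5 cycles)
INY = 2
ADC_IMM = 2     # adc with an immediate operand
BNE_TAKEN = 3   # assumed loop branch, no page crossing


def build_table():
    """Model the proposed loop body repeated 64 times.

    Returns the 256-entry table and an estimated cycle count.
    """
    table = [0] * 256
    a = 0        # accumulator
    y = 0        # index register
    cycles = 0
    for _ in range(64):
        table[y] = a          # sta table,y
        y = (y + 1) & 0xFF    # iny
        table[y] = a          # sta table,y
        a += 1                # adc (adds 1)
        y = (y + 1) & 0xFF    # iny
        table[y] = a          # sta table,y
        a += 1                # adc (adds 1)
        y = (y + 1) & 0xFF    # iny
        table[y] = a          # sta table,y
        a += 1                # adc (adds 1)
        y = (y + 1) & 0xFF    # iny (Y wraps to 0 after the last group)
        cycles += 4 * STA_ABS_Y + 4 * INY + 3 * ADC_IMM + BNE_TAKEN
    return table, cycles


table, cycles = build_table()
matches = table == [(3 * x) // 4 for x in range(256)]
```

Under these assumptions each iteration costs 37 cycles, so the estimate is 64 × 37 = 2368 cycles, comfortably below 6000. Fully unrolling the loop would drop the branch and bring it to 64 × 34 = 2176.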

anyfoo•3m ago

Very neat. Breaking up multiplications and divisions into bit shifts, and using lookup tables to trade memory for runtime, is nothing new to engineers working at the low level, but this paints a very pretty picture of how it looks in practice.
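As a small illustration of the shift idea, here is a Python sketch that computes three quarters of a byte value using only shifts and one addition. The 3/4 factor is an assumption inferred from the table posted in this thread, and `three_quarters` is a hypothetical name, not something taken from the article.

```python
def three_quarters(x):
    """Compute floor(3 * x / 4) with shifts and an add, no multiply or divide."""
    doubled = x << 1        # 2 * x
    tripled = doubled + x   # 3 * x (up to 765 for x = 255, so it needs 10 bits)
    return tripled >> 2     # divide by 4, discarding the remainder


results = [three_quarters(x) for x in range(256)]
```

On an 8-bit CPU the intermediate `3 * x` does not fit in one byte, so a real implementation has to carry the overflow into a second byte. That is part of why a 256-entry lookup table is attractive.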
