frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Why GADTs matter for performance (2015)

https://blog.janestreet.com/why-gadts-matter-for-performance/
83•hyperbrainer•6mo ago

Comments

rbjorklin•6mo ago
Does anyone have some hard numbers on the expected performance uplift when using GADTs? Couldn't see any mentioned in the article.
ackfoobar•6mo ago
The example here is basically an 8-fold memory saving going from `long[]` from `byte[]` - while still retaining polymorphism (whereas in Java the two are unrelated types).

Hard to say exactly how much performance one would get, as that depends on access patterns.

misja111•6mo ago
The reason that a byte array is in reality layed out as a (mostly empty) long array in Java, is actually for performance. Computers tend to have their memory aligned at 8 byte intervals and accessing such an address is faster than accessing an address that's at an offset of an 8 byte interval.

Of course it depends on your use case, in some cases a compact byte array performs better anyway, for instance because now you're able to fit it in your CPU cache.

john-h-k•6mo ago
But you can load any byte by loading 8 bytes and shift (v cheap)
ackfoobar•6mo ago
> a byte array is in reality layed out as a (mostly empty) long array in Java

Are you saying each byte takes up a word? That is the case in the `char array` in OCaml, but not Java's `byte[]`. AFAIK The size of a byte array is rounded up to words. Byte arrays of length 1-8 all have the same size in a 64-bit machine, then length 7-16 take up one more word.

https://shipilev.net/jvm/objects-inside-out/

cosmic_quanta•6mo ago
Interesting, thanks for posting.

I share the author's frustration with the lack of non-compiler-related examples of GADT uses. It seems like such a powerful idea, but I haven't been able to get a feel for when to reach for GADTs in Haskell

wyager•6mo ago
I often find them handy for locking down admissible states at compile time. Maybe ~10 years ago in a processor design class, I wrote some CPUs in Haskell/Clash for FPGA usage. A nice thing I could do was write a single top-level instruction set, but then lock down the instructions based on what stages of the processor they could exist at.

For example, something like (not an actual example from my code, just conceptually - may be misremembering details):

  data Instruction stages where
   MovLit :: Word64 -> Register -> Instruction '[Fetch, Decode, Execute, Writeback]
   -- MovReg instruction gets rewritten to MovLit in Execute stage
   MovReg :: Register -> Register -> Instruction '[Fetch, Decode, Execute]
   ...
And then my CPU's writeback handler block could be something like:

  writeback :: (Writeback `member` stages) => Instruction stages -> WritebackState -> WritebackState
  writeback (MovLit v reg) = ...
  -- Compiler knows (MovReg _ _) is not required here
So you can use the type parameters to impose constraints on the allowed values, and the compiler is smart enough to use this data during exhaustiveness checks (cf "GADTs Meet Their Match")
anyfoo•6mo ago
Wow, someone else who (used to be) using Clash. I still use it for everything I can in my (hobby) FPGA projects. I'm not sure I've used GADTs, but I've certainly made use of other more "advanced" parts of the type system, like type families.

What you're doing here is pretty cool, I think I will start doing so, too. I have a number of places where I use "undefined" instead. (The "undefined" from the Clash Prelude, which translates into a "don't care" signal state.)

wyager•6mo ago
Clash is awesome, IMO by far the best extant HDL.

I semi-recently used it for this ADAT fiber optic audio codec https://github.com/YagerICS/adat-codec

anyfoo•6mo ago
Awesome. I see you've made full use of Hedgehog as well.
hyperbrainer•6mo ago
Related: https://github.com/ocaml/RFCs/blob/881b220adc1f358ab15f7743d...
goldchainposse•6mo ago
I know Jane Street love OCaml, but you have to wonder how much it's cost them in velocity and maintenance. This is a quant firm blogging about a programming language they're the most famous user of.
pjmlp•6mo ago
It is thanks to the companies like Jane Street that believe there is something else beyond C, that we can have nice toys.

Remember if OCaml wasn't a mature programming language, maybe Rust would not have happened in first place.

kryptiskt•6mo ago
Why do you assume it's a drag for them and not a competitive advantage? I don't know if it's such a terrible thing to use a slightly out of mainstream language, when the standard in the business is to accumulate tens of millions of lines of C++.
ackfoobar•6mo ago
Agreed, indeed I believe they have mentioned that OCaml gets them to ship quicker because they are more confident with the correctness of changes.

But being outside of the mainstream may mean you need to occasionally debug more esoteric stuff: https://gallium.inria.fr/blog/intel-skylake-bug/ I'm sure Jane Street can afford doing that, but I'm not so sure if a small team can.

gjadi•6mo ago
That was an interesting read, thanks. However I fail to see how it's an issue specific to ocaml. It was a bug in the Skylake processor triggered by a special pattern of instructions produced by gcc. Ocaml built with clang was ok because it doesn't used the same pattern. Did I miss something?
ackfoobar•6mo ago
If the JVM encountered the same bug other people would have discovered it before me. Most probably I won't even know the bug exists.
goldchainposse•6mo ago
> Why do you assume it's a drag for them and not a competitive advantage?

Because despite them being very open about it, no one else does it, and every distinguished engineer who pushes a weird tech choice will justify and defend it.

cdaringe•6mo ago
People that haven’t used ocaml think it’s weird. I picked it up casually in 2020. It might not be popular, but it’s certainly not weird. It’s actually quite fantastic. These days I rarely ever use it, but I wish I did!
keybored•6mo ago
Concretely how do you think it’s holding them back? Just by being niche?
anyfoo•6mo ago
There are many things to say about this, but one of those things is that I think you are making the assumption that an (e.g.) C programmer who does not want (or even cannot) get into OCaml would somehow be better for this highly specialized, high-performance, and high-correctness-affine use case, than someone who does. And I'd question that assumption.
fjwufjfa•6mo ago
It's easier to reason in FP plus the python paradox [1] [2].

[1]: https://www.paulgraham.com/pypar.html

[2]: https://blog.janestreet.com/why-ocaml/

codr7•6mo ago
For certain classes of programs, yes. I have a hunch finance is a pretty good fit.
AdieuToLogic•6mo ago
I agree with your point about reasoning when employing Functional Programming (FP).

However, I very much disagree with Graham's 2004 assertion[0]:

  It's a lot of work to learn a new programming language. And 
  people don't learn Python because it will get them a job; 
  they learn it because they genuinely like to program and 
  aren't satisfied with the languages they already know.
It does not require "a lot of work to learn a new programming language" once a person has fluency with at least one. Actually, the difficulty of learning a new programming language is inversely proportional to how many programming languages the person has already learned. Especially if a new programming language is in the same paradigm category as those already known (Procedural, OOP, FP, etc.).

I was a professional software engineer in 2004, when the Graham post was written. To say, "people don't learn Python because it will get them a job ..." was bullshit then just as it is now. The remainder of the quoted sentence is unfounded extrapolation and has the value of same.

0 - https://www.paulgraham.com/pypar.html

lmm•6mo ago
Jane Street has been one of the most successful financial firms of the last 10 years or so, going from a niche hedge fund to a big player. Sounds like OCaml has been working out for them. Certainly I know it's helped them hire a lot of excellent programmers.
cryptonector•6mo ago
What's not clear from reading TFA is whether the compiler monomorphizes TFA's `Compact_array` for the two special cases of it (array of bytes vs. array of anything else), but I'm assuming so. Perhaps if I was familiar with OCaml the answer would be blindingly obvious. What's happening here is that w/ GADTs you can have a _singular_ abstraction with multiple distinct implementations for specific types and others for generic types, and you don't have to think about it too much, except you have to remember to use these type hints in the interface definitions to get the compiler to do what you want.

> Yaron Minsky joined Jane Street back in 2002, and claims the dubious honor of having convinced the firm to start using OCaml.

That's pretty cool. And I guess Stephen Dolan ended up there due to his work on OCaml, which is pretty cool too. (I'd like to meet Stephen some day.)

Open-source Zig book

https://www.zigbook.net
320•rudedogg•5h ago•105 comments

Britney Spears' Guide to Semiconductor Physics

https://britneyspears.ac/lasers.htm
76•lachlan_gray•1h ago•27 comments

Browser fingerprinting via favicon

https://github.com/jonasstrehle/supercookie
186•vxvrs•5h ago•42 comments

Heretic: Automatic censorship removal for language models

https://github.com/p-e-w/heretic
384•melded•10h ago•152 comments

What if you don't need MCP at all?

https://mariozechner.at/posts/2025-11-02-what-if-you-dont-need-mcp/
119•jdkee•6h ago•68 comments

Dark Pattern Games

https://www.darkpattern.games
94•robotnikman•5h ago•48 comments

The Pragmatic Programmer: 20th Anniversary Edition (2023)

https://www.ahalbert.com/technology/2023/12/19/the_pragmatic_programmer.html
63•ahalbert2•4h ago•8 comments

PicoIDE – An open IDE/ATAPI drive emulator

https://picoide.com/
10•st_goliath•1h ago•0 comments

How Your Brain Creates 'Aha' Moments and Why They Stick

https://www.quantamagazine.org/how-your-brain-creates-aha-moments-and-why-they-stick-20251105/
21•wjb3•3h ago•0 comments

Z3 API in Python: From Sudoku to N-Queens in Under 20 Lines

https://ericpony.github.io/z3py-tutorial/guide-examples.htm
79•amit-bansil•6h ago•3 comments

Goldman Sachs asks in biotech Report: Is curing patients a sustainable business?

https://www.cnbc.com/2018/04/11/goldman-asks-is-curing-patients-a-sustainable-business-model.html
94•randycupertino•2h ago•47 comments

I finally understand Cloudflare Zero Trust tunnels

https://david.coffee/cloudflare-zero-trust-tunnels
103•eustoria•7h ago•35 comments

Why your mock breaks later

https://nedbatchelder.com/blog/202511/why_your_mock_breaks_later.html
17•ingve•2h ago•6 comments

FPGA Based IBM-PC-XT

https://bit-hack.net/2025/11/10/fpga-based-ibm-pc-xt/
133•andsoitis•9h ago•27 comments

The fate of "small" open source

https://nolanlawson.com/2025/11/16/the-fate-of-small-open-source/
135•todsacerdoti•5h ago•93 comments

A File Format Uncracked for 20 Years

https://landaire.net/a-file-format-uncracked-for-20-years/
14•todsacerdoti•1w ago•2 comments

I have recordings proving Coinbase knew about breach months before disclosure

https://jonathanclark.com/posts/coinbase-breach-timeline.html
300•jclarkcom•4h ago•104 comments

Fourier Transforms

https://www.continuummechanics.org/fourierxforms.html
96•o4c•1w ago•13 comments

Shell Grotto, Margate

https://en.wikipedia.org/wiki/Shell_Grotto,_Margate
33•Michelangelo11•1w ago•4 comments

Linux mode setting, from the comfort of OCaml

https://roscidus.com/blog/blog/2025/11/16/libdrm-ocaml/
39•ibobev•5h ago•4 comments

Brimstone: ES2025 JavaScript engine written in Rust

https://github.com/Hans-Halverson/brimstone
184•ivankra•13h ago•92 comments

Decoding Leibniz Notation (2024)

https://www.spakhm.com/leibniz
31•coffeemug•6h ago•5 comments

Show HN: Spam classifier in Go using Naive Bayes

https://github.com/igomez10/nspammer
13•igomeza•1w ago•1 comments

Lithium vs. Lettuce

https://ambrook.com/offrange/photo-essay/lithium-v-lettuce
25•mfburnett•1d ago•3 comments

Anthropic’s paper smells like bullshit

https://djnn.sh/posts/anthropic-s-paper-smells-like-bullshit/
833•vxvxvx•13h ago•259 comments

Garbage collection is useful

https://dubroy.com/blog/garbage-collection-is-useful/
111•surprisetalk•11h ago•36 comments

Waiting for SQL:202y: Group by All

http://peter.eisentraut.org/blog/2025/11/11/waiting-for-sql-202y-group-by-all
41•ingve•5d ago•17 comments

De Bruijn Numerals

https://text.marvinborner.de/2023-08-22-22.html
63•marvinborner•9h ago•7 comments

Measuring the doppler shift of WWVB during a flight

https://greatscottgadgets.com/2025/10-31-receiving-wwvb-with-hackrf-pro/
118•Jyaif•1w ago•2 comments

The Man Who Keeps Predicting the Web's Death

https://tedium.co/2025/10/25/web-dead-predictions-george-colony/
40•thm•7h ago•26 comments