Derivation and Intuition behind Poisson distribution

https://antaripasaha.notion.site/Derivation-and-Intuition-behind-Poisson-distribution-1255314a56398062bf9dd9049fb1c396
105•sebg•9mo ago

Comments

meatmanek•9mo ago
Poisson distributions are sort of like the normal distribution for queuing theory for two main reasons:

1. They're often a pretty good approximation for how web requests (or whatever task your queuing system deals with) arrive into your system, as long as your traffic is predominantly driven by many users who each act independently. (If your traffic is mostly coming from a bot scraping your site that sends exactly N requests per second, or holds exactly K connections open at a time, the Poisson distribution won't hold.) Sort of like how the normal distribution shows up any time you sum up enough random variables (central limit theorem), the Poisson arrival process shows up whenever you superimpose enough uncorrelated arrival processes together: https://en.wikipedia.org/wiki/Palm%E2%80%93Khintchine_theore...

2. They make the math tractable -- you can come up with closed-form solutions for e.g. the probability distribution of the number of users in the system, the average waiting time, average number of users queuing, etc: https://en.wikipedia.org/wiki/M/M/c_queue#Stationary_analysi... https://en.wikipedia.org/wiki/Erlang_(unit)#Erlang_B_formula
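As an illustration of the kind of closed form meant here, the Erlang B blocking probability linked above can be computed with a short recurrence. This is only a sketch: the function name `erlang_b` and its parameter names are made up for the example, and it assumes Poisson arrivals, no queue (blocked requests are lost), and an offered load measured in erlangs.

```python
def erlang_b(offered_load: float, servers: int) -> float:
    """Blocking probability for `servers` servers at `offered_load` erlangs.

    Uses the standard recurrence
        B(E, 0) = 1
        B(E, m) = E * B(E, m-1) / (m + E * B(E, m-1))
    which avoids the large factorials in the textbook formula.
    """
    if offered_load < 0 or servers < 0:
        raise ValueError("offered_load and servers must be non-negative")
    blocking = 1.0  # with zero servers every arrival is blocked
    for m in range(1, servers + 1):
        blocking = offered_load * blocking / (m + offered_load * blocking)
    return blocking


if __name__ == "__main__":
    # Hypothetical load of 2 erlangs, varying the number of servers.
    for servers in (1, 2, 5, 10):
        print(servers, erlang_b(2.0, servers))
```

Adding servers at a fixed load drives the blocking probability down quickly, which is the sort of sizing question these formulas are typically used for.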

emmelaich•9mo ago
Useful for understanding load on machines. One case I had was -- N machines randomly updating a central database. The database can only handle M queries in one second. What's the chance of exceeding M?

Also related to the Birthday Problem and hash bucket hits. Though with those you're only interested in low collisions. With some queues (e.g. database above) you might be interested when collisions hit a high number.
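One way to put a number on the "chance of exceeding M" question is the Poisson tail probability. The sketch below assumes the total number of queries arriving in one second is Poisson with mean `lam` (for example N machines each sending at some small independent rate); `lam`, `m`, and the function names are illustrative, not taken from the comment.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """P(X = k) for X ~ Poisson(lam), computed in log space for stability."""
    if lam <= 0:
        raise ValueError("lam must be positive")
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))


def prob_exceeds(lam: float, m: int) -> float:
    """P(X > m): chance that more than m queries arrive in one second."""
    cdf = sum(poisson_pmf(k, lam) for k in range(m + 1))
    return max(0.0, 1.0 - cdf)  # clamp tiny negative rounding errors


if __name__ == "__main__":
    # Hypothetical numbers: 50 queries/second on average, capacity 70.
    print(prob_exceeds(50.0, 70))
```

If the machines are not independent (for example, they all run on the same cron schedule), the Poisson assumption breaks down and the real tail can be much heavier.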

PessimalDecimal•9mo ago
There is another extremely important way in which they are like the normal distribution: both are maximum entropy distributions, i.e. each is the "most generic" within their respective families of distributions.

[1] https://en.wikipedia.org/wiki/Poisson_distribution#Maximum_e...

[2] https://en.wikipedia.org/wiki/Normal_distribution#Maximum_en...

srean•9mo ago
So are the Gamma, Binomial, Bernoulli, negative Binomial, exponential, and many, many more. Maxent distributions are very common. In fact, every distribution in the exponential family is a Maxent distribution.
DAGdug•9mo ago
What’s special about this treatment? It’s the 101 part of a 101 probability course.
quirino•9mo ago
I really like the Poisson Distribution. A very interesting question I've come across once is:

A given event happens once every 10 minutes on average. We can see that:

- The expected length of the interval between events is 10 minutes.

- At a random moment in time the expected wait until the next event is 10 minutes.

- At the same moment, the expected time passed since the last event is also 10 minutes.

But then we would expect the interval between two consecutive events to be 10+10 = 20 minutes long. But we know intervals are 10 on average. What happened here?

The key is that by picking a random moment in time, you're more likely to fall into a bigger interval. By sampling a random point in time, the average interval you fall into really is 20 minutes long, but by sampling a random interval it is 10.

Apparently this is called the Waiting Time Paradox.
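The claim is easy to check numerically. The following is a rough simulation sketch, assuming the events form a Poisson process (exponentially distributed gaps with mean 10); the function name, sample sizes, and seed are arbitrary choices for the example.

```python
import bisect
import random
from itertools import accumulate


def waiting_time_paradox(mean_gap=10.0, n_events=200_000,
                         n_samples=50_000, seed=0):
    """Return (average gap, average length of the gap containing a random time)."""
    rng = random.Random(seed)
    # Exponential gaps between consecutive events give a Poisson process.
    gaps = [rng.expovariate(1.0 / mean_gap) for _ in range(n_events)]
    times = list(accumulate(gaps))  # absolute event times

    total = 0.0
    for _ in range(n_samples):
        # Pick a random moment between the first and last event.
        t = rng.uniform(times[0], times[-1])
        # Index of the first event strictly after t (clamped at the end).
        i = min(bisect.bisect_right(times, t), n_events - 1)
        total += times[i] - times[i - 1]

    return sum(gaps) / n_events, total / n_samples


if __name__ == "__main__":
    avg_gap, avg_containing = waiting_time_paradox()
    print("average gap between events:        ", avg_gap)
    print("average gap containing random time:", avg_containing)
```

Sampling intervals gives an average near 10, while sampling moments in time gives an average containing interval near 20, because long intervals cover more of the timeline.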

fc417fc802•9mo ago
> What happened here?

You went astray when you declared the expected wait and expected passed.

Draw a number line. Mark it at intervals of 10. Uniformly randomly select a point on that line. The expected average wait and passed (ie forward and reverse directions) are both 5, not 10. The range is 0 to 10.

When you randomize the event occurrences but maintain the interval as an average you change the range maximum and the overall distribution across the range but not the expected average values.

pfedak•9mo ago
If it wasn't clear, their statements are all true when the events follow a poisson distribution/have exponentially distributed waiting times.
yorwba•9mo ago
When you randomize the event occurrences, you create intervals that are shorter and longer than average. A random point is more likely to land in a longer interval, so the expected length of the interval containing a random point is greater than the expected length of a random interval.

To see this, consider just two intervals of length x and 2-x, i.e. 1 on average. A random point is in the first interval x/2 of the time and in the second one the other 1-x/2 of the time, so the expected length of the interval containing a random point is x/2 * x + (1-x/2) * (2-x) = x² - 2x + 2, which is 1 for x = 1 but larger everywhere else, reaching 2 for x = 0 or 2.
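Spelled out, the algebra for the two-interval example is as follows. The symbol L_point (the length of the interval containing a uniformly random point) is notation introduced here for convenience, and the last line is the standard length-biased sampling identity rather than something stated in the comment.

```latex
\begin{aligned}
\mathbb{E}[L_{\text{point}}]
  &= \frac{x}{2}\cdot x + \left(1-\frac{x}{2}\right)(2-x) \\
  &= \frac{x^2}{2} + 2 - 2x + \frac{x^2}{2} \\
  &= x^2 - 2x + 2 = (x-1)^2 + 1 \;\ge\; 1, \\[4pt]
\text{and in general}\quad
\mathbb{E}[L_{\text{point}}] &= \frac{\mathbb{E}[L^2]}{\mathbb{E}[L]} .
\end{aligned}
```

For exponentially distributed intervals with mean 10, E[L^2] = 200, so the general identity gives 200/10 = 20, matching the Waiting Time Paradox numbers.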

fc417fc802•9mo ago
I think I understand my mistake. As the variance of the intervals widens the average event interval remains the same but the expected average distances for a sample point change. (For some reason I thought that average distances wouldn't change. I'm not sure why.)

Your example illustrates it nicely. A more intuitive way of illustrating the math might be to suppose 1 event per 10 minutes but they always happen in pairs simultaneously (20 minute gap), or in triplets simultaneously (30 minute gap), or etc.

So effectively the earlier example that I replied to is the birthday paradox, with N people, sampling a day at random, and asking how far from a birthday you expect to be on either side.

If that counts as a paradox then so does the number of upvotes my reply received.

jwarden•9mo ago
The way I understand it, with a Poisson process, at every small moment in time there’s a small chance of the event happening. This leads to, on average, lambda events occurring during every (larger) unit of time.

But this process has no “memory” so no matter how much time has passed since the last event, the number of events expected during the next unit of time is still lambda.
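The "small chance in every small moment" picture can be made concrete: split a unit of time into n slots, let the event occur in each slot independently with probability lambda/n, and the count of events is Binomial(n, lambda/n), which approaches Poisson(lambda) as n grows. A small sketch comparing the two, with illustrative function names and an arbitrary cutoff `k_max`:

```python
import math


def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(exactly k successes in n independent trials with success chance p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def poisson_pmf(k: int, lam: float) -> float:
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)


def max_gap(lam: float, n: int, k_max: int = 20) -> float:
    """Largest pointwise difference between the two pmfs for k = 0..k_max."""
    p = lam / n  # chance of an event in each of the n tiny time slots
    return max(
        abs(binomial_pmf(k, n, p) - poisson_pmf(k, lam))
        for k in range(min(k_max, n) + 1)
    )


if __name__ == "__main__":
    for n in (10, 100, 1_000, 10_000):
        print(n, max_gap(3.0, n))
```

The gap shrinks as the slots get finer, which is the limiting argument most derivations of the Poisson distribution rely on.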

me3meme•9mo ago
From last event to this event = 10, from this event to next event = 10, so the time between the first and the third event is 20. Where is the surprise in the Waiting Time Paradox? Surely I must be missing some key ingredient here.
quirino•9mo ago
The random moment we picked in time is not necessarily an event. The expected time between the event to your left and the one to your right (they're consecutive) is 20 minutes.
me3meme•9mo ago
I think we must use conditional probability, that is, the integral of p(X|A)P(A): for example, the probability that the prior event was 5 minutes ago times the probability that the next one is 10 minutes from the previous one (that is, 1/2). This is like a Markov chain: the probability of the next state depends on the current state.
hammock•9mo ago
Poisson, Pareto/power law/Zipf, and normal distributions are really important. The top 3 for me. (What am I missing?) They are also often misused (most often the normal). It’s really good to know which to use when.
klysm•9mo ago
Normal is overused for sometimes sensible reasons though. The CLT is really handy when you have to consider sums
FilosofumRex•9mo ago
It's surprising that so few people bother to use non-parametric probability distributions. With today's computational resources, there is no need for parametric closed-form models (maybe with the exception of the Normal, for historical reasons); each dataset contains its own distribution.
klysm•9mo ago
It’s easier to do MCMC when the distributions at hand have nice analytic properties, so you can take derivatives etc. You should also have a very good understanding of the standard distributions and how they all relate to each other.
hyperbovine•9mo ago
How hard is it to estimate that distribution for modern high dimensional data?
jwarden•9mo ago
> What am I missing?

Beta

hammock•9mo ago
What are the common understandable use cases for beta distribution, in everyday life?
jwarden•9mo ago
I don’t use probability distributions in everyday life ;)

But it is the right distribution to represent uncertainty about the probability of binary events (e.g. a website user clicking some button). For example, if I have absolutely no idea what the probability is, then I use the uniform distribution, Beta(1,1), which is the maximum entropy distribution. Then if I observe one user and they happen to click, I have Beta(2,1), and at a glance I know the mean of that (2/3), which is a useful point estimate.
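The update described here is just addition on the two Beta parameters. A minimal sketch, with function names invented for the example, assuming each observation is an independent click / no-click outcome:

```python
def beta_update(alpha: float, beta: float, clicks: int, non_clicks: int):
    """Posterior Beta parameters after observing clicks and non-clicks."""
    return alpha + clicks, beta + non_clicks


def beta_mean(alpha: float, beta: float) -> float:
    """Mean of a Beta(alpha, beta) distribution."""
    return alpha / (alpha + beta)


if __name__ == "__main__":
    prior = (1, 1)                          # uniform prior, Beta(1, 1)
    posterior = beta_update(*prior, 1, 0)   # one user observed, one click
    print(posterior, beta_mean(*posterior))
```

Starting from Beta(1,1) and observing a single click gives Beta(2,1) with mean 2/3, as in the comment; every further observation just increments one of the two counts.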

klysm•9mo ago
Proportions of things frequently follow beta distributions. I think of it as the normal distribution of the domain 0 to 1.
cwmoore•9mo ago
Lightbulbs burn out, but when?
klysm•9mo ago
Later
digger495•9mo ago
Steve, le
joe_the_user•9mo ago
I can understand a message that javascript needs to be enabled for your ** site.

But permanently redirecting, so that I can't see this even after I enable JavaScript, is just uncool and might not endear you to readers on a site like HN, where lots of folks disable JS initially.

Edit: and anonymizing, disabling and reloading... It's just text with formatted math. Sooo many other solutions to this, jeesh guys.

_0ffh•9mo ago
It's Notion; I don't know why people use this service.
Zecc•9mo ago
It breaks scrolling with the arrow keys or PgDn/PgUp as well.
Rant423•9mo ago
An application of the Poisson distribution (1946)

https://garcialab.berkeley.edu/courses/papers/Clarke1946.pdf

tatrajim•9mo ago
Famously used by Thomas Pynchon in Gravity's Rainbow. The notion of obtaining a distribution of random rocket attacks blew my young mind and prompted a life-long interest in the study of statistics.
mmorse1217•9mo ago
This site is pretty helpful for me with this sort of thing. The style is more technical though.

https://www.acsu.buffalo.edu/~adamcunn/probability/probabili...

laichzeit0•9mo ago
But this just gives the definition of the distribution. No intuition about where it might have come from, it just appears magically out of thin air and shows some properties it has in the limit.
firesteelrain•9mo ago
At work we use Arena to model various systems, and Poisson is our go-to.
