“Are you the one?” is free money

https://blog.owenlacey.dev/posts/are-you-the-one-is-free-money/

481•samwho•1mo ago

Comments

gus_massa•2mo ago

> Score Probability

> 0 0.3679

> 1 0.3679

> 2 0.1839

> 3 0.0613

> 4 0.0153

> 5 0.0031

For 0, it's a well known [1] result 1/e, I remember a puzzle where people left their hat and then pick one randomly.

Looking at the table it looks like the general formula is 1/(e*n!) that is a Possion distribution. Compare with https://en.wikipedia.org/wiki/Poisson_distribution#Examples_...

Anyway, I'm not sure if my observation helps too much to solve the original problem.

[1] at least I know it :)

gaogao•1mo ago

> Season 8: In this season, they introduced gender fluidity. Whilst an interesting problem on its own, this would have wreaked havoc on my model.

Well I guess free money except for that one. In that one, one of the contestants, Danny, did the math for optimizing their remaining Truth Booths and Match Ups to get it down to a 50/50 shot.

yieldcrv•1mo ago

how much free money are we talking here? what are they awarded?

cgriswald•1mo ago

> If the group can correctly guess all the perfect matches, they win a cash prize of $1M.

mnw21cam•1mo ago

Is that each, or divided between 20 people?

jdkn•1mo ago

Highly enjoyable, high effort blogging

owenlacey•1mo ago

Made my day - thanks

Medea•1mo ago

They have an example that calculates the expected information gained by truth booths and all of the top ones are giving more than one bit. How can this be? It is a yes/no question a max of 1 bit should be possible

latortuga•1mo ago

Because when it's true, you also learn about any prior match ups involving those two people.

MarkusQ•1mo ago

That's not how information works. Learning more from one outcome than the other decreases the probability of that outcome occurring, so the expected information (which is the sum of the outcome probability times the outcome information for each of the two possible outcomes) is always less than or equal to one.

If all you can get is a "true" or "false" you expect, at most, one bit of information.

jncfhnb•1mo ago

I’m not really following. But if you’re told that one of A, B, or C is true; you learn more by being told A is True than if you learn D is True, no?

hatthew•1mo ago

Yes, you learn more than 1 bit in that case. However, if you are told A is false, you still don't know whether B or C is true, so you gain less than 1 bit. Assuming A, B and C all have equal probability, your average/expected information gain is <1 bit.

If you ask the question "which of A, B, or C is true?" then you're not asking a yes/no question, and it's not surprising that you expect to gain more than 1 bit of information.

jncfhnb•1mo ago

but that’s all consistent. “Expected” gain is less than 1 for the truth booths and sometimes > 1 for actuals; and is > 1 on expected value of the match ups, which aren’t binary questions.

hatthew•1mo ago

Sure, and the issue is that the article says "Suppose we have calculated the expected information gained by potential truth booths like below:" and then lists some values >1

edit: just saw that the article fixed this recently, and the values are now <1

kevindamm•1mo ago

It's not a yes/no per contestent, it's per edge between contestants. There are n(n-1)/2 of these.

A true answer for a potential match is actually a state update for all of the (n-1) edges connecting either contestant, that's 2(n-2) edges that can be updated to be false. Some of these may already be known from previous rounds' matchups but that's still more than a single binary.

hatthew•1mo ago

An answer of "yes" will generally eliminate many edges, with potential for >1 bit. However, an answer of "no" will generally eliminate just that one edge, which is generally <1 bit.

MarkusQ•1mo ago

But you don't receive more than a single binary value; you get a yes or no.

If both of these are equally likely, you gain one bit of information, the maximum possible amount. If you already have other information about the situation, you might gain _less_ than one bit on average (because it confirms something you already knew, but doesn't provide any new information), but you can't gain more.

twoodfin•1mo ago

If I’m trying to guess a 9-letter English word, and test whether the first letter is “x”, there are only the same two answers: Yes/No.

But “Yes” obviously gives me much more than one bit of the information I need to know the answer.

Dylan16807•1mo ago

But that "yes" is so unlikely that your expected/average information is still 1 bit or less.

twoodfin•1mo ago

The claim was that one bit was the maximum amount of information you could gain, which is clearly false.

Just to make this unambiguous: If you ask me to guess a number between one and one billion, and by fantastic luck I guess right, your “yes/no” answer obviously gives me more than one bit of information as to the right answer.

Dylan16807•1mo ago

> The claim was that one bit was the maximum amount of information you could gain, which is clearly false.

That's not what I see.

https://news.ycombinator.com/item?id=46282007 They have an example that calculates the expected information gained by truth booths and all of the top ones are giving more than one bit. How can this be? It is a yes/no question a max of 1 bit should be possible

https://news.ycombinator.com/item?id=46282343 the expected information (which is the sum of the outcome probability times the outcome information for each of the two possible outcomes) is always less than or equal to one.

The specific comment you replied to had one sentence that didn't say "expected" or "average", but the surrounding sentences and comments give context. The part you objected to was also trying to talk about averages, which makes it not false.

twoodfin•1mo ago

Can’t gain more!

The core confusion is this idea that the answer to a yes/no question can’t provide more than one bit of information, no matter what the question or answer. This is false. The question itself can encode multiple bits of potential information and the answer simply verifies them.

Dylan16807•1mo ago

> Can’t gain more!

"you might gain _less_ than one bit on average [...], but you can't gain more."

On. Average.

That's a true statement. Can't gain more than one bit on average.

twoodfin•1mo ago

I’m not arguing with that, it’s basic information theory.

One bit, however, is not “the maximum possible amount” you can gain from an oracular answer to a yes/no question. The OP covers exactly this point re: the “Guess Who?” game.

Dylan16807•1mo ago

The start of this comment thread was a complaint that OP is showing more than one bit expected for certain yes/no answers. Not best case, expected.

That's why people are talking about the maximum expected value.

sebastos•1mo ago

Right - but coming back to the original question, if I'm not mistaken, the explanation is that the blogpost is measuring information gained from an actual outcome, as opposed to _expected_ information gain. An example will help:

Say you're trying to guess the number on a 6-sided die that I've rolled. If I wanted to outright tell you the answer, that would be 2.58 bits of information I need to convey. But you're trying to guess it without me telling, so suppose you can ask a yes or no question about the outcome. The maximum of the _expected_ information add is 1 bit. If you ask "was it 4 or greater?", then that is an optimal question, because the expected information gain is min-maxed. That is, the minimum information you can gain is also the maximum: 1 bit. However, suppose you ask "was it a 5?". This is a bad question, because if the answer is no, there are still 5 numbers it could be. Plus, the likelihood of it being 'no' is high: 5/6. However, despite these downsides, it is true that 1/6 times, the answer WILL be yes, and you will gain all 2.58 bits of information in one go. The downside case more than counteracts this and preserves the rules of information theory: the _expected_ information gain is still < 1 bit.

EDIT: D'oh, nevermind. Re-reading the post, it's definitely talking about >1 bit expectations of potential matchings. So I don't know!

stevage•1mo ago

You also learn about other pairings now being impossible.

mnw21cam•1mo ago

No, that doesn't make sense either. For a truth booth, you're taking all the possible pairing arrangements, and dividing them into two sets. After the answer, one of those two sets is false. There is no way that this can provide more than 1 bit of information.

The match-ups can however give more information, as it isn't giving a yes/no answer.

tobyjsullivan•1mo ago

The author defines one “bit” as ruling out half the remaining options.

So a yes might rule out 75% of remaining options (for example) which provides 2 bits of information.

hatthew•1mo ago

We have to make a distinction between "expected information gain" vs "maximum information gain". An answer of "yes" generally gains >1 bit, but an answer of "no" generally gains <1 bit, and the average outcome ends up <1. It is impossible for a single yes/no question to have an expected gain of >1; the maximum possible is precisely 1.

tobyjsullivan•1mo ago

The total probabilities add up to 1. But I’m not following how that relates to the average bits.

Despite summing to 1, the exact values of P(true) and P(false) are dependent on the options which have previously been discounted. Then those variables get multiplied by the amount of information gained by either answer.

hatthew•1mo ago

The article states "Suppose we have calculated the expected information gained by potential truth booths like below: Expected information: 1.60 bits ..." This is impossible because of the general fact in information theory that (p(true) * bits_if_true) + (p(false) * bits_if_false) <= 1. If they had said "Suppose we have calculated the maximum information gained...", then 1.6 bits would be valid. They said "expected information" though, so 1.6 bits is invalid.

adastra22•1mo ago

It is definitional, which I mean in the strictest mathematical sense: the information content of a result is directly derived from how “unexpected” it is.

A result which conveys 2 bits of information should occur with 25% expected probability. Because that’s what we mean by “two bits of information.”

thaumasiotes•1mo ago

So, you have n options, you ask a question, and now you're down to m options.

The number of bits of information you gained is -log₂ (m/n).

If you ask a question which always eliminates half of the options, you will always gain -log₂ (1/2) = 1 bit of information.

If you go with the dumber approach of taking a moonshot, you can potentially gain more than that, but in expectation you'll gain less.

If your question is a 25-75 split, you have a 25% chance of gaining -log₂ (1/4) = 2 bits, and a 75% chance of gaining -log₂ (3/4) = 0.415 bits. On average, this strategy will gain you (0.25)(2) + (0.75)(0.415) = 0.8113 bits, which is less than 1 bit.

The farther away you get from 50-50, the more bits you can potentially gain in a single question, but - also - the lower the number of bits you expect to gain becomes. You can never do better than an expectation of 1 bit for a trial with 2 outcomes.

(All of this is in the article; see footnote 3 and its associated paragraph.)

The article explicitly calls out the expectational maximum of one bit:

>> You'll also notice that you're never presented with a question that gives you more than 1 expected information, which is backed up by the above graph never going higher than 1.

So it's strange that it then goes on to list an example of a hypothetical (undescribed, since the scenario is impossible) yes/no question with an expected gain of 1.6 bits.

owenlacey•1mo ago

Great spot! The max expected information is 1. I've updated this part of the post to only show examples that are < 1, thank you for raising!

akoboldfrying•1mo ago

Do you mean the diagram following the sentence "Suppose we have calculated the expected information gained by potential truth booths like below:"?

Yes, that looks like a mistake -- a truth booth only has 2 outcomes, so it can produce at most 1 bit of information.

Regarding the other mentions on the page of information levels exceeding 1 bit: Those are OK, since they allow match-ups, which for 6 people have 7 possible outcomes, thus can yield up to log2(7) ≈ 2.81 bits.

jellevdv•1mo ago

what a nice interactive blogpost, that's amazing all the effort that went into it

owenlacey•1mo ago

thanks so much - glad you enjoyed it <3

verteu•1mo ago

Fun post. I'd be interested to know: How many consecutive Truth Booths (or: how many consecutive Match Ups) are needed to narrow down the 10! possibilities to a single one?

Discussing "events" (ie, Truth Booth or Match Up) together muddles the analysis a bit.

I agree with Medea above that a Truth Booth should give at most 1 bit of information.

jncfhnb•1mo ago

If you can only check pairings one at a time I’m not sure it’s possible to do better than greedily solving one person at a time.

mnw21cam•1mo ago

Agreed. There's an argument elsewhere about how a truth booth can possibly have an expected return of more than 1 bit of information, but in reality most of the time it's going to give you way less than that.

vitus•1mo ago

So, for 10 pairs, 45 guesses (9 + 8 + 7 + 6 + 5 + 4 + 3 + 2 + 1) in the worst case, and roughly half that on average?

It's interesting how close 22.5 is to the 21.8 bits of entropy for 10!, and that has me wondering how often you would win if you followed this strategy with 18 truth booths followed by one match up (to maintain the same total number of queries).

Simulation suggests about 24% chance of winning with that strategy, with 100k samples. (I simplified each run to "shuffle [0..n), find index of 0".)

owenlacey•1mo ago

Based on my research, MUs perform better than TBs. For my simulated information theories, the MUs gained ~2 bits of information on average vs ~1.1 for TBs.

So if only MUs, we're talking around 10 events - meaning you could get enough information on MUs alone to win the game! Conversely, it would take about 20 events to do this just for TBs.

It's not super obvious from the graphs, but you can just about notice that the purple dots drop a bit lower than the pink ones!

Hope this helps

wonger_•1mo ago

> I also pitched this idea to The Pudding, and had a great experience with them nerding out about this subject. Though they didn't take my up on my idea, I left with really great and actionable feedback, and I'm looking forward to my next rejection.

Would've been a great Pudding post imo, but oh well, happy to find this nice devblog instead.

travisjungroth•1mo ago

This brings up an area that’s been on the edge of my curiosity for years: how do you combine the knowledge of the experts (contestants) with logic to do better either than either strategy individually?

It’s mostly about how to elicit the information from the contestants. Once you have the probabilities from them, it seems relatively straightforward.

Yossarrian22•1mo ago

I think you could do some form of Bayesian analysis with the prior being how likely each contestant thought that their available partners were "The One" for each other. Then the truth booth would be used to update your priors.

jncfhnb•1mo ago

I saw an episode of this and felt the contestants didn’t seem that interested in winning the money. Just romance. I was curious how suboptimally they tended to play.

codebje•1mo ago

Everything is lined up for sub-optimal play.

For a start, the setting is an emotive one. It's not just a numeric game with arbitrary tokens, it's about "the perfect romantic partner." It would take an unusually self-isolating human to not identify who they feel their perfect match should be and bias towards that, subconsciously or consciously. We (nearly) all seek connection.

Then, it's reality TV. Contestants will be chosen for emotional volatility, and relentlessly manipulated to generate drama. No-one is going to watch a full season of a handful of math nerds take a few minutes to progress a worksheet towards a solution each week coupled with whatever they do to pass the time otherwise.

pants2•1mo ago

I'd watch a game show where you put a variety of math nerds on each team and watch them argue about the optimal strategy. Who's strategy will win? The quant analyst or the bioinformatician? Tune in next week!

bryanhogan•1mo ago

Reminds me a bit of Devils Plan, or other similar reality game shows in Korea / Asia.

empathy_m•1mo ago

Watched a fantastic film about this on the plane a few years ago, "Liar Game - Reborn". There is some fairly sophisticated logic puzzling and scheming going on (see e.g. sample illustration https://imgur.com/a/0AOb67G from an interlude about 50min in where there are 3 groups of people who mutually distrust each other, each know a secret collection of 3-4 integers unique to their group, and want to deniably pass share integers with each other which are "not my team's". Another participant watches what happened and realizes in retrospect this is how the info was shared.)

A lot more upbeat than "Alice in Borderland".

jncfhnb•1mo ago

I saw a couple devils plan and felt they were trying to imply Mathematical scheming without substance. There was one guy who kept talking about “science” but then his “game theory” for a game of mafia was “social distancing” like Covid…

nrhrjrjrjtntbt•1mo ago

Need to find out their psycopath screening technique

yaur•1mo ago

Just Ask them to describe Shannon Entropy. If they start talking about information they are out, if they start talking about their crazy cousin they are in.

pityJuke•1mo ago

Charlie Brooker did a good bit (in 2007!) about picking the right candidates for a mock reality TV show: https://youtube.com/watch?v=NGkJxju3uKo

bronco21016•1mo ago

We need a ManningCast version of the show. For those unaware, ManningCast is a show following an NFL game with special guests and nontraditional commentary and analysis. Think of it kind of like having the Mannings in the living room while watching an NFL game.

In my hypothetical version of "Are you the one?", the math nerds would be giving commentary and explaining the math behind how they'll solve "Are you the one?" while also hilariously explaining how foolish the contestants' theories are.

vintermann•1mo ago

> No-one is going to watch a full season of a handful of math nerds take a few minutes to progress a worksheet towards a solution each week coupled with whatever they do to pass the time otherwise.

Um, what about those of us who watch Blood on the Clocktower streams?

codebje•1mo ago

Both of you are the exceptions that prove the rule?

Yossarrian22•1mo ago

That's because the real game is occurring both before and after the show in modern reality tv competitions. The goal is to be entertaining and get social media followers and potential invites to further reality tv shows.

stevage•1mo ago

This was great, but it skipped over the most interesting bit - how you actually choose which matchups and truth booths. That is, an actual strategy that contestants could use that doesn't require a computer.

owenlacey•1mo ago

Thank you! This is consistent with feedback I got from the pudding, and is ultimately the reason they didn't go ahead with the post. I tried reverse-engineering the information-theory approach to try see what sort of decisions it made.

I noticed that for any match up score of X, the following match up would keep exactly X pairs in common. So if they scored 4/10 one week, they would change 6 couples before the next one. Employing that approach alone performed worse than the contestants did in real life, so didn't think it was worth mentioning!

vitus•1mo ago

It should be easier to understand the optimal truth booth strategy. Since this is a yes/no type of question, the maximum entropy is 1 bit, as noted by yourself and others. As such, you want to pick a pair where the odds are as close to 50/50 as possible.

> Employing that approach alone performed worse than the contestants did in real life, so didn't think it was worth mentioning!

Yeah, this alone should not be sufficient. At the extreme of getting a score of 0, you also need the constraint that you're not repeating known-bad pairs. The same applies for pairs ruled out (or in!) from truth booths.

Further, if your score goes down, you need to use that as a signal that one (or more) of the pairs you swapped out was actually correct, and you need to cycle those back in.

I don't know what a human approximation of the entropy-minimization approach looks like in full. Good luck!

CmdDot•1mo ago

«As such, you want to pick a pair where the odds are as close to 50/50 as possible.»

This is incorrect, the correct strategy is mostly to check the most probable match (the exception being if the people in that match has less possible pairings remaining than the next most probable match).

The value of confirming a match, and thus eliminate all other pairings involving those two from the search space, is much higher than a 50/50 chance of getting a no match and only excluding that single pairing.

vitus•1mo ago

> This is incorrect, the correct strategy is mostly to check the most probable match (the exception being if the people in that match has less possible pairings remaining than the next most probable match).

Do you have any hard evidence, or just basing this on vibes? Because your proposed strategy is emphatically not how you maximize information gain.

Scaling up the problem to larger sizes, is it worth explicitly spending an action to confirm a match that has 99% probability? Is it worth it to (most likely) eliminate 1% of the space of outcomes (by probability)? Or would you rather halve your space?

This isn't purely hypothetical, either. The match-ups skew your probabilities such that your individual outcomes cease to be equally probable, so just looking at raw cardinalities is insufficient.

If you have a single match out of 10 pairings, and you've ruled out 8 of them directly, then if you target one of the two remaining pairs, you nominally have a 50/50 chance of getting a match (or no match!).

Meanwhile, you could have another match-up where you got 6 out of 10 pairings, and you've ruled out 2 of them (thus you have 8 remaining pairs to check, 6 of which are definitely matches). Do you spend your truth booth on the 50/50 shot (which actually will always reveal a match), or the 75/25 shot?

(I can construct examples where you have a 50/50 shot but without the guarantee on whether you reveal a match. Your information gain will still be the same.)

CmdDot•1mo ago

In addition to Mastermind, Wordle also falls into the same category.

Optimal play to reduce the search space in both follow the same general pattern - the next check should satisfy all previous feedback, and included entries should be the most probable ones, both of those previously tested, and those not. If entries are equally probable, include the one which eliminates the largest number of remaining possibilities if it is correct.

For wordle, «most probable» is mostly determined by letter frequency - while in Mastermind, it’s pure probability based on previous guesses. For instance, if you play a Mastermind variant with 8 pegs, and get a 2/8 in the first test - each of your 8 pegs had a 1/4 chance of being correct. So you select 2 at random to include in the next guess.

If you then get a 2/8 from the second - you would include 4 previous entries in the next guess, 2 entries from the first that was not used in the second, as well as 2 entries from the 2nd - because the chance you chose the correct entries twice, is less than the chance the two hits are from the 6 you changed.

codeflo•1mo ago

> For wordle, «most probable» is mostly determined by letter frequency

I don't think that's a justified assumption. I wouldn't be surprised if wordle puzzles intentionally don't follow common letter frequency to be more interesting to guess. That's certainly true for people casually playing hangman.

CmdDot•1mo ago

When it comes to quickly reducing the search space of possible words, it is - that’s how you solve it optimally, even if (or in fact, especially) if the word they chose intentionally does not use the most frequent letters.

The faster you can discard all words containing «e» because of a negative match, the better.

If you want to be really optimal, you’ll use their list of possible words to calculate the actual positional frequencies and pick the highest closest match based on this - that’s what «mostly» was meant to imply, but the general principle of how to reduce the search space quickly is the same

IAmBroom•1mo ago

I would guess Wordle picks from a big bag'o'words. The words are all fairly common - "regel" is not going to show up - but I see no evidence the list favors "zebra" over "taint" (which has occurred, BTW).

andrewaylett•1mo ago

The original Wordle had a hard-coded ordering that was visible in the source. I had a toy around with the list (as did many other people) a few years back, you can see my copy of the word list here: https://github.com/andrewaylett/wordle/blob/main/src/words.r...

Bratmon•1mo ago

It's not an assumption- it's a factual statement about how wordle works

Majromax•1mo ago

> In addition to Mastermind, Wordle also falls into the same category.

> Optimal play to reduce the search space in both follow the same general pattern - the next check should satisfy all previous feedback, and included entries should be the most probable ones, both of those previously tested, and those not.

The "next check should satisfy all previous feedback" part is not exactly true. That's hard-mode wordle, but hard mode is provably slower to solve than non-hard-mode (https://www.poirrier.ca/notes/wordle-optimal/) where the next guess can be inconsistent with previous feedback.

kruffalon•1mo ago

> Optimal play to reduce the search space in both follow the same general pattern - the next check should satisfy all previous feedback

Thank you! I might look into this once I break my current streak of the localised wordle clone I'm playing now.

I always try to use as many different bits for the first few rounds...

But then again, maybe I'm not so good at these kinds of games as I think.

calfuris•1mo ago

It's not actually optimal. Each check should account for all previous feedback, but it may be optimal to make a known-incorrect guess and trade the chance of winning with that guess for additional information.

For example, if your first guess on wordle is BOUND and you learn that the word is _OUND, you know the answer is one of FOUND, HOUND, MOUND, POUND, ROUND, SOUND, WOUND. Satisfying all previous feedback leaves you checking those one at a time and losing with probability 2/7. Or you could give up the 1-in-7 chance of winning in 2 and trade it for certainly winning in either 3 or 4: HARMS checks four of those options, and WHOOP identifies the remaining three.

akoboldfrying•1mo ago

If the goal is to find the perfect matching in some maximum number of turns or less, it's possible to do even better by using a full game tree that minimises the maximum height of the tree ( = number of turns required), instead of using information/entropy as done here.

Basically, using the entropy produces a game tree that minimises the number of steps needed in expectation -- but that tree could be quite unbalanced, having one or more low-probability leaves (perfect matchings) many turns away from the root. Such a leaf will randomly occur some small fraction of the time, meaning those games will be lost.

For concreteness, a game requiring 6 bits of information to identify the perfect matching will take 6 steps on average, and may sometimes require many more; a minimax tree of height 7 will always be solved in at most 7 steps. So if you're only allowed 7 steps, it's the safer choice.

thaumasiotes•1mo ago

> For concreteness, a game requiring 6 bits of information to identify the perfect matching will take 6 steps on average, and may sometimes require many more

I'm not following your logic. Consider the setup we actually have:

1. You get to ask a series of yes/no questions. (Ignoring the matchups.)

2. Each question can produce, in expectation, up to one bit of information.

3. In order to achieve this maximum expectation, it is mathematically necessary to use questions that invariably do produce exactly one bit of information, never more or less. If your questions do not have this property, they will in expectation produce less than the maximum amount of information, and your expected number of steps to the solution will increase, which contradicts your stated goal of minimizing that quantity.

You get the minimum number of steps needed in expectation by always using questions with maximum entropy. Yes. But those questions never have any variation in the amount of entropy they produce; a maximum entropy strategy can never take more - or fewer - steps than average.¹

¹ Unless the number of bits required to solve the problem is not an integer. Identifying one of three options requires 1.585 bits; in practice, this means that you'll get it after one question 1/3 of the time and after two questions the other 2/3 of the time. But identifying one of 128 options requires 7 bits, you'll always get it after 7 questions, and you'll never be able to get it after 6 questions. (Assuming you're using a strategy where the expected number of questions needed is 7.)

dmurray•1mo ago

You're correct but the complexity is in the things you're ignoring for simplicity.

This game is constructed such that the questions you can ask are not arbitrary, so you cannot choose them to always produce one bit of entropy (you need to frame your questions as ten matchups in parallel, using all the contestants exactly once) and the number of bits you need may indeed not be an integer.

Because you can't choose your questions to partition the state space arbitrarily, that affects not just the question you ask today, but also previous days: you want to leave yourself with a partitionable space tomorrow no matter what answers you get today.

In the Guess Who analogy, it's against the rules or at least the spirit to ask "does your character have a name which is alphabetically before Grace?". That would allow a strategy which always divides the state space exactly in two.

thaumasiotes•1mo ago

> and the number of bits you need may indeed not be an integer.

That's true but not relevant; needing a non-integer number of bits makes it possible for some games to end one turn faster than other games, but it doesn't make it possible for a maximum-entropy strategy to have more variation than that one maybe-necessary-maybe-not turn. The scenario described by my parent comment still isn't possible.

> This game is constructed such that the questions you can ask are not arbitrary, so you cannot choose them to always produce one bit of entropy (you need to frame your questions as ten matchups in parallel, using all the contestants exactly once)

You're confusing two types of questions. The question where you're targeting one bit of information is "are Matt and Felicia a match?", just one pair.

A full proposed matchup of n pairs has n+1 possible answers and so your goal is to produce log₂ (n+1) bits. (Better conceptualized as one base-n+1 "bit".) I agree that it's not obvious how to do this or to what extent it's possible.

akoboldfrying•1mo ago

> Unless the number of bits required to solve the problem is not an integer.

That is one case where root-to-leaf path lengths can vary, though it's not obvious to me that it exhausts all such cases -- in particular, even if we have "ideal leaves" (numbering a power of 2, and each equally likely), it's not clear that there is always a question we can ask that divides a given node's leaves exactly in half.

codeflo•1mo ago

That's a good point, entropy is only a heuristic for the thing you actually want to optimize, worst-case guesses (though it's probably a very good heuristic).

> Basically, using the entropy produces a game tree that minimises the number of steps needed in expectation

It might be even worse than that for problems of this kind in general. You're essentially using a greedy strategy: you optimize early information gain.

It's clear that this doesn't optimize the worst-case, but it might not optimize the expected number of steps either.

I don't see why it couldn't be the case that an expected-steps-optimal strategy gains less information early on, and thus produces larger sets of possible solutions, but through some quirk those larger sets are easier to separate later.

daturkel•1mo ago

As a math guy who loves reality tv, I was also drawn to the show and wrote a blog post [0] about how to programmatically calculate the probabilities as the show progresses. It was a lot of fun optimizing it to be performant. You can `pip install ayto` to use it to follow along with the show or try out scenarios.

The linked post is a very thorough treatment of AYTO and a great read. I really like the "guess who" bit on how to maximize the value of guesses. It's a shame the participants aren't allowed to have pen and paper—it makes optimization a lot trickier! I'm impressed they do as well as they do.

[0]: https://danturkel.com/2023/01/25/math-code-are-you-the-one.h...

vasco•1mo ago

And sometimes they just don't do better as a plot point, staying together an extra week after finding out they are not the one because of the intensity of their love (they met 4 days before)

daturkel•1mo ago

Giving them more credit than they probably deserve but: when you're solving "by hand" like they are in the show, keeping a known non-match couple together may actually be helpful for interpreting the results of a matchup ceremony because you'll know that that couple didn't contribute to the beams.

vasco•1mo ago

That's different, they do that also, but sometimes for the plot one couple intentionally mess those plans because the love is just too big.

daturkel•1mo ago

Relevant xkcd: https://xkcd.com/55/

owenlacey•1mo ago

Let's be friends :')

Loved your post, really enjoyed getting into the meat of it. I wanted to position mine to a layman, kept asking myself "can I explain this to my Dad?"

I think where the post falls short is the absence of a silver bullet that contestants can use to win the game sooner.

daturkel•1mo ago

Thanks! Optimization was something I'd played with in previous rounds of coding up AYTO simulations, but not in the most recent version. (See the bottom section of this notebook [0]). There's also a very thorough treatment of the problem in a blog post from 2018 by SAS (the software company) [1]. It's surprising how many people have been drawn in by the allure of AYTO!

[0]: https://github.com/daturkel/pyto/blob/master/AYTO_S8.ipynb [1]: https://blogs.sas.com/content/operations/2018/08/14/are-you-...

ChristopherDrum•1mo ago

I think the part that stings most about this article is when he says, "In my research I came across a boardgame called Mastermind."

My lived childhood is old enough to be someone's "research."

owenlacey•1mo ago

Sorry :')

My wife and I bought the game - it's a great turn based came you can play whilst having trashy reality shows on in the background!

ChristopherDrum•1mo ago

No no, I was just blindsided by my own feelings when I read that and thought it was kind of funny. Judging from the inexplicable downvoting that comment received (?!) apparently that wasn't clear. I always enjoyed Mastermind, though there is a line of attack on the solution that can render it a little boring. It still maintains a bit of a Wordle-like pleasure.

rahimnathwani•1mo ago

After he described the rules, my immediate reaction was 'this is like mastermind'. Sure enough, further down the page:

  Other than that, in my research I came across a boardgame called Mastermind, which has been around since the 70s. This is a very similar premise - think of it as "Guess Who?" on hard mode.

shermantanktop•1mo ago

Mastermind! What a great game. Played it for hours.

https://www.mcsweeneys.net/articles/the-mastermind-box-cover...

karel-3d•1mo ago

A thing to note - the contestants are not allowed to have even pen and paper, as mentioned in the other blogpost. So they need to do these computations in their heads.

komali2•1mo ago

I would love to know how the author went about constructing those lovely charts. Some library, or done by hand?

owenlacey•1mo ago

Hey! It's all vanilla html/js/css - which I'm pretty proud of. I wanted something really minimal, and felt most charting libraries were overkill so wanted to keep my bundle size small. I'm thinking of making the interactive parts open source so people can take a look for themselves if that would be helpful

komali2•1mo ago

Wow, that's super cool! I would love to take a peek at the interactive bits.

AtlasBarfed•1mo ago

Isn't 5 impossible in the 6-6 matching?

owenlacey•1mo ago

correct! I might put a little footnote on this to make that clear, thanks

0xkalle•1mo ago

When my wife and I watched the show I wrote a solver on the side so we always had the current probabilities and impossible combinations on the side.

I am thinking about making a website for it when the next season starts.

Also: in Germany at least they have 10 x 10 candidates from the start, but sometimes they add a 11th or even 12th of one gender so that there are double matches (e.g. 1 woman has two man as match and needs to find one of it to succeed). This raises the possible combination quite a bit.

owenlacey•1mo ago

Would love to see this!

Yes there's a gender fluid season and a season where someone had > 1 match, as well as people leaving part way through the season (apparently perfect matches are interchangeable...). All very interesting spins on the core problem to solve; would be really interested if anyone tries to tackle those seasons.

zvr•1mo ago

Surely you mean 10 + 10 candidates (not 10x10).

owenlacey•1mo ago

What are some other gameshows one could nerd out about? I'll start:

An obvious one is the traitors, but I dunno if there's much you can do with this one as the contestants rarely gain much concrete information.

"Deal or no deal" / "let's make a deal" would have interesting game theory approaches - probably has a lot of parallels with Monty Hall?

Countdown (UK) - solving the maths puzzles on here using integer programming would be cool

ChristopherDrum•1mo ago

"Let's Make a Deal" was hosted by Monty Hall and is literally the Monty Hall problem.

philipwhiuk•1mo ago

> This time, one guy has two matches which means that there will be eleven girls, but only ten boys.

One thing the show runners do subtle alterations that makes the logic much harder.

The Traitors has to do lots of these tricks when not playing the Celebrity edition because there's a self-selection for the sort of person who has already played Werewolf/Avalon-type games.

yoan9224•1mo ago

Love the approach of using information theory to optimize decisions. The concept of "expected information gained" is fascinating - it's basically what good analytics should do: help you ask the right questions to eliminate uncertainty fastest.

The interactive visualizations on this post are fantastic. More technical content should be presented this way. Makes complex probability much more intuitive.

oneeyedpigeon•1mo ago

Love this article, OP, especially those interactive demos. One small suggestion:

> A truth booth is where a male & female ...

The use of "male" and "female" as nouns sounds very unnatural. "A man and a woman" would be a little less jarring, imo.

mwcz•1mo ago

One small boon of AI is that making small interactive web components for demonstrations is now a few-minutes diversion instead of an hour or more. I have no idea if that's what OP did, but I've been happy with generating low-stakes code for blog posts.

Rperry2174•1mo ago

Most hilarious part of this is that if you've ever watched "The Challenge" then you know that these people, truly, often cannot add 3 digit numbers together let alone understand information theory

CollinEMac•1mo ago

> This post is my first foray into content like this. I wanted to scratch the itch of an interesting maths problem, with a light-hearted spin that I hope you enjoyed as much as I did making it.

Really impressive imo. I don't remember the last time I was this engaged reading an article on HN.

SectorC: A C Compiler in 512 bytes

The F Word

Brookhaven Lab's RHIC concludes 25-year run with final collisions

Speed up responses with fast mode

Software factories and the agentic moment

Stories from 25 Years of Software Development

Hoot: Scheme on WebAssembly

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

I write games in C (yes, C)

First Proof

Show HN: A luma dependent chroma compression algorithm (image compression)

The Waymo World Model

Al Lowe on model trains, funny deaths and working with Disney

Vocal Guide – belt sing without killing yourself

Start all of your commands with a comma (2009)

Reinforcement Learning from Human Feedback

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

Selection Rather Than Prediction

Coding agents have replaced every framework I used

The AI boom is causing shortages everywhere else

A Fresh Look at IBM 3270 Information Display System

France's homegrown open source online office suite

72M Points of Interest

We mourn our craft

Unseen Footage of Atari Battlezone Arcade Cabinet Production

Where did all the starships go?

Show HN: Kappal – CLI to Run Docker Compose YML on Kubernetes for Local Dev

Learning from context is harder than we thought

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

History and Timeline of the Proco Rat Pedal (2021)

SectorC: A C Compiler in 512 bytes

The F Word

Brookhaven Lab's RHIC concludes 25-year run with final collisions

Speed up responses with fast mode

Software factories and the agentic moment

Stories from 25 Years of Software Development

Hoot: Scheme on WebAssembly

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

I write games in C (yes, C)

First Proof

Show HN: A luma dependent chroma compression algorithm (image compression)

The Waymo World Model

Al Lowe on model trains, funny deaths and working with Disney

Vocal Guide – belt sing without killing yourself

Start all of your commands with a comma (2009)

Reinforcement Learning from Human Feedback

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

Selection Rather Than Prediction

Coding agents have replaced every framework I used

The AI boom is causing shortages everywhere else

A Fresh Look at IBM 3270 Information Display System

France's homegrown open source online office suite

72M Points of Interest

We mourn our craft

Unseen Footage of Atari Battlezone Arcade Cabinet Production

Where did all the starships go?

Show HN: Kappal – CLI to Run Docker Compose YML on Kubernetes for Local Dev

Learning from context is harder than we thought

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

History and Timeline of the Proco Rat Pedal (2021)

“Are you the one?” is free money

Comments