Dyson’s Algorithm: The General Case
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
In a previous post, we looked at Dyson’s algorithm (Dyson 1946) for solving the Coins in Weighings problem :
You have coins, to appearance exactly identical; but possibly one is counterfeit (we’ll call it a “dud”). You do not know if a dud is present, nor whether it is heavier or lighter than the good coins. Can you determine in weighings on a balance scale: (1) whether there is a dud, (2) if so, which coin, and (3) its relative weight (lighter or heavier than the good coins)?
The most famous version of this problem is to resolve 12 coins in 3 weighings; this is an instance of the special case where , which is the maximum number of coins that can be resolved in weighings. The version of the algorithm we showed in the previous post solved exactly this case. In this post, we will adjust that algorithm to the more general case where .
Sketch of the Special Case Algorithm
You probably want to review the previous post before moving on to this one, but I’ll quickly sketch the special-case algorithm here.
- First, number the coins from
1:M
. Then assign each coin a label that corresponds to either the signed trinary representation of its number, or the negation of that number, depending on which representation rotates “forward” (defined to mean that the first change of digits increases by 1, modulo 3). - Next, determine a “weighing schedule”: which coin goes where for each weighing. The position of each coin on the th weighing is determined by its th digit (starting from the left): means the left pan, means the right pan, and means the table.
- Similarly, we record the outcome of each weighing by which pan is heavier (if any): means the left pan is heavier, means the right pan is heavier, means the scale is balanced.
As we showed in the last post, if the th coin is a heavy dud, it will spell out its own label in signed trinary, as . A light dud will spell out the negation of the label.
You can recover the decimal representation of the final outcome, as
Then the location of the dud is , and the dud is heavy if rotates forward, and it is light if rotates backward. If there is no dud, then .
The General Case
The general case algorithm works basically the same way, but now the coin labels do not correspond directly to the coin numbers (which are still 1:M
). Instead, they are assigned from groups of “weighing triplets,” or cyclic groups, in a way that keeps the scale counts equal on the first round.
We can then use the same weighing schedule as in the case, with a little extra bookkeeping to keep the scale counts equal on every round. As before, the scale will spell out the label of the dud coin if the dud is heavy, or the label’s negation, if the dud is light.
To make this concrete, we’ll use the example of weighings, but now we want to resolve fewer than 12 coins. You can find the code I’m calling in this example here.
Let’s look again at the forward representations for 12 coins.
Show the code
source("dyson_signed_general.R") library(poorman) # dplyr will work here, too nrounds = 3 Fmat = get_forward_representation(nrounds) Fmat
[,1] [,2] [,3] [1,] 0 0 1 [2,] 0 1 -1 [3,] 0 1 0 [4,] 0 1 1 [5,] 1 -1 -1 [6,] 1 -1 0 [7,] 1 -1 1 [8,] -1 0 1 [9,] -1 0 0 [10,] -1 0 -1 [11,] 1 1 -1 [12,] -1 -1 0
In the above, the row numbers of the matrix are the coin numbers. When we are resolving 12 coins, the forward representations of the coin numbers give the labels for each coin.
1. Create “weighing triplets”.
When we want to resolve fewer than 12 coins, the coin number and the forward representations no longer directly correspond. We have to assign the labels as follows.
We’ll start with the label [0 0 1]
, and increase each digit by 1 modulo 3, to get a new forward label, [1 1 -1]
. We’ll call this a cyclic shift. Then we’ll take the new label, and shift it again to make a triplet.
coin 1: label [0 0 1] (1) coin 2: label [1 1 -1] (11) coin 3: label [-1 -1 0] (12) # -12, actually, but this is the forward representation
This is the first group (or triplet).
Now get the next unassigned label ([0 1 -1] = 2
), and make another triplet, and so on, until all the labels are assigned to a group. This results in groups. For this example, that’s 4 groups. You can think of each group as a “weighing triplet” because every weighing of those three coins together has one coin on the left, one on the right, and one on the table, every round.
Here are all the possible coin labels, their groups, and the digits of the forward representations.
Show the code
gps = create_cyclic_groups(nrounds) M_max = (3^nrounds - 3) / 2 gpmap = data.frame(group = gps, coin_label = 1:M_max) gpmap = cbind(gpmap, data.frame(Fmat)) arrange(gpmap, group, coin_label) |> knitr::kable()
group | coin_label | X1 | X2 | X3 |
---|---|---|---|---|
1 | 1 | 0 | 0 | 1 |
1 | 11 | 1 | 1 | -1 |
1 | 12 | -1 | -1 | 0 |
2 | 2 | 0 | 1 | -1 |
2 | 6 | 1 | -1 | 0 |
2 | 8 | -1 | 0 | 1 |
3 | 3 | 0 | 1 | 0 |
3 | 7 | 1 | -1 | 1 |
3 | 10 | -1 | 0 | -1 |
4 | 4 | 0 | 1 | 1 |
4 | 5 | 1 | -1 | -1 |
4 | 9 | -1 | 0 | 0 |
Note that the groups can be precompiled, if you know .
2. Assign labels to the coins
Now, if you have coins, , instead of assigning them sequential labels, you assign them labels from each group, in order. The first 3 coins get labels from the first group, the next 3 from the second group, and so on. You will have coins left over. If , assign the last two coins to the members of the next group that start with the digits -1
and 1
; if , then assign the last coin to the member of the next group that starts with 0
.
Let’s try some examples. The first group has the labels (1, 11, 12)
, and the second group has the labels (2, 6, 8)
. Suppose we have 5 coins. Then we’d assign the first three coins the labels 1, 11, 12, and the last two coins the labels 6 ([1 -1 0]
) and 8 ([-1 0 1]
).
coin 1: label [0 0 1] (1) coin 2: label [1 1 -1] (11) coin 3: label [-1 -1 0] (12) coin 4: label [1 -1 0] (6) coin 5: label [-1 0 1] (8)
If we have 4 coins, we’d assign the first three coins the labels 1, 11, 12, and the last coin the label 2 ([0 1 -1]
).
coin 1: label [0 0 1] (1) coin 2: label [1 1 -1] (11) coin 3: label [-1 -1 0] (12) coin 4: label [0 1 -1] (2)
3. Weigh the coins according to the weighing schedule.
The weighing schedule is the same as it was before, but now we’re only using some of the labels. Each coin is placed on the scale according to the digits of its label. Here’s the weighing schedule for the first weighing:
Show the code
Cset = compileC(Fmat) knitr::kable(Cset[[1]])
left | table | right |
---|---|---|
8 | 1 | 5 |
9 | 2 | 6 |
10 | 3 | 7 |
12 | 4 | 11 |
So the first weighing of five coins is as follows:
coin1 (1): table coin2 (11): right coin3 (12): left coin4 (6): right coin5 (8): left [ {coin3, coin5} | {coin2, coin4} ] ------------------------------------ coin1
This time, in addition to keeping track of the weighing outcome , we also need to keep track of which coins we know to be good, based on the outcome of the weighing. We know that
- if the scale is balanced, then all the coins on the scale are good (the dud is on the table), and
- if the scale is not balanced, then all the coins on the table are good (the dud is on the scale).
Let’s suppose the scale tilts right (a[1] = 1
). Then we know that coin1
is good. Now, to the second weighing:
Show the code
knitr::kable(Cset[[2]])
left | table | right |
---|---|---|
5 | 1 | 2 |
6 | 8 | 3 |
7 | 9 | 4 |
12 | 10 | 11 |
coin1 (1, good): table coin2 (11): right coin3 (12): left coin4 (6): left coin5 (8): table [{coin3, coin 4} | {coin2} ] ------------------------------ coin1, coin5
Oops! Now, the scale counts aren’t equal. But we know that coin1
is good, so we can put it in the right pan to equalize the coin counts without affecting the outcome.
[{coin3, coin 4} | {coin2, coin1} ] ------------------------------ coin5
Sometimes the scale count isn’t equal, but none of the coins on the table have been marked good. In that case, find a good coin in the pan with more coins and put it on the table.
To continue the example, let’s say that the scale again tilts right (a[2] = 1
). Now we know that coin5
is good. On to weighing three.
Show the code
knitr::kable(Cset[[3]])
left | table | right |
---|---|---|
2 | 3 | 1 |
5 | 6 | 4 |
10 | 9 | 7 |
11 | 12 | 8 |
coin1 (1, good): right coin2 (11): left coin3 (12): table coin4 (6): table coin5 (8, good): right [{coin2} | {coin1, coin5}] --------------------------- coin3, coin4 Move either of coin1 or coin5 to the table. [ {coin2} | {coin1} ] ----------------------- coin3, coin4, coin5
Now, this time, the scale tilts left (a[3] = -1
). We have a = [1 1 -1]
, which means . The label 11 corresponds to coin2
, and a
rotates forward, so the dud is coin2
and is heavy.
Let’s confirm that with code. As before, the function find_dud
returns a value D
such that abs(D)
gives the index of the dud, and sign(D)
gives the dud’s relative weight: negative means the dud is light, and positive means the dud is heavy.
Show the code
nrounds = 3 ncoins = 5 coins = numeric(ncoins) + 1 # good coins weigh 1 unit # precompilation computes both # the weighing schedule and the label groups precompiled = precompile(nrounds) # make coin 2 a heavy dud coins[2] = 1.5 find_dud(coins, precompiled)
[1] 2
We’ll try 10 coins.
Show the code
ncoins = 10 coins = numeric(ncoins) + 1 # no dud case find_dud(coins, precompiled)
[1] "No dud."
[1] 0
Show the code
# 7, light coins[7] = 0.5 find_dud(coins, precompiled)
[1] -7
Show the code
# confirm all possible cases work for(icoin in 1:ncoins) { for (direction in c(-1, 1)) { coins = numeric(ncoins) + 1 coins[icoin] = 1 + 0.5 * direction actual = icoin * direction result = find_dud(coins, precompiled) stopifnot(actual == result) } }
This generalizes Dyson’s algorithm. It’s not as pretty as the special case version, but it works.
Nina Zumel is a data scientist based in San Francisco, with 20+ years of experience in machine learning, statistics, and analytics. She is the co-founder of the data science consulting firm Win-Vector LLC, and (with John Mount) the co-author of Practical Data Science with R, now in its second edition. ## References
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.