Example:Alec's unmarked die experiment
From Maths
Contents
[hide]Experiment
A volunteer was asked to roll 5 indistinguishable[Note 1] (6 sided) dice (at once) onto a surface and announce the outcome as a sequence of numbers[Note 2], I recorded these outcomes as a list[Note 3]
Terminology:
- "values" will be used to describe dice readings, eg a dice showing 6 is described as "value of 6"
- "digits" will be used to describe elements of the sequence of a reading, for example digit 3 of "22655" is "6"
We wish to investigate whether or not the recording order was "random" or whether there was some sort of order to the volunteer's reading of the die.
I then tabulated (via a tally chart) the frequencies of values for each digit to obtain:
Results table
Digit | Die value | Total | |||||
---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | ||
Digit 1 | 7 | 9 | 6 | 7 | 6 | 10 | 45 |
Digit 2 | 4 | 8 | 13 | 10 | 8 | 2 | 45 |
Digit 3 | 7 | 12 | 8 | 6 | 7 | 5 | 45 |
Digit 4 | 5 | 7 | 11 | 8 | 9 | 5 | 45 |
Digit 5 | 11 | 4 | 5 | 5 | 4 | 16 | 45 |
Total freq. | 34 | 40 | 43 | 36 | 34 | 38 | 225 |
Average[Note 4] | 6.8 | 8.0 | 8.6 | 7.2 | 6.8 | 7.6 |
Notice the extreme values of:
- value 6 occurred '16 times as the fifth and final digit of a recording
- value 6 occurred just 2 times as digit two
- Note:
- Maybe use something like this to show distributions....
- Maybe use something like this to show distributions....
Modelling the situation
Assuming the volunteer was random We have some options:
- Model each digit (row) as the record of 45 die rolls
- This would model each row as Bin(16,45)
- Model each value (column) as being assigned to a random one of 5 digits.
- For example, there were 38 values of six, if random we'd expect the probability of any individual 6 value being recorded in any particular digit as 1/5
- This would mean any particular digit of the six value for example would be modelled Bin(15,38)
Results
- TODO: I want to save this and move on to another task, these are just me jotting down what I have on paper for now
For X∼Bin(16,46) - the distribution of values for any particular digit (if random)
- Warning:notice the 46 should be 45 - I miscounted and can't be bothered to work out, the difference will be fairly small
- P[X≥16]=0.002239 (4 sf) - significant
- P[X≤2]=0.01176 (4 sf) - on the fence
For Y6∼Bin(15,38) - the distribution of the value of 6 for any particular digit (if random)
- P[Y6≥16]=0.001560 (4 sf) - significant
- P[Y6≤2]=0.01131 (4 sf) - on the fence
Notes
- ↑ Each die was a generic white black-spotted die, an individual die could not be identified from the group of 5 dice.
- ↑ A sequence is ordered, it has a 1st term and a 2nd term, a sequence made from one by swapping its 1st and 2nd terms is a distinct sequence to the original.
- Specifically, the volunteer may call (1,2,3,4,5), or call (5,3,4,1,2) - these are distinct, but reflect the same outcome of throwing the 5 dice.
- ↑ I may record "22665 33456" - this means the volunteer announced "two-two-six-six-five" for one trial, then "three-three-four-five-six" for the next trial.
- ↑ The average of a value of "1" is 6.8, this is 345. Notice:
- Bin(15,34) is the expected distribution of the number of 1s recorded in any particular digit
- assuming each digit is independent, and random (so a 15 chance of any particular digit)
- Bin(15,34) is the expected distribution of the number of 1s recorded in any particular digit