“Based on experience I feel there is some algorithm other than randomness at work. I have 5K plus songs and to have 2 consecutive songs play from the same album or the same artist – well that’s not random.” – Disparate blog comment
There has been a lot of debate over the “randomness,” or possible lack thereof, in Apple’s “shuffle” feature in its iTunes software and iPod products. Some of the debate stems from confusion about how the feature works, or the need to distinguish between selection with/without replacement. But more interesting– to me, anyway– are comments like the one above, that illustrate just how poorly we humans perceive “true” randomness, searching for patterns and order where none exist. Also, they make for a source of nice math problems.
So let’s turn this problem into a puzzle: suppose that you have a library of 5000 songs in some natural order, e.g. with titles “Song 1,” “Song 2,” “Song 3,” etc., up to “Song 5000.” Now shuffle and play the entire library. Define a coincidence to be a pair of consecutive songs (in the natural order) that are played consecutively in the shuffled order. For example, if you hear songs 137, 138, and 139 in order, that is 2 coincidences (137 followed by 138, and 138 followed by 139).
What is the expected (i.e., average) number of observed coincidences? What is the probability that no coincidences will occur?
A couple of notes:
- A slight variant of this problem was asked, but not correctly answered, on the xkcd forum.
- The problem becomes slightly “cleaner” if we view the song library as cyclic, so that song 5000 followed by song 1 is also considered a coincidence. How does the solution change in this case?
Froelich, Duckworth, and Culhane, Does your iPod Really Play Favorites? The American Statistician, 63(3):263-268. (August 2009) [PDF]