Introduction
The coupon collector’s problem is an old standby in the puzzle community: what is the minimum number of random selections with replacement from a set of
coupons that are needed to obtain at least one of every coupon in the set? This post collects solutions to some variants of this problem involving what Stadje [3] refers to as “group drawings,” where each random selection is not of a single coupon, but of a group of
coupons. Python code implementing the formulas described here is available on GitHub.
Group drawing a subset (without replacement)
Results depend on how each group of coupons is selected. Stadje addresses the variant where at each turn we select a random subset of distinct coupon types, i.e., sampling the
coupons without replacement (but always returning all coupons to the pile for the next turn).
In this case, the cumulative distribution function for is
with expected value
Group drawing with replacement
Alternatively, we have considered here before the variant where at each turn we sample a group of coupons with replacement, so that duplicates are possible within a group. In this case,
Group drawing consecutively from a cycle
So far, nothing new. But this post was motivated by an interesting variant of the problem posed recently by John D. Cook [1]: arrange the coupons in a circle, and at each turn, randomly select an “interval” of
coupons that appear consecutively around the circle.
A comment on the linked blog post mentions a very elegant solution to the continuous version of this problem described in [2,4,5], where we ask for the number of random arcs of fixed length needed to cover the circle. For reference, the resulting distribution of
in this continuous case is given by
where is the length of each arc as a fraction of the circumference.
It is a delightfully challenging exercise to apply this same inclusion-exclusion approach, where we are prohibiting uncovered “gaps” between arcs, to this discrete “consecutive coupon” collector problem. The resulting probability distribution is
where the first inner summation counts ways to select exactly specified distinct intervals in
turns (i.e., the number of surjections from
to
), and the second inner summation is the discrete analog to the continuous case summation above, counting ways to “cover” the entire cycle of
coupons with
distinct intervals of
consecutive coupons.
Unfortunately, unlike the other variants where we can compute the expected number of turns as a finite sum, it’s not clear to me how to compute the expected value here other than lower bound approximations from the series .
References:
- Cook, John D., Consecutive coupon collector problem [HTML]
- Solomon, H., Covering a Circle Circumference and a Sphere Surface, Chapter 4 in Geometric Probability, Philadelphia, PA: SIAM, p. 75-96, 1978 [PDF]
- Stadje, Wolfgang, The Collector’s Problem with Group Drawings, Advances in Applied Probability, 22(4) December 1990, p. 866-882 [JSTOR]
- Stevens, W. L., Solution to a Geometrical Problem in Probability, Annals of Eugenics, 9(4) 1939, p. 315-320 [PDF]
- Weisstein, Eric W., Circle Covering by Arcs, from MathWorld–A Wolfram Web Resource [HTML]
In paragraph 4.7 of my recent introductory book A first Course in Probability fot Computer and Data Science, World Scientific Press, 2023 simple but accurate heuristics are discussed for a variety of coupon collector problems