If you’ve flipped through a textbook on high-dimensional probability or geometry, you’ve seen the obligatory drawing of an -dimensional cube. It is usually rendered as a strange spiky object: a sea urchin, a thorny diamond, a star with way too many points. Why does the high-dimensional cube get drawn like that?
Some of it is concentration of measure, but there is a much simpler counting argument behind a particular family of these drawings. It is just the binomial coefficient in disguise.
Counting corners
The unit cube in dimensions is the set . Its corners are the points where every coordinate is either or . There are exactly of them.
- : corners, namely and .
- : corners, namely . The unit square.
- : corners. The familiar 3D cube.
- : corners.
- : just over a billion.
Now group the corners by Hamming weight, the number of coordinates equal to . Define
How many such corners are there? You’re picking which of the slots get the , so
That is the punchline: the level distribution of cube corners is exactly the binomial coefficient.
What does look like?
A bell. Peaked at , symmetric around it, with at the extremes and in the middle.
For large , the de Moivre–Laplace theorem (a classical, pre-CLT special case of the central limit theorem) gives
In other words, the count of corners at level , viewed as a function of , converges to a Gaussian with mean and variance .
Drawing the cube
Now suppose we draw the cube on a 2D page. One natural convention: stack the levels vertically and draw each level as a horizontal band whose width is proportional to .
For small this is a chunky diamond. For large it is a vertical bell curve, narrow at the top and bottom (only one corner each, at and ) and bulging in the middle. The ratio between the middle width and the tip width is , which is roughly , exponentially huge.
Try it yourself. Slide and watch the chunky diamond at sharpen into a needle-tipped spindle:
That’s the spike. The drawing has sharp points at the top and bottom because there is exactly one corner at each end, and a dramatic bulge in the middle because the binomial is dramatic about its peak. As grows, the tips become invisibly thin while the middle balloons out.
A few observations follow immediately:
- Almost all corners have roughly ones. The fraction of corners with falls off Gaussian-fast in .
- The tips are vanishingly rare. Out of corners, only two (the all-zero and the all-one) sit at the extreme levels. They are the spike points that give the drawing its name.
- Two random corners are typically at Hamming distance , which drops out of the same binomial picture for free.
References
Versions of the spiky-cube drawing appear all over high-dimensional probability and data-science texts. A few good places to look:
- Roman Vershynin, High-Dimensional Probability: An Introduction with Applications in Data Science. Cambridge University Press, 2018. The first few chapters set up exactly this geometric picture before deploying it for concentration inequalities.
- Avrim Blum, John Hopcroft, Ravi Kannan, Foundations of Data Science. Cambridge University Press, 2020. Chapter 2 is a friendly tour of the geometry of high dimensions; available freely on the authors’ webpages.
- Stéphane Boucheron, Gábor Lugosi, Pascal Massart, Concentration Inequalities: A Nonasymptotic Theory of Independence. Oxford University Press, 2013.
The drawings vary author to author, but the underlying combinatorics is always the same: corners count as a binomial, and a binomial peaks sharply.