Counting Circles

Here’s one of my favorite problems: How many ways are there to arrange 3 red marbles and 2 blue marbles into a circle?

1. Intro to combinatorics

Let’s start with something simpler: Arranging the same marbles into a sequence.
Imagine 5 empty boxes with numbers from 1 to 5 written on them. The ways of arranging the 3 red and 2 blue marbles into a sequence correspond one-to-one to the ways to put them (one each) into these boxes.
We’ll do that by first placing all the red ones and then the blue ones. For the first one, we have 5 possible boxes to put it into; For the second one, there are only 4 left and for the third one there are 3 choices. After that, the rest of the boxes has to be filled with blue marbles.

You might be inclined to say that the answer must thus be $5 \times 4 \times 3$ , but that’s not quite right because the red marbles are indistinguishable from each other. For example, placing the first red marble into box 1 and the second one into box 2 results in the same thing as placing the first one into box 2 and the second one into box 1.

But what if they were distinguishable? Say we write numbers onto the red marbles. The exact numbers don’t really matter (only that no two red marbles have the same number), but let’s just go with 1, 2, and 3. Simple and easy.
Now that we’ve numbered them, the number of ways to place these three marbles into 5 boxes is actually $5 \times 4 \times 3$ .

But consider such a placement with the numbered marbles already in the boxes. We could also have got there by placing our original, indistinguishable marbles into the boxes first and then writing the numbers onto them.
Let’s do that in order: For each possible placement of the indistinguishable marbles, there are 3 possible choices for the number 1, then 2 possible choices for the number 2, then the final number 3 has to be the remaining one.
Thus, the total number of ways to write the numbers 1, 2, 3 onto the three marbles is $3 \times 2 \times 1 = 6$ .

So, if we call the number of ways to place 3 indistinguishable marbles into 5 numbered boxes $𝑁$ , we know that the number of ways to place 3 distinguishable marbles into the same boxes is $6 \times 𝑁$ .
But from before, we also know that this number must be equal to $5 \times 4 \times 3$ , so we can deduce that

\begin{aligned} 6 \times 𝑁 & = 5 \times 4 \times 3 \\ ⟺ 𝑁 & = \frac{5 \times 4 \times 3}{6} = 10 . \end{aligned}

Let’s clean this up a bit with some notation:

An expression like $5 \times 4 \times 3$ is called a falling power and written as $5^{\underset{̲}{3}}$ . Other examples would be $10^{\underset{̲}{2}} = 10 \times 9$ or $7^{\underset{̲}{4}} = 7 \times 6 \times 5 \times 4$ .
In general, $𝑛^{\underset{̲}{𝑘}} = 𝑛 \times \dots \times (𝑛 - 𝑘 + 1)$ .
A special case of the falling power is the factorial $𝑛! ≔ 𝑛^{\underset{̲}{𝑛}}$ . We encountered this for the number of ways to write numbers onto the red marbles, $3! = 6$ .

So, using this, we can write

𝑁 = \frac{5^{\underset{̲}{3}}}{3!} .

This number $𝑁$ is also called a binomial coefficient, or “5 choose 3”. There are varying notations for it, but I’ll go with

𝑁 = 𝐶 (5, 3) .

Note that nothing in this argument actually relied on any properties of the numbers 3 and 5, so, in general, the number of ways to put $𝑘$ red and $(𝑛 - 𝑘)$ blue marbles into a sequence is

𝐶 (𝑛, 𝑘) = \frac{𝑛^{\underset{̲}{𝑘}}}{𝑘!} .

By the way, here’s the list of possible sequences: RRRBB, RRBRB, RRBBR, RBRRB, RBRBR, RBBRR, BRRRB, BRRBR, BRBRR, BBRRR.
We can be sure that that’s all of them by just confirming that there’re no duplicates and that there’s really 10 of them.

2. Counting with Symmetry

That was for a sequence of marbles, but what about a row of them?

You might say “that’s the same”, but the difference is that a row has reflection symmetry.
In particular, imagine a row of marbles lying on the ground. You can just look at them from the opposite side and thus swap what you see as the left and right end, but the row is still the same.

So, in general, there are up to two sequences corresponding to any given row. For example, the sequences RRBBR and RBBRR are the same row because they’re just mirror images of each other.

But it gets more complicated when you consider RBRBR. This row is a palindrome, meaning it is its own mirror image. The existence of rows like that means we can’t just divide by two to go from the number of sequences to the number of rows.

But we can split the sequences into palindromes and non-palindromes. Then, each palindrome counts as one row and each non-palindrome counts as half a row. So, $𝑃$ being the number of palindromes and $\bar{𝑃}$ being the number of non-palindromes, we’d have

\begin{aligned} 𝑁_{row} & = 𝑃 + \frac{1}{2} \bar{𝑃} \\ = \frac{1}{2} (𝑃 + 𝑃 + \bar{𝑃}) \\ = \frac{1}{2} (𝑃 + 𝑁_{seq}), \end{aligned}

where $𝑁_{seq} = 𝐶 (5, 3)$ is the number of sequences.

So how many palindromes are there?
Since we have an even number of blue and an odd number of red marbles, the marble in the middle must be red.
The remaining four marbles are split into a left and right half and the right half must be the mirror image of the left half. So we are free to choose the left half, but that’s it. In other words, our freedom of choice corresponds to placing one blue and one red marble into a sequence:

𝑃 = 𝐶 (2, 1) .

So the total comes out to

𝑁_{row} = \frac{1}{2} [𝐶 (2, 1) + 𝐶 (5, 3)] = 6 .

Here’s the list:

Palindromes: RBRBR, BRRRB
Non-palindromes: RRRBB/BBRRR, RRBRB/BRBRR, RRBBR/RBBRR, RBRRB/BRRBR

Generalizing this is somewhat easy, but depends on the parity of $𝑛$ and $𝑘$ .

If the total number of marbles is odd, then we always have an odd number of one type of marble and an even number of the other type. This is the same situation as the previous example, so the general formula is
$\frac{1}{2} [𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋) + 𝐶 (𝑛, 𝑘)],$
where $⌊ 𝑥 ⌋$ is the largest integer less than or equal to $𝑥$ ; In particular, $⌊ \frac{𝑛}{2} ⌋ = \frac{𝑛 - 1}{2}$ since $𝑛$ is odd.
It doesn’t matter if we pick $𝑘$ to be odd or even, since¹
$𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋) = 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑛 - 𝑘}{2} ⌋) .$
If the total number of marbles is even, then it’s even easier. Either there’s an odd number of both red and blue marbles, in which case there are zero palindromes, or there’s an even number of both, in which case the number of palindromes is again
$𝑃 = 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋),$
where now $⌊ \frac{𝑛}{2} ⌋ = \frac{𝑛}{2}$ and $⌊ \frac{𝑘}{2} ⌋ = \frac{𝑘}{2}$ .

I’ll use the Iverson Bracket to unify these three cases. Its definition is that $⟦ 𝜑 ⟧ = 1$ if $𝜑$ is true and $⟦ 𝜑 ⟧ = 0$ if $𝜑$ is false.

So the general formula for $𝑃$ is

𝑃 = ⟦ 𝑛 odd or 𝑘 even ⟧ 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋) .

Thus, the general formula for the number of ways to place $𝑘$ red marbles and $(𝑛 - 𝑘)$ blue marbles into a row is

𝑁_{row} = \frac{1}{2} [𝐶 (𝑛, 𝑘) + ⟦ 𝑛 odd or 𝑘 even ⟧ 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋)] .

3. Group Theory

The general study of symmetries as part of abstract algebra is called group theory.

I’ll introduce the parts of it we will need in a sort-of unconventional way.

To start with, consider the idea of a symmetry transformation. The most well-known symmetry transformations are reflections and rotations.
Something is symmetric if it stays the same after performing some symmetry transformation.

To get to a general definition of symmetry transformations, one can think long and hard about what reflections and rotations have in common, but I’ll cut it short here and just tell you that the deciding property is invertibility.

Reflections are their own inverses: Perform the same reflection twice and you always end up with what you started with.
Rotations always have an inverse. For example, rotating the minute hand of a clock 20° clockwise is undone by rotating it 20° counterclockwise.

You might be a bit confused since, if something is symmetric, you already end up with what you started with after performing one symmetry transformation, but the important thing is that the description of invertibility also holds for non-symmetric things.
For example, reversing the letters in a palindrome like “wow” results in the same word, but for a non-palindrome like “word”, you get “drow”, and only after performing the inverse transformation (in this case “reversing again”), you end up back where you started.

Now let’s think about formalizing this using the example of the minute hand on a special clock. The clock isn’t really useful for telling the time; It is basically a circular image with a singular hand on top of it.

The reason for this example over just a regular clock is that it gives us a reason to call rotating the hand a symmetry operation. Namely, if the image behind the hand has some rotational symmetry, for example if it’s the image of a playing card with 180° rotational symmetry, then rotating the hand by that amount is basically the same as rotating the entire clock by that amount, which means that the clock hasn’t really changed.²

In math, basically everything we deal with is a set, so we need a set $𝑋$ to perform symmetry operations on. In particular, the elements of $𝑋$ will be the input/output states of the symmetry operations, so that we can model the operations themselves as functions $𝑋 \to 𝑋$ .

In our example, the symmetry operations are rotations of the hand, so the input/output states of that can be taken to just be the position of the hand on the clock, which can be modeled as an angle measured in degrees, so a real number in the interval $[0, 360)$ .

Then, as stated before, symmetry transformations need to be invertible, so we’re looking at the invertible functions (aka. “bijections”) from $𝑋$ to $𝑋$ .
The set of those bijections will be denoted $𝑆 (𝑋)$ here; It underlies what’s called the symmetric group of $𝑋$ .

I won’t define the term “group” in general, since it’s more abstract than we need. Instead, we’ll be working with subgroups of $𝑆 (𝑋)$ , which are defined as a subset $𝐺$ of $𝑆 (𝑋)$ such that

For any functions $𝑓, 𝑔 \in 𝐺$ , the composition $𝑓 \circ 𝑔$ is also in $𝐺$ .
For any function $𝑓 \in 𝐺$ , the inverse $𝑓^{- 1}$ is also in $𝐺$ .

Note that a combination of the two requirements implies $𝑓 \circ 𝑓^{- 1} = id \in 𝐺$ , so a subgroup of $𝑆 (𝑋)$ always contains the identity function.

These conditions basically mean that $𝐺$ is a “self-contained” set of symmetry operations. For example:

Condition 1 means that, if I can rotate by 20° ( $𝑓$ ), I should also be able to rotate by 40° ( $𝑓 \circ 𝑓$ ).
Condition 2 means that, if I can rotate by 20° clockwise ( $𝑓$ ), I should also be able to rotate by 20° counterclockwise ( $𝑓^{- 1}$ ).

Now we can recontextualize the previous section: We had a set $𝑋$ of sequences of 3 red and 2 blue marbles and a subgroup $𝐺 = {id, 𝑠}$ of $𝑆 (𝑋)$ , where $𝑠$ is the function that reverses a sequence. The conditions are easily confirmed:

For any $𝑓 \in 𝐺$ , $𝑓 \circ id = 𝑓 \in 𝐺$ and $id \circ 𝑓 = 𝑓 \in 𝐺$ . Additionally, $𝑠 \circ 𝑠 = id \in 𝐺$ .
${id}^{- 1} = id \in 𝐺$ and $𝑠^{- 1} = 𝑠 \in 𝐺$ .

4. Burnside’s Lemma

We’re close to the payoff of all that group theory.

First, we need another definition: Given a set $𝑋$ and a subgroup $𝐺$ of $𝑆 (𝑋)$ ,

For each element $𝑓 \in 𝐺$ , the set of fixpoints of $𝑓$ is defined as
${fix}_{𝑋} (𝑓) = {𝑥 \in 𝑋 | 𝑓 (𝑥) = 𝑥} .$
For each element $𝑥 \in 𝑋$ , the orbit ${[𝑥]}_{𝐺}$ of $𝑥$ is the set of possible outcomes from performing one of the symmetry transformations from $𝐺$ on $𝑥$ :
${[𝑥]}_{𝐺} = {𝑓 (𝑥) | 𝑓 \in 𝐺} .$
These orbits cleanly partition $𝑋$ : Each $𝑥 \in 𝑋$ is part of one and only one orbit.
The set of orbits is denoted $𝑋 / 𝐺$ :
$𝑋 / 𝐺 = {{[𝑥]}_{𝐺} | 𝑥 \in 𝑋} .$

In Section 2, the orbits corresponded to the rows of marbles: For a sequence $𝑥$ of marbles, we had ${[𝑥]}_{𝐺} = {𝑥, 𝑠 (𝑥)}$ . For non-palindromes, this set has two elements; For palindromes, $𝑠 (𝑥) = 𝑥$ by definition.

In general, our problem becomes finding the number of orbits, i.e. $| 𝑋 / 𝐺 |$ .

This is solved by something called Burnside’s Lemma, which states that

| 𝑋 / 𝐺 | = \frac{1}{| 𝐺 |} \sum_{𝑓 \in 𝐺} | {fix}_{𝑋} (𝑓) | .

For a small example, recall $𝐺 = {id, 𝑠}$ from Section 2. For this, the formula becomes

| 𝑋 / 𝐺 | = \frac{1}{2} (| {fix}_{𝑋} (id) | + | {fix}_{𝑋} (𝑠) |),

where ${fix}_{𝑋} (id) = 𝑋$ and ${fix}_{𝑋} (𝑠)$ is the set of palindromes. Setting $𝑁_{seq} = | 𝑋 |$ and $𝑃 = | {fix}_{𝑋} (𝑠) |$ , we can see that this is exactly the formula we had arrived at:

| 𝑋 / 𝐺 | = \frac{1}{2} (𝑁_{seq} + 𝑃) .

4.1. Proof

The following is a proof of Burnside’s Lemma, adapted from Wikipedia.³

Start with the right-hand side and notice that

\begin{aligned} \sum_{𝑓 \in 𝐺} | {fix}_{𝑋} (𝑓) | & = \sum_{𝑓 \in 𝐺} | {𝑥 \in 𝑋 | 𝑓 (𝑥) = 𝑥} | \\ = | {(𝑓, 𝑥) \in 𝐺 \times 𝑋 | 𝑓 (𝑥) = 𝑥} | \\ = \sum_{𝑥 \in 𝑋} | {𝑓 \in 𝐺 | 𝑓 (𝑥) = 𝑥} | \\ = \sum_{𝑥 \in 𝑋} | {[id]}^{𝑥} |, \end{aligned}

where we have defined

{[𝑔]}^{𝑥} ≔ {𝑓 \in 𝐺 | 𝑓 (𝑥) = 𝑔 (𝑥)}

for any $𝑔 \in 𝐺$ .

Note that this partitions $𝐺$ , which is proven as follows: Consider the set

{[𝑔]}^{𝑥} \cap {[ℎ]}^{𝑥} = {𝑓 \in 𝐺 | 𝑓 (𝑥) = 𝑔 (𝑥) and 𝑓 (𝑥) = ℎ (𝑥)} .

Either $𝑔 (𝑥) = ℎ (𝑥)$ and ${[𝑔]}^{𝑥} = {[ℎ]}^{𝑥}$ , or $𝑔 (𝑥) \neq ℎ (𝑥)$ and ${[𝑔]}^{𝑥} \cap {[ℎ]}^{𝑥} = \emptyset$ .

Next, for any $𝑔 \in 𝐺$ , consider the precomposition map $𝑔_{*} : 𝐺 \to 𝐺$ defined by $𝑔_{*} (𝑓) ≔ 𝑔 \circ 𝑓$ . It is a bijection with inverse ${(𝑔^{- 1})}_{*}$ . This implies that, for any set $𝐻 \subseteq 𝐺$ ,

| {𝑔_{*} (𝑓) | 𝑓 \in 𝐻} | = | 𝐻 | .

In particular, let $𝐻 = {[id]}^{𝑥}$ , then we have proven that, for any $𝑔 \in 𝐺$ ,

\begin{aligned} | {[id]}^{𝑥} | & = | {𝑔_{*} (𝑓) | 𝑓 \in {[id]}^{𝑥}} | \\ = | {𝑔 \circ 𝑓 | 𝑓 \in {[id]}^{𝑥}} | \\ = | {ℎ | 𝑔^{- 1} \circ ℎ \in {[id]}^{𝑥}} | . \end{aligned}

Now note that

𝑔^{- 1} \circ ℎ \in {[id]}^{𝑥} ⟺ 𝑔^{- 1} (ℎ (𝑥)) = 𝑥 ⟺ ℎ (𝑥) = 𝑔 (𝑥),

which implies

| {[id]}^{𝑥} | = | {ℎ | ℎ (𝑥) = 𝑔 (𝑥)} | = | {[𝑔]}^{𝑥} | .

Recall that this holds for all $𝑔 \in 𝐺$ .

Now we want to find the number of distinct sets ${[𝑔]}^{𝑥}$ . This is easily done by noticing that there is a bijection between ${[𝑔]}^{𝑥}$ and $𝑔 (𝑥)$ : Each ${[𝑔]}^{𝑥}$ uniquely defines $𝑔 (𝑥)$ and each $𝑎 = 𝑔 (𝑥)$ uniquely defines ${[𝑔]}^{𝑥} = {𝑓 \in 𝐺 | 𝑓 (𝑥) = 𝑎}$ .
Thus, the number of distinct sets ${[𝑔]}^{𝑥}$ is equal to the number of distinct values $𝑔 (𝑥)$ , i.e.

| {𝑔 (𝑥) | 𝑔 \in 𝐺} | = | {[𝑥]}_{𝐺} | .

Since $𝐺$ is partitioned into the ${[𝑔]}^{𝑥}$ , which all have the same size $| {[𝑔]}^{𝑥} | = | {[id]}^{𝑥} |$ , and there are $| {[𝑥]}_{𝐺} |$ of them, it follows that the cardinality of $𝐺$ satisfies

| 𝐺 | = | {[𝑥]}_{𝐺} | \cdot | {[id]}^{𝑥} | .

Now recall the sum from the start and apply this:

\begin{aligned} \sum_{𝑓 \in 𝐺} | {fix}_{𝑋} (𝑓) | & = \sum_{𝑥 \in 𝑋} | {[id]}^{𝑥} | \\ = \sum_{𝑥 \in 𝑋} | 𝐺 | / | {[𝑥]}_{𝐺} | . \end{aligned}

Split the sum over $𝑋$ along the orbits $𝑂 \in 𝑋 / 𝐺$ , noting that $𝑥 \in 𝑂$ implies ${[𝑥]}_{𝐺} = 𝑂$ ,

\begin{aligned} \sum_{𝑓 \in 𝐺} | {fix}_{𝑋} (𝑓) | & = \sum_{𝑥 \in 𝑋} | 𝐺 | / | {[𝑥]}_{𝐺} | \\ = \sum_{𝑂 \in 𝑋 / 𝐺} \sum_{𝑥 \in 𝑂} | 𝐺 | / | {[𝑥]}_{𝐺} | \\ = \sum_{𝑂 \in 𝑋 / 𝐺} \sum_{𝑥 \in 𝑂} | 𝐺 | / | 𝑂 | \\ = \sum_{𝑂 \in 𝑋 / 𝐺} | 𝐺 | \\ = | 𝑋 / 𝐺 | \cdot | 𝐺 | . \end{aligned}

This immediately implies

| 𝑋 / 𝐺 | = \frac{1}{| 𝐺 |} \sum_{𝑓 \in 𝐺} | {fix}_{𝑋} (𝑓) | .

Q.E.D.

5. Counting Circular Configurations

Now we’re technically ready to answer the original question of arranging 3 red and 2 blue marbles into a circle, but let’s warm up with a slightly easier one: “How many ways are there to arrange 5 marbles, any of which may be red or blue, into a circle?”

Restating this question into more formal language, we’re asking about the number of orbits, $| 𝑌 / 𝐺 |$ , where $𝑌$ is the set of 5-long sequences of red and blue marbles, and $𝐺$ is one of two subgroups of $𝑆 (𝑌)$ :

The symmetry transformations that correspond to rotations of the circle. Since we encode the “circle” as a sequence, this means that these are functions that split the sequence into two contiguous subsequences and put those back together in the opposite order. For example, “ABCDE → CDEAB” or “ABCDE → EABCD”.
In the jargon, this is called the cyclic group of order 5.
All of the previous transformations, together with reflections across either an element or a gap between elements. The number of these reflections is also equal to 5. (And in general equal to the number of elements in the sequence).
In the jargon, this is called the dihedral group of order 10.

Call these $𝐺_{1}$ and $𝐺_{2}$ .

All of these symmetry transformations can be generated from two basic elements:

The simple rotation by one place: “ABCDE → BCDEA”, denoted $𝑟$ .
Formally, it is defined as ${𝑟 (𝑥)}_{𝑖} ≔ 𝑥_{𝑖 + 1}$ for $𝑖 < 5$ and ${𝑟 (𝑥)}_{5} ≔ 𝑥_{1}$ .
Alternatively, we can use the remainder operation: $𝑑 % 𝑛$ is the non-negative remainder from dividing $𝑑$ by $𝑛$ .⁴
Using that, we can see that ${𝑟 (𝑥)}_{𝑖} ≔ 𝑥_{(𝑖 % 5) + 1}$ for all $𝑖$ . For this, zero-based indexing is more convenient; With sequence index $𝑗 \in {0, \dots, 𝑛 - 1}$ , we get ${𝑟 (𝑥)}_{𝑗} ≔ 𝑥_{(𝑗 + 1) % 5}$ .
The simple reflection across a standard axis, again denoted $𝑠$ .
For simplicity, this is the exact same operation as was used in Section 2.
Formally, it can be defined as ${𝑠 (𝑥)}_{𝑖} ≔ 𝑥_{6 - 𝑖}$ . With zero-based indexing, this becomes ${𝑠 (𝑥)}_{𝑗} ≔ 𝑥_{5 - 𝑗}$ .

In the first case, we can reach each of the possible transformations by applying $𝑟$ some number of times: $𝑟^{0} = id, 𝑟^{1} = 𝑟, 𝑟^{2}, 𝑟^{3}, 𝑟^{4}$ . Applying $𝑟$ 5 times always leaves the input untouched ( $𝑟^{5} = id$ ), so $𝑟^{- 1} = 𝑟^{4}$ .

In the second case, we have the same thing with $𝑟$ , and the reflections are all of the form $𝑟^{𝑘} \circ 𝑠 \circ 𝑟^{- 𝑘}$ , which should make intuitive sense: To reflect across some arbitrary axis, first rotate such that that axis moves to a standard position, then perform the standard reflection, then rotate back.

The first answer is then

| 𝑌 / 𝐺_{1} | = \frac{1}{5} \sum_{𝑘 = 0}^{4} | {fix}_{𝑌} (𝑟^{𝑘}) | .

We still need to find ${fix}_{𝑌} (𝑟^{𝑘})$ , but since $| {fix}_{𝑌} (𝑓) | = | {fix}_{𝑌} (𝑓^{- 1}) |$ , $𝑟^{4} = {(𝑟^{1})}^{- 1}$ and $𝑟^{3} = {(𝑟^{2})}^{- 1}$ , we only need to find this for $𝑘 = 0$ , $𝑘 = 1$ , and $𝑘 = 2$ .

For $𝑘 = 0$ , this is easy: $| {fix}_{𝑌} (𝑟^{0}) | = | {fix}_{𝑌} (id) | = | 𝑌 | = 2^{5} = 32$ .
For $𝑘 = 1$ , $𝑟^{1} = 𝑟$ moves each element of the sequence by one: ${𝑟 (𝑥)}_{𝑖} = 𝑥_{𝑖 + 1}$ for $𝑖 < 5$ and ${𝑟 (𝑥)}_{5} = 𝑥_{1}$ . Thus, $𝑟 (𝑥) = 𝑥$ decomposes into $𝑥_{𝑖 + 1} = 𝑥_{𝑖}$ for $𝑖 < 5$ and $𝑥_{1} = 𝑥_{5}$ , which results in $𝑥_{𝑖} = 𝑥_{1}$ for all $𝑖$ , so there are only two choices: RRRRR and BBBBB, i.e. $| {fix}_{𝑌} (𝑟) | = 2$ .
For $𝑘 = 2$ , $𝑟^{2}$ moves each element of the sequence by two: ${𝑟 (𝑥)}_{𝑖} = 𝑥_{(𝑖 + 2) % 5}$ . Concretely, we decompose this into $𝑥_{3} = 𝑥_{1}, 𝑥_{4} = 𝑥_{2}, 𝑥_{5} = 𝑥_{3}, 𝑥_{1} = 𝑥_{4}, 𝑥_{2} = 𝑥_{5}$ , which again results in $𝑥_{𝑖} = 𝑥_{1}$ for all $𝑖$ , so $| {fix}_{𝑌} (𝑟^{2}) | = 2$ .

This implies that

| 𝑌 / 𝐺_{1} | = \frac{1}{5} \sum_{𝑘 = 0}^{4} | {fix}_{𝑌} (𝑟^{𝑘}) | = \frac{1}{5} (32 + 4 \cdot 2) = 8 .

The second answer is found similarly, by also considering $| {fix}_{𝑌} (𝑟^{𝑘} 𝑠 𝑟^{- 𝑘}) |$ for $𝑘 = 0 \dots 4$ .
This is even easier because all of those reflections are just rotations of each other, so their fixpoint sets have to be rotations of each other and in particular have to be the same size. Thus, we only need to find $| {fix}_{𝑌} (𝑠) |$ .

$𝑠$ is defined as ${𝑠 (𝑥)}_{𝑖} = 𝑥_{6 - 𝑖}$ , which decomposes into $𝑥_{1} = 𝑥_{5}$ and $𝑥_{2} = 𝑥_{4}$ , so we can assign 3 of the 5 elements freely, giving $| {fix}_{𝑌} (𝑠) | = 2^{3} = 8$ .

This implies that

| 𝑌 / 𝐺_{2} | = \frac{1}{10} \sum_{𝑘 = 0}^{4} (| {fix}_{𝑌} (𝑟^{𝑘}) | + | {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠 \circ 𝑟^{- 𝑘}) |) = \frac{1}{10} (32 + 4 \cdot 2 + 5 \cdot 8) = 8,

meaning that the $𝐺_{1}$ -orbits of $𝑌$ are already reflection-symmetric.

We can now find a general formula that gives the number of ways to arrange $𝑛$ marbles, each red or blue, into a circle.

First of all, some caution is highly necessary with the definition of $𝐺_{2}$ . To rotate the axis of mirror symmetry by $𝑟^{𝑘}$ , you use the formula $𝑟^{𝑘} \circ 𝑠 \circ 𝑟^{- 𝑘}$ , but for even $𝑛$ , this only reaches half of the possible symmetry axes: Since $𝑠$ mirrors across an axis that doesn’t touch any marbles, all of its rotations by $𝑟^{𝑘}$ will also only mirror across such an axis.
In this case, we also need rotations of $𝑠$ by $𝑟^{\frac{2 𝑘 + 1}{2}}$ , which are the missing reflections across axes that each go through two marbles. Strictly speaking, $𝑟^{\frac{2 𝑘 + 1}{2}}$ isn’t defined in $𝐺_{2}$ , but we can save ourselves by noting that

𝑟^{𝑘} \circ 𝑠 \circ 𝑟^{- 𝑘} = 𝑟^{2 𝑘} \circ 𝑠

for all $𝑘$ , so if we set $𝑘 \leftarrow \frac{2 𝑘 + 1}{2}$ , we get that these reflections are given by

𝑟^{2 𝑘 + 1} \circ 𝑠 .

This means that we should work with the definition ${𝑟^{𝑘} \circ 𝑠 | 𝑘 \in ℤ}$ for the set of reflections, which always works, including when $𝑛$ is even.

Now let $𝑌$ be the set of sequences of $𝑛$ marbles, each red or blue. We now index these sequences starting at zero to keep the logic a bit simpler.

We first find $| {fix}_{𝑌} (𝑟^{𝑘}) |$ for $𝑘 = 0 \dots (𝑛 - 1)$ . Note that $𝑟^{𝑘}$ is defined as
$𝑟^{𝑘} {(𝑥)}_{𝑗} = 𝑥_{(𝑗 + 𝑘) % 𝑛},$
so the equation $𝑟^{𝑘} (𝑥) = 𝑥$ decomposes into
$𝑥_{(𝑗 + 𝑘) % 𝑛} = 𝑥_{𝑗}$
for all $𝑗 = 0 \dots (𝑛 - 1)$ . Substitute $𝑗 \leftarrow (𝑗 + 𝑘) % 𝑛$ and note that $((𝑗 + 𝑘) % 𝑛 + 𝑘) % 𝑛 = (𝑗 + 2 𝑘) % 𝑛$ to further find
$𝑥_{(𝑗 + 2 𝑘) % 𝑛} = 𝑥_{(𝑗 + 𝑘) % 𝑛},$
which implies
$𝑥_{(𝑗 + 2 𝑘) % 𝑛} = 𝑥_{𝑗} .$
We can repeat this process indefinitely to arrive at
$𝑥_{(𝑗 + 𝑚 𝑘) % 𝑛} = 𝑥_{𝑗}$
for all $𝑚 \in ℕ$ .
Since there are only finitely many indices, but $(𝑗 + 𝑚 𝑘) % 𝑛$ is an infinite sequence in $𝑚$ , there must be a first $𝑚$ for which $(𝑗 + 𝑚 𝑘) % 𝑛 = (𝑗 + 𝑙 𝑘) % 𝑛$ with some $𝑙 < 𝑚$ .
This equation of remainders can also be written as $𝑗 + 𝑚 𝑘 \equiv 𝑗 + 𝑙 𝑘 (mod 𝑛)$ .
It must then be the case that $𝑙 = 0$ , because otherwise we would have $𝑗 + (𝑚 - 1) 𝑘 \equiv 𝑗 + (𝑙 - 1) 𝑘 (mod 𝑛)$ , which would contradict the assumption that $𝑚$ was the first number with the desired property.
This means that there is an $𝑚$ such that, for some $𝑙 \in ℤ$ ,
$𝑗 + 𝑚 𝑘 = 𝑙 𝑛 + 𝑗 .$
Subtract $𝑗$ on both sides and divide by $𝑘$ to get
$𝑚 = \frac{𝑙 𝑛}{𝑘} .$
We want the smallest $𝑚$ for which this equation holds; In other words, we need $𝑙 𝑛$ to be the smallest multiple of $𝑛$ that is divisible by $𝑘$ .
This is, by definition, the least common multiple of $𝑘$ and $𝑛$ , so
$𝑚 = \frac{lcm (𝑘, 𝑛)}{𝑘} = \frac{𝑛}{gcd (𝑘, 𝑛)},$
where the second equality follows from $𝑘 𝑛 = gcd (𝑘, 𝑛) lcm (𝑘, 𝑛)$ .
With this, we have shown that each chain of equalities contains $\frac{𝑛}{gcd (𝑘, 𝑛)}$ entries, and since they form a partition of the elements of the sequence, there are $\frac{𝑛}{\frac{𝑛}{gcd (𝑘, 𝑛)}} = gcd (𝑘, 𝑛)$ of them.
Each of these chains leaves one choice of red or blue, so we get
$| {fix}_{𝑌} (𝑟^{𝑘}) | = 2^{gcd (𝑘, 𝑛)} .$
Now we find $| {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠) |$ , but since we redefined $𝐺_{2}$ , it’s better to not appeal to the previous argument here. (And it would also be wrong to do that, as we will shortly see.)
There are three cases to consider:
- If $𝑘$ is even, we have $𝑟^{𝑘} \circ 𝑠 = 𝑟^{\frac{𝑘}{2}} \circ 𝑠 \circ 𝑟^{- \frac{𝑘}{2}}$ and thus
  $\begin{aligned} {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠) & = {𝑦 \in 𝑌 | 𝑟^{\frac{𝑘}{2}} (𝑠 (𝑟^{- \frac{𝑘}{2}} (𝑦))) = 𝑦} \\ = {𝑟^{\frac{𝑘}{2}} (𝑧) | 𝑧 \in 𝑌, 𝑠 (𝑧) = 𝑧} \\ = {𝑟^{\frac{𝑘}{2}} (𝑧) | 𝑧 \in {fix}_{𝑌} (𝑠)} . \end{aligned}$
  Note that $𝑟^{\frac{𝑘}{2}}$ is a bijection, which immediately implies
  $| {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠) | = | {fix}_{𝑌} (𝑠) |$
  and the same logic from before gives us
  $| {fix}_{𝑌} (𝑠) | = 2^{⌈ \frac{𝑛}{2} ⌉},$
  where $⌈ 𝑟 ⌉$ is the smallest integer greater than or equal to $𝑟$ . This comes from the fact that we can freely choose the middle element for odd $𝑛$ .
- If $𝑘$ is odd and $𝑛$ is odd, we have that $𝑛 + 𝑘$ is even and $𝑟^{𝑘} = 𝑟^{𝑛 + 𝑘}$ , so this just reduces to the previous case for $𝑘 \leftarrow 𝑛 + 𝑘$ .
- If $𝑘$ is odd and $𝑛$ is even, we have $𝑟^{𝑘} \circ 𝑠 = 𝑟^{\frac{𝑘 - 1}{2}} \circ 𝑟 \circ 𝑠 \circ 𝑟^{- \frac{𝑘 - 1}{2}}$ and thus
  $\begin{aligned} {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠) & = {𝑦 \in 𝑌 | 𝑟^{\frac{𝑘 - 1}{2}} (𝑟 (𝑠 (𝑟^{- \frac{𝑘 - 1}{2}} (𝑦)))) = 𝑦} \\ = {𝑟^{\frac{𝑘 - 1}{2}} (𝑧) | 𝑧 \in 𝑌, 𝑟 (𝑠 (𝑧)) = 𝑧} \\ = {𝑟^{\frac{𝑘 - 1}{2}} (𝑧) | 𝑧 \in {fix}_{𝑌} (𝑟 \circ 𝑠)} . \end{aligned}$
  Note that $𝑟^{\frac{𝑘 - 1}{2}}$ is a bijection, which immediately implies
  $| {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠) | = | {fix}_{𝑌} (𝑟 \circ 𝑠) | .$
  Then note that $𝑟 \circ 𝑠$ is defined by
  ${𝑟 (𝑠 (𝑥))}_{𝑗} = {𝑠 (𝑥)}_{(𝑗 + 1) % 𝑛} = 𝑥_{𝑛 - (𝑗 + 1) % 𝑛},$
  which is equal to $𝑥_{𝑛 - 1 - 𝑗}$ for $𝑗 = 0 \dots 𝑛 - 2$ and to $𝑥_{𝑛 - 1}$ for $𝑗 = 𝑛 - 1$ .
  This means that $𝑟 (𝑠 (𝑥)) = 𝑥$ decomposes into $𝑥_{𝑛 - 1 - 𝑖} = 𝑥_{𝑖}$ for all $𝑗 = 0 \dots 𝑛 - 2$ , which leaves $𝑥_{\frac{𝑛}{2}}$ and $𝑥_{𝑛}$ completely free and arranges the other elements into pairs.
  Thus, there are $2 + \frac{𝑛 - 2}{2} = \frac{𝑛}{2} + 1$ places where we can choose either red or blue, so we get
  $| {fix}_{𝑌} (𝑟 \circ 𝑠) | = 2^{\frac{𝑛}{2} + 1}$
We can summarize all these cases by saying that
$| {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠) | = 2^{⌈ \frac{𝑛}{2} ⌉ + ⟦ 𝑛 even and 𝑘 odd ⟧} .$

For the answers, this gives us

\begin{aligned} | 𝑌 / 𝐺_{1} | & = \frac{1}{𝑛} \sum_{𝑘 = 0}^{𝑛 - 1} | {fix}_{𝑌} (𝑟^{𝑘}) | \\ = \frac{1}{𝑛} \sum_{𝑘 = 0}^{𝑛 - 1} 2^{gcd (𝑘, 𝑛)} \\ = \frac{1}{𝑛} \sum_{𝑑 ∣ 𝑛} | {𝑘 \in {0, \dots, 𝑛 - 1} | gcd (𝑘, 𝑛) = 𝑑} | 2^{𝑑}, \\ | 𝑌 / 𝐺_{2} | & = \frac{1}{2 𝑛} \sum_{𝑘 = 0}^{𝑛 - 1} (| {fix}_{𝑌} (𝑟^{𝑘}) | + | {fix}_{𝑌} (𝑟^{𝑘} \circ 𝑠) |) \\ = \frac{1}{2} (| 𝑌 / 𝐺_{1} | + 2^{⌈ \frac{𝑛}{2} ⌉ + ⟦ 𝑛 even and 𝑘 odd ⟧}) . \end{aligned}

There’s a special case to look out for: If $𝑛 = 2$ , then $𝑟 = 𝑠$ , so the $𝐺_{2}$ formula breaks down, but that’s fine, since the $𝐺_{1}$ formula already respects all the wanted symmetries included in this case.
Thus, let the second formula require $𝑛 \geq 3$ .⁵

The $𝐺_{1}$ formula can be simplified a bit further:
We find $| {𝑘 \in {0, \dots, 𝑛 - 1} | gcd (𝑘, 𝑛) = 𝑑} |$ by first noting that the set being measured is a subset of ${𝑘 \in {0, \dots, 𝑛 - 1} : 𝑑 ∣ gcd (𝑘, 𝑛)}$ . Then note that $𝑑 ∣ 𝑛$ is a given, so $𝑑 ∣ gcd (𝑘, 𝑛) ⟺ 𝑑 ∣ 𝑘$ , so the second set is just ${𝑘 \in {0, \dots, 𝑛 - 1} : 𝑑 ∣ 𝑘}$ , i.e. the set of multiples of $𝑑$ below $𝑛$ .
All of these take the form $ℓ 𝑑$ and the condition on $ℓ$ can be read off from

ℓ 𝑑 < 𝑛 ⟺ ℓ < \frac{𝑛}{𝑑} .

Thus, we find that

\begin{aligned} | {𝑘 \in {0, \dots, 𝑛 - 1} | gcd (𝑘, 𝑛) = 𝑑} | & = | {ℓ \in {0, \dots, \frac{𝑛}{𝑑} - 1} | gcd (ℓ 𝑑, 𝑛) = 𝑑} | \\ = | {ℓ \in {0, \dots, \frac{𝑛}{𝑑} - 1} | gcd (ℓ, \frac{𝑛}{𝑑}) = 1} | \\ = 𝜑 (\frac{𝑛}{𝑑}), \end{aligned}

where $𝜑$ is Euler’s totient function and the last equality is by its definition.

This gives us

| 𝑌 / 𝐺_{1} | = \frac{1}{𝑛} \sum_{𝑑 ∣ 𝑛} 𝜑 (\frac{𝑛}{𝑑}) 2^{𝑑} .

More concretely, we have

𝜑 (𝑚) = 𝑚 \prod (1 - \frac{1}{𝑝}),

where the product is over primes $𝑝$ dividing $𝑚$ .

We thus use the prime factorization $𝑛 = \prod_{𝑖 = 1}^{𝑚} 𝑝_{𝑖}^{𝑘_{𝑖}}$ , where ${(𝑝_{𝑖})}_{𝑖 = 1}^{𝑚}$ is some finite sequence of prime numbers and ${(𝑘_{𝑖})}_{𝑖 = 1}^{𝑚}$ some finite sequence of positive integers.
Then, the sum over the divisors of $𝑛$ splits into $𝑚$ sums over the exponents of the $𝑝_{𝑖}$ :

\begin{aligned} | 𝑌 / 𝐺_{1} | & = \frac{1}{𝑛} \sum_{𝑎_{1} = 0}^{𝑘_{1}} \dots \sum_{𝑎_{𝑚} = 0}^{𝑘_{𝑚}} 𝜑 (\prod_{𝑖 = 1}^{𝑚} 𝑝_{𝑖}^{𝑘_{𝑖} - 𝑎_{𝑖}}) 2^{\prod_{𝑖 = 1}^{𝑚} 𝑝_{𝑖}^{𝑎_{𝑖}}} \\ = \frac{1}{𝑛} \sum_{𝑎_{1} = 0}^{𝑘_{1}} \dots \sum_{𝑎_{𝑚} = 0}^{𝑘_{𝑚}} \prod_{𝑖 = 1}^{𝑚} (𝑝_{𝑖}^{𝑘_{𝑖} - 𝑎_{𝑖}} (1 - ⟦ 𝑎_{𝑖} < 𝑘_{𝑖} ⟧ \frac{1}{𝑝_{𝑖}})) 2^{\prod_{𝑖 = 1}^{𝑚} 𝑝_{𝑖}^{𝑎_{𝑖}}} \\ = \frac{1}{𝑛} \sum_{𝑎_{1} = 0}^{𝑘_{1}} \dots \sum_{𝑎_{𝑚} = 0}^{𝑘_{𝑚}} \prod_{𝑖 = 1}^{𝑚} ((𝑝_{𝑖} - ⟦ 𝑎_{𝑖} < 𝑘_{𝑖} ⟧) 𝑝_{𝑖}^{𝑘_{𝑖} - 𝑎_{𝑖} - 1}) 2^{\prod_{𝑖 = 1}^{𝑚} 𝑝_{𝑖}^{𝑎_{𝑖}}} . \end{aligned}

As a nice example with applications to music theory, let’s take $𝑛 = 12$ . In western music theory, it’s typical to work with 12-tone equal temperament, which is a tuning system with 12 notes per octave. Notes an octave apart are considered the same note, so we’re effectively working with a circle with 12 elements.

A scale can then be defined as any subset of these 12 notes. It corresponds to an arrangement of 12 marbles, each red or blue, into a circle: Just take a red marble for all notes that aren’t in the subset and a blue one for those that are.

Different scales related to each other by rotation of the circle are called modes of each other, and the reflection of a scale across a certain axis is called its inversion.

This means that counting the number of different scales, considering modes and/or inversions to be the same scale, can be done with our two formulae. Concretely, $12 = 2^{2} 3^{1}$ , so there are

\begin{aligned} | 𝑌 / 𝐺_{1} | & = \frac{1}{12} \sum_{𝑎 = 0}^{2} \sum_{𝑏 = 0}^{1} ((2 - ⟦ 𝑎 < 2 ⟧) 2^{2 - 𝑎 - 1}) ((3 - ⟦ 𝑏 < 1 ⟧) 3^{1 - 𝑏 - 1}) 2^{2^{𝑎} 3^{𝑏}} \\ = \frac{1}{12} ((8 + 16) + (8 + 64) + (32 + 4096)) \\ = 352 . \end{aligned}

scales when considering different modes as the same scale.

Further, when also considering inversions as the same scale, the number drops to

\begin{aligned} | 𝑌 / 𝐺_{2} | & = \frac{1}{2} (| 𝑌 / 𝐺_{1} | + 2^{⌈ \frac{12}{2} ⌉}) \\ = \frac{1}{2} (352 + 64) \\ = 208 . \end{aligned}

5.1. Final Answer

Let’s directly start with the general form of the original question: How many ways are there to arrange $𝑘$ red and $(𝑛 - 𝑘)$ blue marbles into a circle?

Again, we have $𝑋$ , the set of sequences of $𝑘$ red and $(𝑛 - 𝑘)$ blue marbles and $𝐺_{1}$ and $𝐺_{2}$ like before.

For the fixpoint set sizes, we can reuse some of the logic from the previous section:

For $| {fix}_{𝑋} (𝑟^{𝑎}) |$ , we still have $gcd (𝑎, 𝑛)$ chains of equalities, each involving $𝑛 / gcd (𝑎, 𝑛)$ sequence elements. To be able to assign the marbles to these, both $𝑘$ and $(𝑛 - 𝑘)$ need to be multiples of $𝑛 / gcd (𝑎, 𝑛)$ . This condition is actually equivalent to only $𝑘$ being a multiple of $𝑛 / gcd (𝑎, 𝑛)$ (or alternatively to only $(𝑛 - 𝑘)$ being one),⁶
If this is satisfied, we can assign red to $𝑘 / (𝑛 / gcd (𝑎, 𝑛)) = \frac{𝑘}{𝑛} gcd (𝑎, 𝑛)$ of the chains and blue to the rest. This gives a factor of $𝐶 (gcd (𝑎, 𝑛), \frac{𝑘}{𝑛} gcd (𝑎, 𝑛))$ . If the condition isn’t satisfied, it’s impossible to satisfy the equality chains created from $𝑟^{𝑘} (𝑥) = 𝑥$ , so $| {fix}_{𝑋} (𝑟^{𝑘}) | = 0$ .
Thus, we get
$| {fix}_{𝑋} (𝑟^{𝑘}) | = ⟦ 𝑛 / gcd (𝑎, 𝑛) ∣ 𝑘 ⟧ 𝐶 (gcd (𝑎, 𝑛), \frac{𝑘}{𝑛} gcd (𝑎, 𝑛)) .$
For $| {fix}_{𝑋} (𝑟^{𝑎} \circ 𝑠) |$ , the same logic about rotations applies, so we only need to look at $𝑎 = 0$ and $𝑎 = 1$ .
- For $| {fix}_{𝑋} (𝑠) |$ , the logic is actually identical to Section 2, so we have
  $| {fix}_{𝑋} (𝑠) | = ⟦ 𝑛 odd or 𝑘 even ⟧ 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋)$
- For even $𝑛$ , we need to consider $| {fix}_{𝑋} (𝑟 \circ 𝑠) |$ . There, like in the last section, we have two single slots and $\frac{𝑛}{2} - 1$ pairs to fill.
  - If $𝑘$ is even, then so is $𝑛 - 𝑘$ , so the two free slots are forced to also be the same as each other, so we end up with $\frac{𝑛}{2}$ pairs, which just gives an easy factor of
    $𝐶 (\frac{𝑛}{2}, \frac{𝑘}{2}) .$
  - If $𝑘$ is odd, then so is $𝑛 - 𝑘$ , so the two free slots are forced to be opposites of each other, so we end up with one choice of which is which, followed by the choice of putting the remaining $𝑘 - 1$ red marbles into the $\frac{𝑛}{2} - 1$ pairs. This therefore gives a factor of
    $2 𝐶 (\frac{𝑛}{2} - 1, \frac{𝑘 - 1}{2}) .$
  In total, we therefore get
  $| {fix}_{𝑋} (𝑟 \circ 𝑠) | = (1 + ⟦ 𝑘 odd ⟧) 𝐶 (\frac{𝑛}{2} - ⟦ 𝑘 odd ⟧, ⌊ \frac{𝑘}{2} ⌋) .$
These can be consolidated as follows:
$\begin{aligned} | {fix}_{𝑋} (𝑟^{𝑎} \circ 𝑠) | & = {\begin{cases} 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋), & 𝑛 odd \\ ⟦ 𝑘 even ⟧ 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋), & 𝑛 even, 𝑎 even \\ 𝐶 (\frac{𝑛}{2}, \frac{𝑘}{2}), & 𝑛 even, 𝑎 odd, 𝑘 even \\ 2 𝐶 (\frac{𝑛}{2} - 1, \frac{𝑘 - 1}{2}), & 𝑛 even, 𝑎 odd, 𝑘 odd \end{cases} \\ = {\begin{cases} 2 ⟦ 𝑎 odd ⟧ 𝐶 (\frac{𝑛}{2} - 1, \frac{𝑘 - 1}{2}), & 𝑛 even, 𝑘 odd \\ 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋), & otherwise \end{cases} . \end{aligned}$
The sum of this over all $𝑎 \in {0, \dots, 𝑛 - 1}$ is therefore
$\begin{aligned} \sum_{𝑎 = 0}^{𝑛 - 1} | {fix}_{𝑋} (𝑟^{𝑎} \circ 𝑠) | & = {\begin{cases} 𝑛 𝐶 (\frac{𝑛}{2} - 1, \frac{𝑘 - 1}{2}), & 𝑛 even, 𝑘 odd \\ 𝑛 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋), & otherwise \end{cases} \\ = 𝑛 {\begin{cases} 𝐶 (⌊ \frac{𝑛}{2} ⌋ - 1, ⌊ \frac{𝑘}{2} ⌋), & 𝑛 even, 𝑘 odd \\ 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋), & otherwise \end{cases} \\ = 𝑛 𝐶 (⌊ \frac{𝑛}{2} ⌋ - ⟦ 𝑛 even and 𝑘 odd ⟧, ⌊ \frac{𝑘}{2} ⌋) . \end{aligned}$

Thus, we get

\begin{aligned} | 𝑋 / 𝐺_{1} | & = \frac{1}{𝑛} \sum_{𝑎 = 0}^{𝑛 - 1} ⟦ 𝑛 / gcd (𝑎, 𝑛) ∣ 𝑘 ⟧ 𝐶 (gcd (𝑎, 𝑛), \frac{𝑘}{𝑛} gcd (𝑎, 𝑛)) \\ = \frac{1}{𝑛} \sum_{𝑑 ∣ 𝑛} | {𝑎 \in {0, \dots, 𝑛 - 1} | gcd (𝑎, 𝑛) = 𝑑} | ⟦ \frac{𝑛}{𝑑} ∣ 𝑘 ⟧ 𝐶 (𝑑, \frac{𝑘}{𝑛} 𝑑) \\ = \frac{1}{𝑛} \sum_{𝑑 ∣ 𝑛} ⟦ \frac{𝑛}{𝑑} ∣ 𝑘 ⟧ 𝜑 (\frac{𝑛}{𝑑}) 𝐶 (𝑑, \frac{𝑘}{𝑛} 𝑑), \\ = \frac{1}{𝑛} \sum_{𝑑 ∣ 𝑛} ⟦ 𝑑 ∣ 𝑘 ⟧ 𝜑 (𝑑) 𝐶 (\frac{𝑛}{𝑑}, \frac{𝑘}{𝑑}) \\ = \frac{1}{𝑛} \sum_{𝑑 ∣ gcd (𝑘, 𝑛)} 𝜑 (𝑑) 𝐶 (\frac{𝑛}{𝑑}, \frac{𝑘}{𝑑}), \\ | 𝑋 / 𝐺_{2} | & = \frac{1}{2} (| 𝑋 / 𝐺_{1} | + \frac{1}{𝑛} \sum_{𝑎 = 0}^{𝑛 - 1} | {fix}_{𝑋} (𝑟^{𝑎} \circ 𝑠) |) \\ = \frac{1}{2} (| 𝑋 / 𝐺_{1} | + 𝐶 (⌊ \frac{𝑛}{2} ⌋ - ⟦ 𝑛 even and 𝑘 odd ⟧, ⌊ \frac{𝑘}{2} ⌋)) . \end{aligned}

Concretely, for the introductory example ( $𝑛 = 5, 𝑘 = 3$ ), this gives

\begin{aligned} | 𝑋 / 𝐺_{1} | & = \frac{1}{5} 𝜑 (1) 𝐶 (5, 3) = \frac{10}{5} = 2, \\ | 𝑋 / 𝐺_{2} | & = \frac{1}{2} (2 + 𝐶 (2, 1)) = \frac{1}{2} (2 + 2) = 2 . \end{aligned}

Concretely, these orbits are {RRRBB, RRBBR, RBBRR, BBRRR, BRRRB} and {RRBRB, RBRBR, BRBRR, RBRRB, BRRBR}.

Finally, this problem also has an application in music theory, when considering a more conservative definition of a scale: Here, a scale is defined by an arrangement of 5 whole steps and 2 half steps into a circle. This lets us apply the formula with $𝑛 = 7$ and $𝑘 = 2$ , which gives

\begin{aligned} | 𝑋 / 𝐺_{1} | & = \frac{1}{7} 𝜑 (1) 𝐶 (7, 2) = \frac{21}{7} = 3, \\ | 𝑋 / 𝐺_{2} | & = \frac{1}{2} (3 + 𝐶 (3, 1)) = \frac{1}{2} (3 + 3) = 3 . \end{aligned}

Concretely, these orbits are

{WWWWWHH, WWWWHHW, WWWHHWW, WWHHWWW, WHHWWWW, HHWWWWW, HWWWWWH},
{WWWWHWH, WWWHWHW, WWHWHWW, WHWHWWW, HWHWWWW, WHWWWWH, HWWWWHW}, and
{WWWHWWH, WWHWWHW, WHWWHWW, HWWHWWW, WWHWWWH, WHWWWHW, HWWWHWW}.

The last of them corresponds to the well-known modes of the major/minor scale:

{Lydian, Mixolydian, Aeolian, Locrian, Ionian, Dorian, Phrygian}.

The second-to-last of them corresponds to the modes of the also somewhat well-known melodic minor scale, WHWWWWH. The mode HWHWWWW is also known as the superlocrian scale.

Alternatively, we can also use the formula to find the size of specific subsets of the set of scales (now using the previous 12-based definition again). For example, the numbers of 7-tone scales are

\begin{aligned} | 𝑋 / 𝐺_{1} | & = \frac{1}{12} 𝜑 (1) 𝐶 (12, 7) = 66, \\ | 𝑋 / 𝐺_{2} | & = \frac{1}{2} (66 + 𝐶 (5, 3)) = 38 . \end{aligned}

That’s already way more than the measly 3 we got above.

6. Conclusion

This has been a journey from a simple-sounding question to combinatorics, group theory, back to combinatorics with Burnside’s Lemma, and finally to the answer of that question.

There are lots of related questions I didn’t bring up here, such as considering more than two colors for the marbles, or asking about the size of more specific subsets, but if you followed along well, you’re probably equipped to answer those on your own now.

Most of all, I hope you had fun with this. I certainly did.

Proof: WLoG let $𝑘$ be odd, then
$\begin{aligned} 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑛 - 𝑘}{2} ⌋) & = 𝐶 (\frac{𝑛 - 1}{2}, \frac{𝑛 - 𝑘}{2}) \\ = 𝐶 (\frac{𝑛 - 1}{2}, \frac{𝑛 - 1}{2} - \frac{𝑘 - 1}{2}) \\ ⩮ 𝐶 (\frac{𝑛 - 1}{2}, \frac{𝑘 - 1}{2}) = 𝐶 (⌊ \frac{𝑛}{2} ⌋, ⌊ \frac{𝑘}{2} ⌋), \end{aligned}$
where step $*$ used the reflection formula (archived). ↩︎
I didn’t want to choose just rotating something in space as the example because that already invites the objection that that’s not an “operation”, because our brains already model object permanence in a way that respects spatial symmetries. ↩︎
This proof also includes the proofs of special cases of Lagrange’s theorem and the orbit-stabilizer theorem, since I don’t want to presuppose them and we won’t be needing them by name. ↩︎
This can also be defined as $𝑑 % 𝑛 ≔ 𝑑 - 𝑛 ⌊ \frac{𝑑}{𝑛} ⌋$ . ↩︎
$𝑛 = 1$ and $𝑛 = 0$ are totally useless and we can just ignore them. ↩︎
The common divisors of $𝑘$ and $𝑛 - 𝑘$ are the same as those of $𝑘$ and $𝑛$ , so $𝑛 / gcd (𝑎, 𝑛)$ divides both $𝑘$ and $𝑛 - 𝑘$ if and only if it divides both $𝑘$ and $𝑛$ . But it always divides $𝑛$ since $𝑛 = gcd (𝑎, 𝑛) \cdot (𝑛 / gcd (𝑎, 𝑛))$ , so it only has to divide $𝑘$ . ↩︎