Note that this requires not a multiple of 4 but of 8 participants to be perfectly counterbalanced. Also when randomizig it it not considered that 🟠 1 = 🟠 1 they are treated as two separate conditions in the Latin Square, which can yield some patterns reoccuring more often then others. However, it is hard to tell what a better solution would be.