5. Collections of Events#
Most questions in data science involve multiple variables and events. Random variables and their joint distributions give us a way to set up probabilistic models for how our data originate. Some techniques are particularly useful for working with large collections of variables and events. These include:
Using bounds when exact values are difficult to calculate
Noticing patterns when working with small collections and then generalizing to larger ones
Using symmetry, both for insight and for simplifying calculation
In this chapter we will study powerful examples of all these techniques.