Collections of Events

Most questions in data science involve multiple variables and events. Random variables and their joint distributions give us a way to set up probabilistic models for how our data originate. Some techniques are particularly useful for working with large collections of variables and events. These include:

  • Using bounds when exact values are difficult to calculate
  • Noticing patterns when working with small collections and then generalizing to larger ones
  • Using symmetry, both for insight and for simplifying calculation

In this chapter we will study powerful examples of all these techniques.