Sean C. Crosby

Sampling for Maximum Dissimilarity

Creating a subsample of data that covers your N-dimensional space

Sampling Sampling data can be done in many ways depending on what is desired. Most often a random uniform sampling is used to collect a small subset for preliminary analysis. This can reduce computation and provide rapid insight. There are, however, other reasons to sample data, such as create a representative sample set that covers the data range. A particular example arises in earth sciences when the goal is to model weather or ocean conditions across the range of possible forcing.