Feature request: Example datasets
Things to include:
- Datasets from sklearn
- Datasets from tensorflow/keras; list of datasets
- Datasets sampled "on-the-fly", for example the Gaussian mixture dataset from this paper.
- Additional stuff*?
*Related: Google has a released a dataset search engine, so feel free to make suggestions!
Might be very useful to validate surrogates, preprocessors etc.