datasets-0.2.1: Classical data sets for statistics and machine learning
Classical machine learning and statistics datasets from the UCI Machine Learning Repository and other sources.
The datasets package defines two different kinds of datasets:
- small data sets which are directly (or indirectly with `file-embed`) embedded in the package as pure values and do not require network or IO to download the data set. This includes Iris, Anscombe and OldFaithful.
- other data sets which need to be fetched over the network with
getDataset
and are cached in a local temporary directory.
import Numeric.Datasets (getDataset) import Numeric.Datasets.Iris (iris) import Numeric.Datasets.Abalone (abalone) main = do -- The Iris data set is embedded print (length iris) print (head iris) -- The Abalone dataset is fetched abas <- getDataset abalone print (length abas) print (head abas)
Modules
- Numeric
- Numeric.Datasets
- Numeric.Datasets.Abalone
- Numeric.Datasets.Adult
- Numeric.Datasets.Anscombe
- Numeric.Datasets.BostonHousing
- Numeric.Datasets.BreastCancerWisconsin
- Numeric.Datasets.CO2
- Numeric.Datasets.Car
- Numeric.Datasets.Iris
- Numeric.Datasets.Michelson
- Numeric.Datasets.Nightingale
- Numeric.Datasets.OldFaithful
- Numeric.Datasets.Quakes
- Numeric.Datasets.States
- Numeric.Datasets.Sunspots
- Numeric.Datasets.UN
- Numeric.Datasets.Vocabulary
- Numeric.Datasets.Wine
- Numeric.Datasets.WineQuality
- Numeric.Datasets