Open-source repositories tagged with #training-data, ranked by health score.
New and extensible file format for storage of large columnar datasets.