Dataset contains the qri ("query") dataset document definition. This package contains the base definition, as well as a number of subpackages that build from this base to add functionality as necessary Datasets take inspiration from HTML documents, deliniating semantic purpose to predefined tags of the document, but instead of orienting around presentational markup, dataset documents emphasize interoperability and composition. The principle encoding format for a dataset document is JSON.
- compression: defines supported types of compression for interpreting a dataset
- detect: dataset structure & schema inference
- dsfs: "datasets on a content-addressed file system" tools to work with datasets stored with the cafs interface:
github.com/qri-io/qfs/cafs
- dsgraph: expressing relationships between and within datasets as graphs
- dsio:
io
primitives for working with dataset bodies as readers, writers, buffers, oriented around row-like "entries". - dstest: utility functions for working with tests that need datasets
- dsutil: utility functions that avoid dataset bloat
- generate: io primitives for generating data
- use_generate: small package that uses generate to create test data
- validate: dataset validation & checking functions
- vals: data type mappings & definitions
We would love involvement from more people! If you notice any errors or would like to submit changes, please see our Contributing Guidelines.