While kamu is a very powerful tool for managing and processing data on your own computer, the real power of it becomes apparent only when you start exchanging data with other people. Thanks to its core properties it makes sharing data reliable and safe both within your organization and between multiple completely independent parties.

    Repositories

    Repository Types Data in kamu is shared via repositories. There are multiple types of repositories that differ by the kinds of services they provide. The most basic repository allows you to simply upload (“push”) and download (“pull”) data. Type Description Capabilities URL Examples Local FS A basic repository that uses the local file system folder. Mainly used for examples and testing. pullpush file:///home/me/example/repositoryfile:///c:/Users/me/example/repository S3 A basic repository that stores data in Amazon S3 bucket.

    Validity of Data

    This topic is also covered in detail in this video: With kamu sharing data becomes very easy, but with that problem out of the way you will soon start wondering “How can I trust the data I downloaded from someon else?”. Let’s first define what validity or trustworthiness of data means. Validity of Root Data Let’s say you’re about to use a root dataset containing historical weather information in your city.

    Provenance