Who cares about data provenance?

Who cares about data provenance?

Without context, there’s no meaning

To better understand a person, it’s often helpful to understand their context. What language do they speak? What city are they from? How old are they? Which schools did they attend and what did they study?

Data is no different. If you don’t know where your data came from, who created it, when it was made, and which dataset it belongs to, it can be premature or even misguided to draw meaningful conclusions.

However, tracking and communicating this source information can be very difficult, particularly in typical environments where data producers and consumers have unaligned incentives, as is often the case.

A few financial examples

One area we’re focused on given our prior experience is financial data. It’s very easy to make a profitable financial strategy with the benefit of hindsight. Thus, knowing when financial data was created and who created it is almost as important as the observations themselves.

If you’re constructing a financial index, tracking a portfolio, making financial predictions, or generating data that may be useful for predictive analysis, recording the provenance of this information will likely make your data more trustworthy and valuable.

Introducing vBase

vBase is a cheap, scalable, and robust means of assuring the provenance of data records and digital objects, and of communicating it to others.

If you work with time-sensitive financial data or know people who do, please get in touch with us at hello@vbase.com or just start using our app or simple SDKs. We’d love to hear from you.

The views expressed in this post are those of validityBase and do not constitute professional advice. Any information provided here is for informational purposes only.

This article may be republished under the following conditions: attribution must be given to validityBase with a link to the original post.

This work is licensed under a Creative Commons Attribution 4.0 International License.

vBase Blog

Recent Posts

Whose data is worth more? Whose data is worth more?

Data with validated provenance is worth more. A lot more in some cases

Dan Averbukh
Stop Using Brokerage Statements to Show Your Track Record Stop Using Brokerage Statements to Show Your Track Record

Many traders use brokerage statements to create a verifiable track record. Unfortunately they are not credible for this purpose. A better alternative exists.

Dan Averbukh
4 reasons people don’t trust your backtest 4 reasons people don’t trust your backtest

Some analysts spend months building backtests that no-one is willing to trust. Learn why, and what to do about it.

Dan Averbukh