Realizing the potential of clean mastered data for businesses

Michael Stonebraker

How to Avoid the 10 Big Data Analytics Blunders

Leading organizations are leveraging an analytics-driven approach—fueled and informed by data—to achieve marketplace advantages and create entirely new business models. However, even the savviest companies are repeating common missteps. I recently gave a presentation on this very topic at the…

Aug 27, 2020 Featured Content

Data Mastering at Scale

Data mastering (sometimes called Master Data Management or MDM for short) is now 15 years old. It arose because enterprises have been creating independent business units (IBUs) for a long time with substantial freedom of action. This allows IBUs to…

Sep 23, 2019 Featured Content

Every decade, federated DBMSs reappear. This time around they may play a role in data integration.

Every decade there are three or four new systems who federate data in disparate systems. This has been a regular occurrence for the last forty or so years. In fact, I am responsible for proposing one in the 1970’s (Distributed…

Sep 10, 2019 Featured Content

Data Stewardship in the Age of Machine Learning

Suppose you are a data steward, responsible for integrating a collection of data sources, S1, …, Sn.  Historically, you would perform the following steps: Have your best programmer define a global schema GS,  which the various sources will accommodate. Have…

Aug 21, 2019 Featured Content

Big Data, Disruption, and the 800-Pound Gorilla in the Corner

As Big Data continues to evolve, we see companies working to address the amount of data that’s coming in, the rate at which it’s coming in, and the variation in data itself. There are a number of companies out there…

Feb 12, 2019 Featured Content

Three Generations of AI: From Data Warehouses to Machine Learning

For years, data unification has been a roadblock to effective data analytics. Data scientists report spending 80% of their time locating data, unifying it from multiple sources, and cleaning it before they can begin their analytics work in earnest. As…

Jan 29, 2019 Featured Content

Scalable Data Integration: Five Tenets for Success

By Michael Stonebraker, Tamr Co-Founder and CTO Introduction Data curation involves: ingesting data sources, cleaning errors from the data (-99 often means null), transforming attributes into other ones (for example, Euros to dollars), performing schema integration to connect up disparate…

Apr 7, 2015 Featured Content

Three Generations of Data Integration Systems

(In this entry, I explain the three generations of data integration products and note what appears to have caused the transitions between the product families.) In the 1990s, data warehouses arrived on the scene. Led by the major retailers, customer-facing…

May 18, 2014 Insights