Tamr Insights
Tamr Insights
The Leader in Data Products
October 10, 2019

3 Things CDOs Need to Know about Data Mastering at Scale

3 Things CDOs Need to Know about Data Mastering at Scale

As a Chief Data Officer (CDO), you can recognize that while traditional Master Data Management (MDM) techniques might have worked well for the type and scale of data mastering challenges that existed 15 years ago when the solution came about, today’s data challenges require a different approach.  

Increasingly, utilizing these traditional approaches on the vast amounts and variety of data that enterprises are accumulating is slow, labor intensive, and extremely costly. As you work to help your enterprise fully leverage its data as an asset, here are three things to keep in mind about data mastering at scale.

1. Traditional approaches create traditional results

The fact is, traditional approaches to data mastering produce traditional results—which is to say, limited. The velocity and variety of data has outgrown the old approaches we used, limiting a corporation’s ability to quickly and cost-effectively analyze data. Let’s explore this.

Exploring the limitations of traditional MDM
Master Data Management defines and manages an enterprise’s critical data to provide a single point of reference. This single truth allows you to accurately answer basic questions about any metric or KPI. This has been invaluable for companies seeking a single source of truth about an entity that other sources can reference further down the stream. There’s a lineage to the records and flexibility in how the records are created. Absent of errors, there’s no duplicates or unmatched data—creating accurate views of each entity.

The problem is that MDM requires a human-intensive process to deliver rules-based truths. This is a complex way of saying that it’s not scalable, and moreover, its dependent on constant manual reviews of exceptions. This means that there will be a large portion (and ever growing portion) of data that remains unmastered, and the only solution will be for enterprises to pay a premium in resources.

Exploring the limitations of ETL

Extract, Transform & Load (ETL) creates a global schema up front. Certainly, it’s effective at moving data and performing masterings with simple rules, but it can’t be scaled because it takes too much time. Enterprises with a huge amount of data simply can’t consider ETL to be a viable option. Additionally, it isn’t designed to create a single point of truth, so it’s missing that consistency across consumption points.

These traditional approaches can be perfect for small problems without real time requirements. But to solve for large numbers of complex records with real time requirements that enterprises like yours experience, you have to evolve a data mastering approach at scale. As a new CDO, it’s crucial that you guide your company in rethinking data integration and mastering to tilt the odds of success more heavily in your favor. Ultimately, you need to implement an agile data mastering approach that utilizes machine learning aka MDM 2.0.

2. The Future of Data Management is Agile

The software development industry has been employing agile approaches for years, and you can use those same practices in data management. Agile Data Mastering (ADM) connects people, processes and tools together to treat data unification as an iterative process. Simply, ADM brings humans and machines together—experts train and validate machine learning models, so the tool’s accuracy improves. This means the smarter the model becomes, the less human interaction is required. This collaboration can drive key benefits for enterprises and change the way businesses use and manage their data. Here are just a handful of the benefits of ADM:

  1. It’s scalable: Human expertise combined with machine learning allows companies to integrate datasets from a variety of sources, allowing scale without sacrificing accuracy.
  2. It’s faster: ADM tools can deliver results in days that traditional methods might have needed months or even years to get.
  3. It allows for team optimization: When experts aren’t spending endless days on data prep, they can focus on more specialized efforts.
  4. It opens new opportunities: With cost-to-know reduced, projects shelved for “high costs and risk” can finally get the attention they deserve.
  5. It promotes flexibility: ADM allows teams to respond to the unexpected effectively.

Companies like Tamr utilize Agile Data Mastering to drive analytic outcomes and solve data problems every day. This human-guided, machine learning modern data management model is crucial for enterprises because it unifies data up to 10 times faster than traditional methods at up to 90% lower cost.

3. Data Mastering at Scale Requires Humans in the Loop

You know what agile data mastering can do to advance traditional MDM. Now, you need to know how to do it. There are a few essentials required:

  1. Machine learning: Traditional MDM required manual analysis of systems, coding of data, and definition of rules. Modern data mastering utilizes machine learning models that do these things in a fraction of both the time and the cost.
  2. Expert input: Humans will train the machine learning models in order to ensure accuracy. Instead of juggling thousands of rules, now, human experts can simply define a handful of rules that turn the machine learning models into experts.
  3. Transparency: Visibility is key in machine learning management. This helps the models ensure accuracy and compliance with regulatory audits.
  4. Low latency matching: Data ops teams will integrate MDM systems back into ops systems as a system of record. These operational systems will then need sub-second access to the MDM in order to suggest mastered data to users or update master data at the source. This ensures source data remains better synchronized across the entire enterprise.
  5. Continuous innovation: Future data mastering opportunities will come to market as machine learning models, including data error detection and correction and model management. By remaining open to these options, enterprises can ensure they are on the forefront of data mastering and don’t fall behind the curve.  

These requirements will help ensure you can achieve the full promise of modern data mastering at the scale your organization needs.

Start Building Your Model for Data Mastering at Scale

As a new CDO, your success is based on your ability to hit the ground running. That’s why we’ve developed a guide to help you get started. Learn more by downloading our new ebook, The CDO’s Guide to Data Mastering at Scale, below.