CIO’s: No Greater Than the Sum of Their Parts

When I think of the computer hobbyist, I think of Steve Wozniak.   The computer hobbyist is someone who gets joy from the craft of putting together pieces. They find pleasure in spending hour after hour, long into the night,…

Read More


Three Enablers For Machine Learning In Data Unification: Trust, Legacy, And Scale

Note: This article was originally posted on the O’Reilly website. Data unification is the process of combining multiple, diverse data sets and preparing them for analysis by matching, deduplicating, and otherwise cleaning the records (Figure 1). This effort consumes more…

Read More


AWS Glue, ETL, and the Persistent Challenge of Data Variety

Yesterday Amazon announced the public availability of AWS Glue which they describe as a fully managed ETL service that aims to streamline the challenges of data preparation. The service was previewed back in December 2016 at Amazon’s re:Invent conference, so…

Read More


Death by Data Variety: Two Decades Working On The Cure

Data variety has been the Achilles’ heel of enterprise BI and analytics projects since long before the three V model of Big Data brought it into the spotlight (along with its siblings ‘volume’ and ‘velocity’). Over the past 20 years,…

Read More