Master Data Management (MDM) as a practice has been around for at least a decade but still there is plenty of confusion about what it means and why it is important. It is important to get grounded on what is Master Data and what is Master Data Management (MDM) before we dig into why it […]
Continue readingIn this series on Data Strategy, we covered 5 ‘W’s of Data Strategy and ‘How To Deliver Data Strategy In 4 Steps’. In this article, we’ll cover the various components of Data Strategy. I believe that Data Strategy document should include all or at least some of these components: Background / Context: This section should […]
Continue readingMuch has been written about Data Quality (DQ) in the broader context of Data / Information Management and most of the practitioners can recite it’s dimensions (accuracy, completeness, timeliness, uniqueness, consistency, timeliness etc.), DQ assessment / profiling, and step-by-step approach to enhancing DQ. Unlike, Data Governance though, there hasn’t been much about Data Quality Framework […]
Continue readingAs we hear more and more about Data Governance, the rank and file begin to ask the fundamental question of ‘What is data Governance?” It is probably better to start with Governance. As per the Oxford Dictionary, Governance is the act of controlling, influencing, or regulating a person, action, or course of events. Applying this […]
Continue readingData mining examples can be found in many industries and across many functions. Some data mining examples are listed below in: Telecommunications, Retail, Credit card companies, E-commerce, Human resources, educational institutions, crime agencies. Telecom service Providers Phone and telecom providers attempt to predict ‘churn’ i.e. customers switching their providers by mining through web interactions, customer […]
Continue readingWe all heard of many horrors of poor data quality. Companies with millions of records with “(000)000-0000” as customer contact numbers, “99/99/99” as date of purchase, 12 different gender values, shipping addresses with no state information etc. The cost of ‘dirty data’ to enterprise and organizations is real. For example, US Postal Service estimated that […]
Continue readingJill Dyche, currently Vice President of Best Practices at SAS Institute, is fairly well known in the data science arena. Prior to SAS Institute, Jill founded Baseline Consulting which was acquired by SAS Institute few years ago. I came across Jill personally when she responded to one of my published articles even though I know […]
Continue readingI recently contributed to a book ‘Data Management and Governance Services: Simple and Effective Approaches’ by Tejasvi Addagadda. Here is my review of the book. When Tejasvi first approached me to contribute to his book ‘Data Management and Governance Services’, what intrigued me was the scope of the book. With so much buzz about big […]
Continue readingHer twitter handle gives it away. She is a self-proclaimed “data nerd”. Typically, the word ‘she’ and ‘data nerd’ don’t go hand in hand. But Carla Gentry is not your typical woman. Carla, currently a data scientist at Talent Analytics Corporation, was a high school drop out. Married at 17, had her first kid at […]
Continue readingWell, the world is done watching 2016 US Presidential elections and many are going through withdrawal symptoms. Without passing a judgment on the process or outcome of the elections, I’d like to use the learnings from the US presidential elections to relate them to Data Governance. We all know that politics is everywhere, in our […]
Continue reading