Big Data

What Is Big Data?

When I first heard the term Big Data few years ago, I didn’t think much of it. Soon after, Big Data started appearing in many of my conversations with many of my tech friends. So when I met this Mr. Know It All consultant, I asked him ‘What is Big Data?’. He looked at me […]

Continue reading
Big Data Name

Who Came Up With The Name Big Data?

Big Data has truly come of age in 2013 when Oxford English Dictionary introduced the term “Big Data” for the first time in its dictionary. That of course begs the question ‘When was the term Big Data first used and Why?’. My curiosity led me to lot of research material but I relied mostly on […]

Continue reading
Master Data Management

Master Data Management (MDM) Key Definitions

Master Data Management (MDM) as a practice has been around for at least a decade but still there is plenty of confusion about what it means and why it is important.  It is important to get grounded on what is Master Data and what is Master Data Management (MDM) before we dig into why it […]

Continue reading
Data Strategy components

Data Strategy Components

In this series on Data Strategy, we covered 5 ‘W’s of Data Strategy and ‘How To Deliver Data Strategy In 4 Steps’. In this article, we’ll cover the various components of Data Strategy. I believe that Data Strategy document should include all or at least some of these components: Background / Context: This section should […]

Continue reading
Data Quality Framework

Data Quality Framework

Much has been written about Data Quality (DQ) in the broader context of Data / Information Management and most of the practitioners can recite it’s dimensions (accuracy, completeness, timeliness, uniqueness, consistency, timeliness etc.), DQ assessment / profiling, and step-by-step approach to enhancing DQ. Unlike, Data Governance though, there hasn’t been much about Data Quality Framework […]

Continue reading
data governance definition

What is Data Governance?

As we hear more and more about Data Governance, the rank and file begin to ask the fundamental question of ‘What is data Governance?” It is probably better to start with Governance. As per the Oxford Dictionary, Governance is the act of controlling, influencing, or regulating a person, action, or course of events. Applying this […]

Continue reading
Data mining examples

Data Mining Examples

Data mining examples can be found in many industries and across many functions. Some data mining examples are listed below in: Telecommunications, Retail, Credit card companies, E-commerce, Human resources, educational institutions, crime agencies. Telecom service Providers Phone and telecom providers attempt to predict ‘churn’ i.e. customers switching their providers by mining through web interactions, customer […]

Continue reading
Data quality

Data Quality – Simple 6 Step Process

We all heard of many horrors of poor data quality. Companies with millions of records with “(000)000-0000” as customer contact numbers, “99/99/99” as date of purchase, 12 different gender values, shipping addresses with no state information etc. The cost of ‘dirty data’ to enterprise and organizations is real. For example, US Postal Service estimated that […]

Continue reading

Jill Dyche – Data Science Influencer

Jill Dyche, currently Vice President of Best Practices at SAS Institute, is fairly well known in the data science arena. Prior to SAS Institute, Jill founded Baseline Consulting which was acquired by SAS Institute few years ago. I came across Jill personally when she responded to one of my published articles even though I know […]

Continue reading

Book Review: Data Management and Governance Services

I recently contributed to a book ‘Data Management and Governance Services: Simple and Effective Approaches’ by Tejasvi Addagadda. Here is my review of the book. When Tejasvi first approached me to contribute to his book ‘Data Management and Governance Services’, what intrigued me was the scope of the book. With so much buzz about big […]

Continue reading