Blog: data profiling

Showing blogs: 12 of 2

The Necessity of Data Profiling

Sep 21 2009

Data profiling is the process of analyzing data sets to gather information on data types, statistics, patterns, relationships and other attributes.  This process is a critical task often scheduled just prior to the start of a design phase for a new database initiative.  Typically, data profiling is seen primarily as an input for data modeling and ETL design to identify data types and entity relationships.  The value of thorough data profiling actually extends far beyond these tasks and may help steer the direction of the project and ensure its success.

Below are some of the ways data profiling can add value during a new database initiative: 

Continue Reading

Data Quality - Three Essential Links

Jun 24 2009

  1. Magic Quadrant for Data Quality Tools (Gartner) -- A good starting point if you want to become familiar with the main vendors in the data quality space.
  2. Surprising Value of Data Profiling (Kimball - PDF) -- Ralph Kimball provides a concise explanation about the value of data profiling, one of the essential components of a data quality initiative.
  3. 7 Sources of Poor Data Quality (Melissa DATA) -- William Mcnight identifies seven sources of data quality issues:
    • Entry quality
    • Process quality
    • Identification quality
    • Integration quality
    • Usage quality
    • Aging quality
    • Organizational quality

Continue Reading