Data Profiling
Part 9b of Data Quality by Dr. Alvin Ang
2 min read · Jan 13, 2025
This is part of a series on:
Data Profiling = Studying the Profile of your Data and Inspecting it for Issues.
= It is information ABOUT the data… not information FROM the data.
e.g.
Information ABOUT the data:
- 30% of the values in the 3rd column of my dataset are BLANK (NULL)
- the structure of my dataset is (33, 22) = 33 Rows and 22 Columns
- the Data Type of my “Age” column is an object (String)… it should be an “Integer”
versus
Information FROM the data:
- Our Profits Increased by 800% last year!!!
- Our Best Selling Product was our Pineapple Tarts
- Three New Employees joined us and were added to our Payroll System
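The three "information ABOUT the data" checks listed earlier (share of blanks, structure, data type) can be sketched in a few lines of pandas. This is only an illustration: the article does not name a tool, so pandas is an assumption, and the toy dataset, its column names, and its values are all made up for this example.

```python
import pandas as pd

# Hypothetical toy dataset (not from the article): 10 rows, 3 columns.
df = pd.DataFrame({
    "Name": ["Ann", "Ben", "Cal", "Dee", "Eli", "Fay", "Gus", "Hal", "Ivy", "Jon"],
    # Ages typed in as text, so the column is not numeric.
    "Age": ["25", "31", "47", "52", "38", "29", "44", "36", "60", "41"],
    # 3 of the 10 values in this 3rd column are missing.
    "City": ["SG", None, "KL", "SG", None, "JB", "SG", None, "KL", "SG"],
})

# 1) Structure of the dataset: (rows, columns).
shape = df.shape  # (10, 3)

# 2) Percentage of BLANK (NULL) values in each column.
null_pct = df.isna().mean() * 100

# 3) Data type of the "Age" column. Text columns are typically reported
#    as "object" (or a dedicated string dtype), not as an integer.
age_dtype = df["Age"].dtype
age_is_integer = pd.api.types.is_integer_dtype(age_dtype)  # False

print("Shape:", shape)
print("Percent NULL per column:")
print(null_pct)
print("Age dtype:", age_dtype, "| is integer?", age_is_integer)

# One possible fix for the data type issue: convert the text to integers.
age_as_int = pd.to_numeric(df["Age"]).astype("int64")
```

None of these lines says anything about profits or best sellers. They only describe the condition of the dataset itself, which is the point of profiling.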
Data Profiling is ALSO Examining your Data for Issues (or Data Dimensions)…