ksheelak's blog

Data Profiling and Data Cleansing

This post describes the approach used for data migration and traceability to source systems, including the use of data profiling and cleansing as well as ETL tools.

Data profiling generally involves four tasks, performed before starting an IT project that affects preexisting databases. The first task inventories the data assets (tables and their column attributes). The second task assesses the quality and complexity of the inventoried data assets. The third task cleanses the data. The fourth task stages the data for extraction into the data warehouse system.
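
The four tasks can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the method of any particular ETL tool: it uses an in-memory SQLite database, a made-up customers table, and hypothetical helper names (inventory, assess, cleanse_and_stage). The cleansing rule (trim whitespace, turn blanks into NULL), the stg_ prefix, and the source_rowid column are likewise assumptions, chosen to show how a staged row can be traced back to its source row.

```python
import sqlite3

def inventory(conn):
    """Task 1: list each table with its columns and declared types."""
    tables = [r[0] for r in conn.execute(
        "SELECT name FROM sqlite_master WHERE type='table' ORDER BY name")]
    return {t: [(c[1], c[2]) for c in conn.execute(f'PRAGMA table_info("{t}")')]
            for t in tables}

def assess(conn, table, columns):
    """Task 2: per-column row count, missing (NULL or blank) count, distinct count."""
    report = {}
    for name, _type in columns:
        query = f'''
            SELECT COUNT(*),
                   SUM(CASE WHEN "{name}" IS NULL OR TRIM("{name}") = ''
                            THEN 1 ELSE 0 END),
                   COUNT(DISTINCT "{name}")
            FROM "{table}"
        '''
        total, missing, distinct = conn.execute(query).fetchone()
        report[name] = {"rows": total, "missing": missing or 0, "distinct": distinct}
    return report

def cleanse_and_stage(conn, table, text_columns):
    """Tasks 3 and 4: copy trimmed, blank-to-NULL values into a staging table,
    keeping the source rowid so each staged row is traceable to its origin."""
    cols = ", ".join(f'NULLIF(TRIM("{c}"), \'\') AS "{c}"' for c in text_columns)
    staging = f"stg_{table}"
    conn.execute(f'DROP TABLE IF EXISTS "{staging}"')
    conn.execute(f'CREATE TABLE "{staging}" AS '
                 f'SELECT rowid AS source_rowid, {cols} FROM "{table}"')
    return staging

# Hypothetical sample data standing in for a preexisting source database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, name TEXT, email TEXT)")
conn.executemany("INSERT INTO customers VALUES (?, ?, ?)", [
    (1, "  Alice ", "alice@example.com"),
    (2, "Bob", ""),
    (3, None, "carol@example.com"),
])

assets = inventory(conn)                                   # task 1
report = assess(conn, "customers", assets["customers"])    # task 2
staging = cleanse_and_stage(conn, "customers", ["name", "email"])  # tasks 3 and 4
rows = conn.execute(
    f'SELECT * FROM "{staging}" ORDER BY source_rowid').fetchall()

print(assets)
print(report)
print(rows)
```

In a real migration the assessment report would normally drive which cleansing rules are written, and the staging table would live in a separate schema that the warehouse load reads from. The sketch keeps everything in one database only to stay self-contained.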