数据质量常见问题

Data Quality Issue Sample Data Problem
Standard Are data elements consistently defined and understood? Gender code =M,F,U in one system and Gender code = 0, 1, 2 in another system
Complete Is all necessary data present? 20% of customers’ last name is blank, 50% of zip-codes are 99999
Accurate Does the data accurately represent reality or a verifiable source? A supplier is listed as ‘Active’ but went out of business six years ago
Valid Do data values fall within acceptable ranges Salary values should be between 60,000 – 12,000
Unique Data appears several times Both John Ryan and Jack Ryan appear in the system – are they the same person?

 

DQS…

posted @ 2011-11-23 15:51  AOT  阅读(256)  评论(0编辑  收藏  举报