The predefined data quality rule definitions are available in the Published Rules folder of the Data Quality workspace.
If you upgrade to Version 9.1, you can use the predefined data quality rule definitions, but you must first import them from the \IBM\InformationServer\Clients\Samples\Information Analyzer directory on the client tier computer. If you are using Version 8.7, you can also use the predefined data quality rule definitions, but you must first import them from IBM developerWorks. For more information, see Using pre-built rule definitions with IBM® InfoSphere® Information Analyzer.
When viewed from the perspective of designing data rules, the predefined data rule definitions can serve several purposes: as educational examples, as accelerators to assess your data quality, as templates, or as models for development.
The predefined data rule definitions are intended to reduce the effort of identifying data quality issues in many common information domains and conditions. Some common information domains are keys, national identifiers, dates, country codes, and email addresses. Some common conditions are completeness checks, valid values, range checks, aggregated totals, and equations.
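To make the common conditions listed above concrete, the following minimal sketch expresses each one as a plain Python predicate. It is an illustration only, not the Information Analyzer rule definition syntax. The function names, the country code reference list, the email pattern, and the tolerance value are all assumptions made for this example.

```python
import re
from datetime import date

# Hypothetical reference list of valid country codes (illustrative only).
VALID_COUNTRY_CODES = {"US", "CA", "GB", "DE", "FR"}

# Deliberately simple email pattern for illustration; not a full address validator.
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def is_complete(value):
    """Completeness check: the value is present and not blank."""
    return value is not None and str(value).strip() != ""


def is_valid_country(code):
    """Valid values check: the code appears in the reference list."""
    return code in VALID_COUNTRY_CODES


def is_valid_email(address):
    """Format check for the email address domain."""
    return bool(EMAIL_PATTERN.match(address))


def is_in_range(value, earliest, latest):
    """Range check: the date falls within the inclusive bounds."""
    return earliest <= value <= latest


def totals_match(line_amounts, stated_total, tolerance=0.005):
    """Aggregated totals check: the detail lines sum to the stated total."""
    return abs(sum(line_amounts) - stated_total) <= tolerance


def equation_holds(quantity, unit_price, extended_price, tolerance=0.005):
    """Equation check: quantity * unit_price equals extended_price."""
    return abs(quantity * unit_price - extended_price) <= tolerance
```

Each predicate returns True when a value meets the condition, which mirrors how a data rule classifies a record as meeting or not meeting the rule.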
You can use the predefined data rule definitions as they are to test or assess your data sources and to generate data rules, so that you can begin a detailed data quality assessment sooner.
Data rule definitions can be deployed at different points in the process of quality validation and monitoring. These points include: direct analysis of data quality, use in InfoSphere DataStage® and QualityStage®, or use in other IBM InfoSphere Information Analyzer projects.
For example, if you work in a development environment with test data to ensure your data rules work correctly, you might then need to export those data rules to a production environment for ongoing quality monitoring.
You receive a file every day from an external source. The quality of the data source is often low, which results in problems in other information systems, such as your business reporting system. This daily file currently runs through an InfoSphere QualityStage job to standardize the file and load the output to existing data sources. You want to test the incoming data for completeness by using a set of data rule definitions, and validate the results of the standardized output.
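The completeness test in this scenario can be sketched outside the product as follows. This is a minimal illustration under stated assumptions, not how Information Analyzer or QualityStage runs the check: the column names, the sample records, and the completeness_summary function are hypothetical, and the daily file is assumed to be comma-separated with a header row.

```python
import csv
import io


def completeness_summary(rows, required_fields):
    """Count records that have a non-blank value in every required field.

    Returns (total, passed, failed), where failed is a list of
    (record_number, missing_field_names) tuples.
    """
    total = 0
    failed = []
    for record_number, row in enumerate(rows, start=1):
        total += 1
        # A field is missing when it is absent, empty, or whitespace only.
        missing = [f for f in required_fields if not (row.get(f) or "").strip()]
        if missing:
            failed.append((record_number, missing))
    return total, total - len(failed), failed


# Hypothetical sample standing in for the daily incoming file.
SAMPLE = """customer_id,email,country
1001,ann@example.com,US
1002,,CA
,bob@example.com,
"""

rows = csv.DictReader(io.StringIO(SAMPLE))
total, passed, failed = completeness_summary(
    rows, ["customer_id", "email", "country"]
)
print(f"{passed} of {total} records are complete")
for record_number, missing in failed:
    print(f"record {record_number} is missing: {', '.join(missing)}")
```

Running the same summary against the standardized output of the job, with the fields that the output is expected to contain, gives a simple before-and-after comparison of completeness.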