Master data extract sample

Use the InfoSphere® MDM master data extract sample in InfoSphere DataStage® to extract data from an MDM database to external files. The extracted information can then be used with reporting and analytical systems. Often this process is used to view incremental data changes that occurred since the last extract. You can download the sample from the IBM® Samples and Assets site for InfoSphere MDM.

The master data extract process is an outbound set of actions that run member get (MemGet) interactions on the source data to retrieve either a golden view or member-level view of members. The basic query of a master data extract gets a list of member record numbers (memrecnos) and entity record numbers (entrecnos). The InfoSphere DataStage MDM Connector stage uses the MDM MemGet interaction to retrieve data.

The extract process can do incremental updates that are based on the last time that the master data extract job ran. The process uses audit record numbers (audrecnos) and record modified time stamps (recmTime) for the incremental updates. An extract can be configured to remove duplicate entities and write the output to a target file or database.

The sample uses the default party model, where the entity type is mdmper, member type is Person, and the filtering attributes are MEMHEAD, PERLEGALNAME, and PERSSN. If your implementation uses a different model, you must update these items in the sample job to match your model.

Note: Before you can use the master data extract sample, you must have your virtual MDM project metadata exported to InfoSphere Information Server. See Exporting virtual MDM metadata for use in InfoSphere DataStage.


Last updated: 23 October 2014