Link Partitioner Stages

These topics describe how to use a Link Partitioner stage in your job design.

The Link Partitioner stage is an active stage which takes one input and allows you to distribute partitioned rows to up to 64 output links. The stage expects the output links to use the same metadata as the input link.

Partitioning your data enables you to take advantage of a multiprocessor system and have the data processed in parallel. It can be used with the Link Collector stage to partition data, process it in parallel, then collect it together again before writing it to a single target. To really understand the benefits, see IBM InfoSphere DataStage Jobs and Processes to learn how IBM® InfoSphere® DataStage® jobs are run as processes.

The following diagram illustrates how the Link Partitioner stage can be used in a job in this way.

Shows a job with Link Partitioner stages

In order for this job to compile and run as intended on a multiprocessor system you must have interprocess buffering turned on, either at project level using the Administrator client, or at the job level from the Job Properties dialog box.

The temporary files generated by this stage are placed in the directory specified by the TEMP environment variable. Use the Administrator client to set TEMP on a per-project basis.