Encode Stage

The Encode stage is a processing stage. It encodes a data set using a UNIX encoding command, such as gzip, that you supply.

The stage converts a data set from a sequence of records into a stream of raw binary data. The companion Decode stage converts the data stream back into a data set (see c_deeref_Decode_Stage.html).
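The encode and decode round trip can be illustrated outside the product with a minimal Python sketch. It does not show how the stage is implemented. It assumes only that the gzip command is available on the PATH. The helper run_filter and the sample records are hypothetical.

```python
import subprocess


def run_filter(command, data):
    """Pipe bytes through a UNIX command and return its standard output."""
    result = subprocess.run(command, input=data, capture_output=True, check=True)
    return result.stdout


# Hypothetical records; a real data set would carry typed columns.
records = [b"1,alice", b"2,bob", b"3,carol"]
raw = b"\n".join(records) + b"\n"

# Encode: the sequence of records becomes an opaque binary stream.
encoded = run_filter(["gzip", "-c"], raw)

# Decode: the companion step turns the stream back into records.
decoded = run_filter(["gzip", "-dc"], encoded)
restored = decoded.splitlines()
```

After encoding, the record boundaries are no longer visible in the byte stream. This is one way to see why column-based processing cannot be applied until the data has been decoded.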

An encoded data set is similar to an ordinary one, and can be written to a data set stage. You cannot use an encoded data set as an input to stages that perform column-based processing or reorder rows, but you can input it to stages such as Copy. You can view information about the data set in the data set viewer, but not the data itself. You cannot repartition an encoded data set, and you will be warned at runtime if your job attempts to do so.

As the output is always a single stream, you do not have to define meta data for the output link.

[Figure: An Encode stage taking data, encoding it, and writing it to a data set]

The stage editor has three pages:

  • Stage Page. This is always present and is used to specify general information about the stage.
  • Input Page. This is where you specify the details about the single input data set that the stage encodes.
  • Output Page. This is where you specify details about the processed data being output from the stage.