The following is some information to consider before loading a
table in a multi-partition database:
- Familiarize yourself with the load configuration options by using
the utility with small amounts of data.
- If the input data is already sorted, or in some chosen order,
and you want to maintain that order during the loading process, only
one database partition should be used for distributing. Parallel distribution
cannot guarantee that the data is loaded in the same order it was
received. The load utility chooses a single partitioning agent by
default if the anyorder modifier is not specified
on the LOAD command.
- If large objects (LOBs) are being loaded from separate files (that
is, if you are using the lobsinfile modifier through
the load utility), all directories containing the LOB files must be
read-accessible to all the database partitions where loading is taking
place. The LOAD lob-path parameter must
be fully qualified when working with LOBs.
- You can force a job running in a multi-partition database to continue
even if the load operation detects (at startup time) that some loading
database partitions or associated table spaces or tables are offline,
by setting the ISOLATE_PART_ERRS option to SETUP_ERRS_ONLY
or SETUP_AND_LOAD_ERRS.
- Use the STATUS_INTERVAL load configuration option
to monitor the progress of a job running in a multi-partition database.
The load operation produces messages at specified intervals indicating
how many megabytes of data have been read by the pre-partitioning
agent. These messages are dumped to the pre-partitioning agent message
file. To view the contents of this file during the load operation,
connect to the coordinator partition and issue a LOAD QUERY command
against the target table.
- Better performance can be expected if the database partitions
participating in the distribution process (as defined by the PARTITIONING_DBPARTNUMS option)
are different from the loading database partitions (as defined by
the OUTPUT_DBPARTNUMS option), since there is less
contention for CPU cycles. When loading data into a multi-partition
database, invoke the load utility on a database partition that is
not participating in either the distributing or the loading operation.
- Specifying the MESSAGES parameter in the LOAD command
saves the messages files from the pre-partitioning, partitioning,
and load agents for reference at the end of the load operation. To
view the contents of these files during a load operation, connect
to the desired database partition and issue a LOAD QUERY command
against the target table.
- The load utility chooses only one output database partition on
which to collect statistics. The RUN_STAT_DBPARTNUM database
configuration option can be used to specify the database partition.
- Before loading data in a multi-partition database, run the Design
Advisor to determine the best partition for each table. For more
information, see The Design Advisor.
Troubleshooting
If the load utility is hanging, you can: