IBM Support

IT22914: XMLPARSE FAILS WITH LARGE XML FILES.

 

APAR status

  • Closed as fixed if next.

Error description

  • When processing large XML files, the XMLParse operator fails
    with the following error:
    ERROR #splapptrc,J[4],P[4],PM1015.XMLParser1015,spl_operator
    M[XMLParser1015.cpp:error:110]  - CDISR5025E: The following
    error occurred during XML parsing: internal error: Huge input
    lookup.
    

Local fix

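  • The workaround is to use a smaller blockSize in the FileSource
    operator invocation.  Set it to 9000u*1024u so that large XML
    files parse successfully.
    For example, in the following invocation, lower blockSize from
    10000u*1024u to 9000u*1024u: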
    stream<blob dataBlob, rstring fName> FileLoadedFromFS =
        FileSource(DirFileScanned) {
            param format      : block;
                  blockSize   : 10000u*1024u;
                  compression : gzip;
                  parsing     : fast;
            output FileLoadedFromFS :
                  fName = FileName();
        }
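
One possible reading of why the smaller value helps, offered as an assumption rather than something this APAR states: the "Huge input lookup" message and the XML_PARSE_HUGE option both appear in libxml2, whose default lookup limit (XML_MAX_LOOKUP_LIMIT) is 10,000,000 bytes. If XMLParse hands each block to such a parser, a 10000u*1024u block is above that limit and a 9000u*1024u block is below it. The short Python sketch below only does that arithmetic. The names in it are hypothetical, and it does not call Streams or any XML library.

```python
# Assumption: the parser behind XMLParse enforces libxml2's default
# lookup limit of 10,000,000 bytes unless XML_PARSE_HUGE is enabled.
ASSUMED_LOOKUP_LIMIT_BYTES = 10_000_000


def block_size_bytes(multiplier: int) -> int:
    """Evaluate an SPL blockSize expression of the form <multiplier>u*1024u."""
    return multiplier * 1024


def exceeds_assumed_limit(size_bytes: int,
                          limit: int = ASSUMED_LOOKUP_LIMIT_BYTES) -> bool:
    """Return True when a single block is larger than the assumed limit."""
    return size_bytes > limit


if __name__ == "__main__":
    # Compare the value from the example invocation with the workaround value.
    for multiplier in (10000, 9000):
        size = block_size_bytes(multiplier)
        verdict = "above" if exceeds_assumed_limit(size) else "within"
        print(f"{multiplier}u*1024u = {size} bytes, {verdict} the assumed limit")
```

If the assumption holds, any blockSize below roughly 9765u*1024u would stay within the limit, which is consistent with 9000u*1024u working as a workaround.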
    

Problem summary

  • Fixed in Streams version 4.2.1.3.  We added a new XMLParse
    parameter called xmlParseHuge.
    xmlParseHuge is an optional boolean parameter for the XMLParse
    operator.  The default is false.  To enable XML_PARSE_HUGE, an
    operator developer adds the parameter to the operator
    invocation and sets it to true.
    For example:
    stream<TransactiveSignal_type> B = XMLParse(Lines) {
        param trigger          : "/tsns:TransactiveSignal";
              ignoreNamespaces : true;
              xmlParseHuge     : true;
    }
    

Problem conclusion

Temporary fix

Comments

APAR Information

  • APAR number

    IT22914

  • Reported component name

    STREAMS

  • Reported component ID

    5724Y95IS

  • Reported release

    420

  • Status

    CLOSED FIN

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2017-10-24

  • Closed date

    2017-10-25

  • Last modified date

    2017-10-25

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Modules/Macros

  • XMLParse
    

Fix information

Applicable component levels

  • R421 PSY

       UP

Document information

More support for: IBM Streams

Software version: 420

Reference #: IT22914

Modified date: 25 October 2017