PM95199: SOME REDUCE TASKS OF A PLATFORM SYMPHONY MAPREDUCE JOB COULD BE STUCK DUE TO CONNECTION TIMEOUT

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

APAR status

  • Closed as program error.

Error description

  • When running big scale Platform Symphony MapReduce jobs, you can
    observe some reduce tasks hanging. The reason is connection
    timeout failures could break the communication between Scheduler
     and reducer tasks, and thus a reducer task just waits forever
    in fetching stage because it can not receive any notification of
     other map tasks finish.
    

Local fix

  • Find the job with hanging reducer tasks, suspend the job and
    then resume the job using PSMR commands:
    
    soamcontrol session suspend MapReduce6.1:<jobid>
    soamcontrol session resume MapReduce6.1:<jobid>
    
    All hanging tasks would get retry.
    

Problem summary

  • See Error Description
    

Problem conclusion

  • This problem is fixed in Version 2.1.0.1 and later fix packs.
    

Temporary fix

Comments

APAR Information

  • APAR number

    PM95199

  • Reported component name

    INFO BIGINSIGHT

  • Reported component ID

    5725C0900

  • Reported release

    210

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2013-08-15

  • Closed date

    2013-09-11

  • Last modified date

    2013-09-11

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    INFO BIGINSIGHT

  • Fixed component ID

    5725C0900

Applicable component levels

  • R210 PSY

       UP



Rate this page:

(0 users)Average rating

Document information


More support for:

InfoSphere BigInsights

Software version:

210

Reference #:

PM95199

Modified date:

2013-09-11

Translate my page

Machine Translation

Content navigation