You use InfoSphere® DataStage® to develop jobs, which process and transform your data. You can administer, manage, deploy, and reuse these jobs to integrate data across many systems throughout your organization.
Your organization can use InfoSphere DataStage to complete the following tasks:
By using parallel processing capabilities of multiprocessor hardware platforms, you can scale transformation jobs to address any demands, large or small. During development, the deployment configuration automatically adds the degree of parallelism that you specify. By making a simple change to the configuration file, you can change your application from 2-way processing to 32-way processing to 128-way processing.
Data integration specialists use the rich user interface for all design work, including workflow, data integration, and data quality. Prebuilt transformation functions can dragged to a design, making it easy to determine the flow of information and the transformations that occur. Any portion of the design can be shared and reused across the data integration landscape, maximizing reuse and productivity.
A nearly unlimited number of heterogeneous data sources and targets are supported, including text files, complex data structures in XML, enterprise resource planning (ERP) systems such as SAP and PeopleSoft, nearly any database, web services, and business intelligence (BI) tools like SAS.
Real-time data integration support captures messages from Message Oriented Middleware (MOM) queues using JMS or WebSphere® MQ adapters to combine data into operational and historical analysis perspectives. By using InfoSphere DataStage with InfoSphere Information Services Director, data integration jobs can be deployed with Java™ Message Services, web services, or other services. This service-oriented architecture (SOA) enables numerous developers to share complex data integration processes without having to understand the steps contained in the services.
You can use the InfoSphere DataStage Operations Console to access information about your jobs, job activity, and system resources for each of your InfoSphere Information Server engines. The Operations Console is useful for troubleshooting failed job runs, improving job run performance, and actively monitoring your engines.