IBM United States
Software Announcement 203-021
February 4, 2003

Preview: IBM DB2 Information Integrator (Beta Program) Integrates Diverse Information Across and Beyond the Enterprise

 ENUS203-021.PDF (30KB)


At a Glance

With the IBM DB2 Information Integrator family of products you can:

  • Choose the data access strategy to match business value
    • Centralize information for availability or performance
    • Manage distributed access to required data
  • Integrate diverse and distributed data, without moving the data or changing the platforms
    • Access disparate data as though it were a single source
  • Progress more quickly at a lower cost
    • Increase developer productivity for integrating diverse, distributed, and real-time data
    • Deploy current skills over a greater range of project requirements
  • Rely on proven IBM technology
    • Trust 25 years of data management research and development
    • Integrate with complementary technologies
  • Protect your current and future IT investments
    • Build on industry standards
    • Reduce the modification and replacement of systems to make them work together

Overview

IBM is introducing the IBM DB2® Information Integrator family of products. It provides the foundation for a strategic information integration framework to help customers access, manipulate, and integrate diverse, distributed, and real-time data. The family consists of:

  • IBM DB2 Information Integrator V8.1 — A new product based on IBM DB2 technology
  • IBM DB2 Information Integrator for Content V8.2 — Formerly IBM Enterprise Information Portal

Both products enable customers to abstract a common data model across data and content sources and to access and manipulate them as though they were a single source. Each supports a user community defined primarily by the data they access and the development community that they support. Both products are now available in beta programs.

Key Prerequisites

The offerings will be available for these operating systems:

  • Microsoft™ Windows NT®
  • Microsoft Windows® 2000
  • AIX®
  • Hewlett-Packard HP-UX
  • Sun Solaris
  • Linux on Intel®

Preview Announcements

Preview announcements provide insight into IBM plans and direction. Availability, prices, ordering information, and terms and conditions will be provided when the product is announced.

Description

IBM DB2 Information Integrator Family Benefits

With the IBM DB2 Information Integrator family of products you can:

Choose the data access strategy to match business value — Consolidating data local to the application simplifies application development and provides better data access performance and availability, but it also introduces the cost of moving data, storing it, and managing its synchronization. IBM DB2 Information Integrator V8.1 supports centralized access by providing replication and caching to support performance and availability requirements.

Alternatively, when there is wide diversity in the data accessed, or when the data is owned outside the enterprise, it is impractical or too expensive to replicate it. The IBM DB2 Information Integrator family lets you leave distributed data where it is, yet access it transparently as though it were a single source. This approach maps well to the predominantly read-access scenarios common to enterprise-wide reporting, business intelligence, portal infrastructures, e-commerce applications, customer care, and trading partner linkage requirements.

Integrate data and content without moving the data or changing the platform — by accessing access diverse and distributed data as if it were a single source, no matter where it resides. It provides a broad range of data source access out-of-the-box, covering structured and unstructured data across and beyond the enterprise. Sources currently include relational databases, flat files, XML documents, spreadsheets, content repositories, Web sites, Web services, and message queues. And, you can extend access to proprietary or virtually any data source.

Make more progress, more quickly, and at a lower cost — With the IBM DB2 Information Integrator family of products you can develop the new generation of composite applications that require efficient integration of disparate data. Based on the product selection, developers can use either SQL or an object-oriented access. And, now you have a practical way to integrate diverse relational data and combine it, for example, with unstructured data from content repositories, the World Wide Web, and spreadsheets. Developers can use SQL to create joins or unions over the data, to compute statistical functions, to aggregate data, to use online analytical processing (OLAP) features, to compose or transform XML documents, among other choices — speeding project deployment, leveraging existing skills over a broader range of projects, and reducing ongoing maintenance costs.

Rely on proven IBM technology — These products, developed from proven IBM technologies, are used by customers today and are based on 25 years of data management research and development. They provide a scalable, cross-platform infrastructure that integrates with the IBM WebSphere® business integration portfolio including IBM WebSphere Business Integrator, IBM WebSphere Portal Server, IBM WebSphere MQ, and IBM WebSphere Studio for a complete business integration infrastructure.

Protect your current and future Information Technology (IT) investments — with products based on industry standards such as SQL, XML, Java™, and Web services to provide broad interoperability. The access-in-place capabilities reduce rewriting or replacing systems to make them work together. These products provide a strategic, reusable, and open information integration platform that lets customers choose the right approach for their business.

IBM DB2 Information Integrator Product Family

IBM DB2 Information Integrator is a new family of products from IBM which provides the foundation for a strategic information integration framework to help customers access, manipulate, and integrate diverse, distributed, and real-time data. The family consists of IBM DB2 Information Integrator V8.1 and IBM DB2 Information Integrator for Content V8.2 . Both products enable customers to abstract a common data model across diverse and distributed data and content sources and to access and manipulate them as though they were a single source. Each product supports a user community defined primarily by the data they access and the development community they support. The product family supports the predominantly read-access scenarios common to enterprise-wide reporting, knowledge management, business intelligence, portal infrastructures, e-commerce applications, customer care, and trading partners' linkage requirements.

IBM DB2 Information Integrator V8.1 — IBM DB2 Information Integrator V8.1 is targeted primarily at the application development community familiar with relational database application development. Applications that use SQL or tools that generate SQL (integrated development environments, reporting, and analytical tools) can now access, integrate, and manipulate distributed and diverse data through a federated data server. This product is most appropriate for projects whose primary data sources are relational data augmented by other XML, Web, or content sources. IBM DB2 Information Integrator core capabilities include:

A Federated Data Server

  • Administrators configure data source access and define integrated views across diverse and distributed data
    • Administrators use integrated graphical tools to configure access to source data, representing that data as logical tables in the federated data server.
    • Integrated views can be composed across these sources using standard SQL view definitions and expressions.
  • Data sources include:
    • Relational: IBM DB2, Informix™ Dynamic Server, Informix Extended Parallel Server, Microsoft SQL Server, Oracle, Sybase SQL Server, Sybase Adaptive Server Enterprises, Teradata, and Open Database Connectivity (ODBC) sources.
    • Nonrelational: WebSphere MQ message queues, Web services, Microsoft Access, Microsoft Excel spreadsheets, flat files, XML documents, LDAP directories, and data sources accessible by Entrez, Blast, HMMer, BioRS, Documentum, and IBM Lotus® Extended Search. IBM Lotus Extended Search provides access to multiple data stores, including Domino™, IBM DB2 Information Integrator for Content sources (such as IBM Content Manager, IBM Content Manager OnDemand, and IBM ImagePlus®), relational databases (IBM DB2, Oracle, Sybase, Microsoft SQL Server, Microsoft Access), Lotus Domino.Doc®, Microsoft Index Server, Microsoft Site Server, Microsoft Exchange, and over 18 Web search sites.
    • A developer toolkit is provided to add access to other sources.
  • Applications can query or search across the aggregated data sources as if they were in a single database.
    • The query is expressed using standard SQL. SQL expressions may be used to transform the data for business analysis or data exchange.
    • Text search semantics may be used within the query. A fast, versatile, and intelligent full text search capability is provided across all relational data sources, including those that either don't support native text search or don't provide as broad a range of text search capability. A large set of search operations is supported such as Boolean, wildcard, free-text, fuzzy search, proximity search for words within the same sentence or paragraph, or search within XML documents.
    • The query may produce standard SQL answer sets or XML documents, which can be:
      • Generated from the federated source data to facilitate interchange
      • Automatically validated against DTDs or XML schemas
      • Transformed using XSL for flexible presentation
    • Results can be made available to the rest of the organization by publishing them to a WebSphere MQ message queue using built-in functions
    • The federated server uses a cost-based distributed query optimization to select the best access paths for higher query performance. It leverages intelligence about optimizing access to the data sources provided by the data source wrapper, by database statistics, and by the administrator.
    • The administrator can define data caches over the federated data (called Materialized Query Tables) to improve query responsiveness and availability for read-only access. If cache use is enabled by the application, the optimizer can automatically redirect the query to exploit the cache. Cache refresh is managed by the administrator.
  • Applications can access the server by either traditional database or Web service clients

A Replication Server for Mixed Relational Databases

  • Customers can replicate data between mixed relational data sources. DB2, Informix, Microsoft, Oracle, and Sybase are supported as replication sources and targets; Teradata is supported as a replication target.
  • Customers can configure a variety of topologies, latency, and consistency characteristics:
    • The replication server supports distribution (moving data from one database to many) and consolidation (moving data from many databases to one) scenarios.
    • Transformation can be applied in-line with the data movement via standard SQL expressions or stored procedure execution.
    • Data movement can be automated to occur on a specific schedule, at designated intervals, continuously, or as triggered by events.
    • Data movement can be managed table-at-a-time (such as for warehouse loading during batch windows) or with transaction consistency (for data that is never off-line).

IBM DB2 Information Integrator V8.1 is currently available in beta. If you are interested in participating in the beta program, contact your IBM representative for additional information.

IBM DB2 Information Integrator for Content V8.2 — IBM DB2 Information Integrator for Content V8.2 is targeted at the application development community familiar with content management application development. These customers primarily access content management sources but have requirements for additional sources across the enterprise. This product — the new generation of IBM Enterprise Information Portal — is suited for solutions where the developer is a content application developer familiar with content management programming interfaces (and Object Oriented APIs). It is indicated where:

  • Customers need federated access to content sources such as IBM Content Manager, IBM Content Manager OnDemand, ImagePlus, and Lotus Domino.doc. The application may also need integrated access across non-IBM content repositories such as FileNET Panagon Image Services, or other content sources. Enterprise content may be augmented with relational databases including DB2, Oracle, and Open Database Connectivity (ODBC) sources.
  • Customers need sophisticated analysis of the textual information in their applications, content repositories, e-mail repositories, databases and file systems. To leverage this information, it must be indexed, summarized, and organized by content or classified according to a taxonomy. Text analytics gathers and summarizes information about individual documents as well as groups of documents:
    • Language identification determines the language of each document, important for international businesses.
    • Information extraction identifies information contained in the document and classifies it into meaningful entities such as names of people or organizations, domain technical terms, abbreviations, dates, numbers, or currency amounts.
    • Categorization assigns documents into pre-existing categories based on a taxonomy predefined by the firm (product line or competitors).
    • Information clustered into groups of related documents automatically based on content. This differs from categorization as it does not require predefined classes.
    • Summarization extracts the most relevant sentences from each document to create a document synopsis.
  • Customers need to search across a multitude of other information sources, including file systems, Lotus Domino databases, Microsoft Exchange Servers, and Web Search sites. To access these data sources, IBM DB2 Information Integrator for Content V8.2 integrates with IBM Lotus Extended Search, increasing the range of the data accessible by the application.
  • An integrated workflow component is optional, can involve including case management with data residing in IBM Content Manager repositories and other supported data sources. IBM DB2 Information Integrator for Content V8.2 enables all accessed information to be included in workflow processes. An advanced workflow application provides a graphical workflow builder to easily define the advanced workflow processes across the enterprise.

IBM DB2 Information Integrator for Content V8.2 (formerly IBM Enterprise Information Portal) is currently available in beta. If you are interested in participating in the beta program, contact your IBM representative for additional information.

Global Financing

IBM Global Financing offers competitive financing to credit-qualified customers to assist them in acquiring IT solutions. Our offerings include financing for IT acquisitions, including hardware, software, and services, both from IBM and other manufacturers or vendors. Offerings (for all customer segments: small, medium, and large enterprise), rates, terms, and availability can vary by country. Contact your local IBM Global Financing organization or visit the Web at:

Product Positioning

The IBM DB2 Information Integrator family of products strengthens IBM's industry leading WebSphere business integration portfolio.

Recognizing the market requirement for structure and clarity, IBM introduced a framework for complete business integration. Fundamentally, integration revolves around people, processes, applications, and information. Different integration approaches are necessary for different classes of integration problems. For example, online customer orders must be enabled through an application, not a database application programming interface (API). Business rules embedded in application programming logic protect the database from inappropriate use.

Alternatively, the application that responds with a projected delivery date could access correlated information across manufacturing and shipping databases, and could depend on the data management system to handle the complex joins and mask differences between the data sources. As in this example, the best solution often uses several approaches, emphasizing the need for moving easily among technologies. While competitors may provide only niche integration, IBM can deliver complete integration with offerings that work together smoothly. With over 30 years experience in building and evolving their base offerings for middleware and enabling these offerings to work together in thousands of different business environments, IBM has identified five types of integration based on an open services infrastructure that can be used together or separately to address these issues.

  • Information integration enables the integration of diverse forms of business information across and beyond the enterprise. Instead of sequentially accessing individual information sources, information integration enables coherent search, access, replication, and transformation over a unified view of information assets to meet business needs.
  • Application connectivity allows applications to share and leverage information. Business assets are efficiently connected to allow information across disparate systems to be available across the enterprise.
  • Process integration takes application connectivity to the next level by allowing the business to change how they operate by customizing the modeling, automation, and monitoring of processes across people and heterogeneous systems, both inside and outside the enterprise.
  • User interaction is about creating a single, interactive user experience across applications and devices.
  • Build to integrate focuses on building and deploying new integration-ready applications that leverage Web services and existing assets. Instead of traditional silos, new solutions must enable immediate integration with existing software assets.

These approaches may be used together or separately to address business integration challenges. Information integration, as delivered in the IBM DB2 Information Integrator family of products , is an enabling technology for the other approaches providing integrated, declarative access to diverse data.

Reference Information

Refer to the Statement of General Direction section in these announcements:

Preview Announcement of IBM DB2 Universal Database® V8.1

  • Software Announcement 202-171 , dated July 23, 2002

Worldwide Announcement of IBM DB2 Universal Database V8.1 for Linux, UNIX®, and Windows

  • Software Announcement 202-214 , dated September 17, 2002

The DB2 federated features previously available as DB2 Relational Connect V7.2 and DB2 Life Sciences Data Connect V7.2 have been enhanced and will be reintroduced in IBM DB2 Information Integrator V8.1.

Product Upgrades

IBM DB2 Information Integrator V8.1 is the successor product for IBM DB2 Relational Connect, IBM DB2 Life Sciences Data Connect, and IBM DB2 DataJoiner®. Customers who have a software maintenance agreement for IBM DB2 Relational Connect, IBM DB2 Life Sciences Data Connect, or IBM DB2 DataJoiner will be entitled to upgrade to an IBM DB2 Information Integrator V8.1 product.

Trademarks

 
Informix is a trademark of International Business Machines Corporation in the United States or other countries or both.
 
DB2, WebSphere, AIX, ImagePlus, DB2 Universal Database, and DataJoiner are registered trademarks of International Business Machines Corporation in the United States or other countries or both.
 
Intel is a registered trademark of Intel Corporation.
 
Microsoft is a trademark of Microsoft Corporation.
 
Windows NT and Windows are registered trademarks of Microsoft Corporation.
 
Java is a trademark of Sun Microsystems, Inc.
 
UNIX is a registered trademark of the Open Company in the United States and other countries.
 
Domino is a trademark of Lotus Development Corporation and/or IBM Corporation.
 
Lotus and Domino.Doc are registered trademarks of Lotus Development Corporation and/or IBM Corporation.
 
Other company, product, and service names may be trademarks or service marks of others.