DOC ID
IBM® InfoSphere®
Information Server 8.1
An Overview
BHAWANI NANDAN PRASAD
SMP - IIM Calcutta & MBA - STRATFORD UNIVERSITY USA, M.TECH. IT
IBM® InfoSphere Information Server Components
IBM® InfoSphere Information Server 8.1 and its
components:
-- IBM Information Server Console
-- IBM Metadata Workbench
-- IBM Business Glossary (BG)
-- IBM Business Glossary Anywhere
-- IBM MetaBrokers and bridges
-- IBM DataStage & QualityStage
-- IBM Information Analyzer
-- IBM Information Server FastTrack
-- IBM Information Services Director
Product framework & History
•IBM® Information Server combines the technologies of the components listed on the previous slide into a single unified platform that
enables companies to understand, cleanse, transform, and deliver trustworthy and context-rich information.
Product Name                        Product History
Information Server Shared Services  New
FastTrack                           New
Metadata Workbench                  MetaStage & Unicorn semantic metadata tools
Business Glossary (BG)              MetaStage Reporting tool
Business Glossary Anywhere          MetaStage Reporting tool
DataStage & QualityStage            DataStage & QualityStage
Information Analyzer                AuditStage & ProfileStage
Information Services Director       RTI / SOA
InfoSphere Information Server high-level architecture
InfoSphere – Shared Services
•InfoSphere provides extensive administrative and reporting facilities that use shared
services and a Web application that offers a common look and feel for all
administrative and reporting tasks.
•Administrative Services
• Security administration
• Licensing administration
• Scheduling administration
• Logging administration
•Reporting Services
•Reporting services manage run time and administrative aspects of reporting for IBM
Information Server. You can create product-specific reports for DataStage,
QualityStage, and Information Analyzer, and cross-product reports for logging,
monitoring, scheduling, and security services. All reporting tasks are set up and run
from a single interface, the IBM Information Server Web console. You can retrieve and
view reports and schedule reports to run at a specific time and frequency.
Metadata services
•By using metadata services, you can access data and do data integration tasks such as analysis, modeling,
cleansing, and transformation.
•Metadata services components
-- Metadata Server
-- Metadata Workbench
-- Business Glossary (BG)
-- Business Glossary Anywhere
-- MetaBrokers and bridges
Metadata Server
•The Metadata Server holds the metadata for all products in Information Server. It is made up of the
database repository, the software domain layer that lets applications access the
repository, and a browser console for administering the Metadata Server.
•The Metadata Server has some HTML reporting capabilities but is
complemented by the following metadata products:
– The Metadata Workbench for advanced metadata reporting.
– The Business Glossary for the management of metadata terms and
definitions.
– The metadata Import/Export tool for importing metadata through
bridges and brokers and exporting metadata through brokers.
•One key advantage of the Metadata Server within IBM Information Server is
that it eliminates the need for external resources for metadata
management and improves project transparency across the project
implementation roles.
•Because all functions of IBM Information Server share common metadata artifacts,
the Metadata Server provides an efficient collaboration environment for managing the completion
of the project and reducing downstream project delivery times.
Metadata Workbench
•Metadata Workbench offers key metadata visualization and exploration capabilities, acting as a control station for metadata within Information Server.
Users of the different product modules of Information Server can use IBM Metadata Workbench to view the metadata and the data assets in the
Information Server metadata repository.
•It provides the following services for the Metadata Server:
•Traceability of information: You can trace information across tools, allowing data elements in reports to be traced back to their sources. This
capability relies on a seamless view across design and operational metadata. (This is end-to-end data flow reporting.)
•Responsiveness to change: You can understand the impact of any change to any piece of information across tools, showing which reports,
services, or source/target databases will be impacted before a source/target data element is changed. (This is impact analysis / dependency analysis of the
data assets.)
•Web-based visualization and navigation of metadata: This feature allows users to use the functionality without expensive client software installation. It
also allows for remote diagnosis and resolution of problems by IT teams.
•Visual depiction of metadata relationships: The visual depiction of the metadata relationships of data lineage and impact analysis makes it easy to find
and understand metadata relationships quickly and allows non-IT users to understand relationships easily.
•Cross-tool impact analysis: This ensures that the complete impact of change beyond a single tool is understood very clearly before a change is made. It
makes IT teams more responsive by reducing the analysis time required before making changes to multiple systems.
•Cross-tool data lineage: This provides an understanding of the complete lineage path of information, including its source, its relations, its destinations, and
what happened to it along the way. This enables business personnel to understand the origins of the information and makes it easier to troubleshoot whenever
a problem arises. It also helps you maximize the value of your IT investments by taking advantage of the scalability, security, manageability, and reliability of the mainframe,
and lets you add mainframe information integration workloads.
•Metadata Stitching: This automatically connects design and operational metadata elements together to form relationships and ensures a complete
understanding of information. It provides IT with the tools to ensure consistency in the metadata view.
•Links business terms to technical information: The reporting aspect links the business terms to the technical information and thus ensures a better
understanding and collaboration between business and IT. Thus it removes barriers between teams to speed project development times.
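Data lineage and impact analysis can both be pictured as traversals of one directed graph of data assets: lineage walks upstream from an asset to its sources, and impact analysis walks downstream to everything that depends on it. The following is a minimal sketch of that idea only. The MetadataGraph class and the asset names are hypothetical and are not part of Metadata Workbench or any IBM API.

```python
from collections import defaultdict, deque


class MetadataGraph:
    """Toy directed graph; an edge source -> target means data flows that way.

    Hypothetical illustration, not the Metadata Workbench data model.
    """

    def __init__(self):
        self.downstream = defaultdict(set)  # asset -> assets it feeds
        self.upstream = defaultdict(set)    # asset -> assets it is fed by

    def add_flow(self, source, target):
        self.downstream[source].add(target)
        self.upstream[target].add(source)

    def _reach(self, start, edges):
        # Breadth-first search over the chosen edge direction.
        seen, queue = set(), deque([start])
        while queue:
            node = queue.popleft()
            for nxt in edges.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        return seen

    def lineage(self, asset):
        """Everything the asset was derived from (upstream)."""
        return self._reach(asset, self.upstream)

    def impact(self, asset):
        """Everything that would be affected if the asset changed (downstream)."""
        return self._reach(asset, self.downstream)


# Made-up assets spanning a source column, a job, a warehouse column, and a report.
graph = MetadataGraph()
graph.add_flow("crm.customers.email", "job:load_customers")
graph.add_flow("job:load_customers", "dw.dim_customer.email")
graph.add_flow("dw.dim_customer.email", "report:churn_dashboard")

print(sorted(graph.impact("crm.customers.email")))
print(sorted(graph.lineage("report:churn_dashboard")))
```

Keeping both edge directions in the graph makes the two questions symmetric: the same search answers "where did this come from?" and "what breaks if this changes?".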
Business Glossary (BG)
•Business Glossary gives you the tools that you need to author and to manage the metadata in the metadata repository.
•Metadata in the metadata repository includes terms, categories, and information assets such as database tables,
database columns, schemas, and jobs. Business Glossary provides a Web-based tool to edit, browse, search, and
customize metadata in the metadata repository. The business glossary is the interface between the business user and
the metadata repository.
•Business Glossary helps you with the following business tasks:
– Develop a common vocabulary between business and technology
• A common vocabulary gives diverse users a common meaning of data.
– Find business information from metadata
• You can get the meaning of the data, its lineage, and who is responsible for defining and producing the data
– Provide data stewardship
• You can assign a person or group to information assets to manage the data through its life cycle
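One way to picture the relationship between terms, categories, information assets, and stewards is as a small data structure. The sketch below is an assumption made for illustration: the Term and Glossary classes, their fields, and the sample values are hypothetical and do not reflect the actual Business Glossary data model or API.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    """A business term (hypothetical shape, for illustration only)."""
    name: str
    definition: str
    category: str
    steward: str = ""                            # person or group responsible
    assets: list = field(default_factory=list)   # linked tables, columns, jobs


class Glossary:
    def __init__(self):
        self._terms = {}

    def add(self, term):
        # Case-insensitive key so "customer" and "Customer" resolve to one term.
        self._terms[term.name.lower()] = term

    def lookup(self, name):
        return self._terms.get(name.lower())

    def terms_for_asset(self, asset):
        """Business terms linked to a given technical asset."""
        return [t.name for t in self._terms.values() if asset in t.assets]


glossary = Glossary()
glossary.add(Term(
    name="Customer",
    definition="A party that has purchased at least one product.",
    category="Sales",
    steward="data-governance-team",
    assets=["dw.dim_customer", "dw.dim_customer.email"],
))

term = glossary.lookup("customer")
print(term.steward)
print(glossary.terms_for_asset("dw.dim_customer.email"))
```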
Business Glossary Anywhere
•Business Glossary Anywhere provides instant access to your business terminology from any desktop application.
•It helps you to find business information from any text-based document, Web page, or e-mail.
•You can access Business Glossary Anywhere from any desktop application by clicking on a term and viewing its business
definition in a new window with no loss of context or focus. You can get the meaning of the data, its lineage, and the
person who defined and produces the data.
•You do not need to log in to IBM Information Server and form a query.
Business Glossary (BG) Overview
DataStage and QualityStage
•DataStage and QualityStage provide a graphical framework that you use to design and run the jobs that transform and
cleanse your data.
•QualityStage includes a set of stages, a Match Designer, and related files that provide a development environment
within the DataStage and QualityStage Designer for building jobs to cleanse data. This environment lets you test your
matching and blocking strategies before running match jobs, and lets you manage and edit rules.
•The Designer client provides a common user interface in which you design your data quality jobs. In addition, you have
the power of the parallel processing engine to process large stores of source data.
•The integrated stages available in the Repository provide the basis for accomplishing the following data cleansing
tasks:
– Resolving data conflicts and ambiguities
– Uncovering new or hidden attributes from free-form or loosely controlled source columns
– Conforming data by transforming data types into a standard format
– Creating one unique result
QualityStage
•You can access all the QualityStage stages in the Data Quality group in the palette. Stages available are:
• Investigate stage
• Standardize stage
• Match Frequency stage
• Unduplicate Match stage
• Reference Match stage
• Survive stage
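The general flow behind the Standardize, Unduplicate Match, and Survive stages (standardize records, compare only records within the same block, then keep one surviving record per duplicate group) can be sketched in a few lines. This is a simplified illustration under stated assumptions: the field names, the blocking key, the exact-name match rule, and the longest-value survivorship rule are all hypothetical, and QualityStage's own matching is considerably more sophisticated than an exact comparison.

```python
import re
from collections import defaultdict


def standardize(record):
    """Standardize step: normalize case, whitespace, and phone formatting."""
    return {
        "name": " ".join(record["name"].lower().split()),
        "zip": record["zip"].strip(),
        "phone": re.sub(r"\D", "", record.get("phone", "")),
    }


def block_key(record):
    """Blocking: only records that share this key are compared with each other."""
    return record["zip"]


def is_match(a, b):
    """Exact-name rule (an assumption; real matching is typically weighted)."""
    return a["name"] == b["name"]


def unduplicate(records):
    """Group standardized records into duplicate sets, block by block."""
    blocks = defaultdict(list)
    for rec in map(standardize, records):
        blocks[block_key(rec)].append(rec)
    groups = []
    for block in blocks.values():
        block_groups = []
        for rec in block:
            for group in block_groups:
                if is_match(group[0], rec):
                    group.append(rec)
                    break
            else:
                block_groups.append([rec])
        groups.extend(block_groups)
    return groups


def survive(group):
    """Survive step: per field, keep the longest value found in the group."""
    return {f: max((r[f] for r in group), key=len) for f in group[0]}


raw = [
    {"name": "  John  SMITH ", "zip": "10001", "phone": "(212) 555-0100"},
    {"name": "john smith", "zip": "10001 ", "phone": ""},
    {"name": "Jane Doe", "zip": "94105"},
]
groups = unduplicate(raw)
survivors = [survive(g) for g in groups]
print(survivors)
```

Blocking keeps the number of comparisons manageable: records are only compared inside a block, so a poorly chosen key can hide true duplicates, which is why testing blocking strategies before running match jobs matters.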
Information Analyzer
•Information Analyzer is used to understand the content, structure, and overall quality of your data at a given point in
time. This analysis aids you in understanding the inputs to your integration process, ranging from individual fields to
high-level data entities. Information analysis enables you to correct problems with structure or validity before they
affect your project.
•This analysis helps you with:
– Understanding the inputs to your integration process, ranging from individual fields to high-level data entities.
– Correcting problems with structure or validity before they affect your project.
– Improving the accuracy of your data by making inferences and identifying anomalies.
– Analyzing columns.
– Identifying primary keys.
– Identifying foreign keys.
– Locating overlapping data across domains.
– Identifying changes in your data over time.
– Managing tables.
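Column analysis and key identification come down to simple statistics over the data: null counts, distinct counts, and how many values in one column also appear in another. The sketch below shows that reasoning on in-memory rows. The function names, sample tables, and thresholds are hypothetical and are not Information Analyzer's implementation.

```python
def profile_column(rows, column):
    """Basic column analysis: row count, nulls, distinct values, inferred type."""
    values = [row.get(column) for row in rows]
    non_null = [v for v in values if v not in (None, "")]
    is_int = bool(non_null) and all(str(v).lstrip("-").isdigit() for v in non_null)
    return {
        "column": column,
        "rows": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(set(non_null)),
        "inferred_type": "int" if is_int else "string",
    }


def primary_key_candidates(rows):
    """Columns with no nulls and all-distinct values are key candidates."""
    if not rows:
        return []
    candidates = []
    for column in rows[0]:
        stats = profile_column(rows, column)
        if stats["nulls"] == 0 and stats["distinct"] == stats["rows"]:
            candidates.append(column)
    return candidates


def value_overlap(child_rows, child_col, parent_rows, parent_col):
    """Share of child values found in the parent column (foreign key hint)."""
    parent_values = {r[parent_col] for r in parent_rows}
    child_values = [r[child_col] for r in child_rows
                    if r.get(child_col) not in (None, "")]
    if not child_values:
        return 0.0
    return sum(v in parent_values for v in child_values) / len(child_values)


customers = [
    {"id": "1", "email": "a@example.com", "country": "US"},
    {"id": "2", "email": "b@example.com", "country": "US"},
    {"id": "3", "email": "", "country": "DE"},
]
orders = [
    {"order_id": "10", "customer_id": "1"},
    {"order_id": "11", "customer_id": "3"},
    {"order_id": "12", "customer_id": "9"},  # orphan: no matching customer
]

print(primary_key_candidates(customers))
print(value_overlap(orders, "customer_id", customers, "id"))
```

An overlap below 100% flags orphaned values, which is one kind of anomaly that is worth knowing about before the data reaches an integration job.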
FastTrack
•FastTrack helps translate your business requirements into business applications.
•FastTrack shortens the design time needed to create source-to-target mappings and to automatically generate jobs.
•Leveraging metadata integration, FastTrack enables you to discover table column relationships, to link columns to
business glossary terms, and to generate jobs that become the starting point for complex data transformation in
DataStage and QualityStage Designer. Source-to-target mappings can contain data value transformations that, as part
of specifications, define how to build applications.
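A source-to-target mapping specification can be thought of as a list of rules, each naming a source column, a target column, an optional value transformation, and an optional linked glossary term. The sketch below applies such a specification to one row. The Mapping class, the column names, and the transformations are hypothetical and do not represent FastTrack's specification format or the jobs it generates.

```python
from dataclasses import dataclass
from typing import Callable


def identity(value):
    """Default transformation: pass the value through unchanged."""
    return value


@dataclass(frozen=True)
class Mapping:
    """One source-to-target rule (hypothetical shape, for illustration)."""
    source: str
    target: str
    transform: Callable[[str], str] = identity
    term: str = ""  # linked business glossary term, if any


SPEC = [
    Mapping("CUST_NM", "customer_name", str.title, term="Customer Name"),
    Mapping("CTRY_CD", "country_code", str.upper),
    Mapping("EMAIL", "email"),
]


def apply_mappings(row, spec):
    """Build a target row by applying each mapping rule to the source row."""
    return {m.target: m.transform(row[m.source]) for m in spec}


source_row = {"CUST_NM": "john smith", "CTRY_CD": "us", "EMAIL": "js@example.com"}
target_row = apply_mappings(source_row, SPEC)
print(target_row)
```

Because the specification is plain data, the same rules can serve as documentation for business users and as input for generating a job.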
Information Services Director
•Information Services Director provides a unified and consistent way to publish and manage shared information
services. Using Information Services Director, information specialists can design and deploy reusable information
integration tasks including data cleansing, data transformation, and data federation services.
•It allows units from any of the suite components to be deployed as Web services (For SOA and RTI) or Enterprise Java
Beans (EJBs).
•It load balances service requests across multiple IBM Information Server nodes, to ensure smooth pickup of load
spikes, and to ensure fault tolerance and high availability. It provides the following key capabilities:
• Packaging information integration logic as services that insulate developers from underlying sources
• Allowing these services to be invoked as EJBs or Web services
• Providing REST access to services using the XML or JSON format
• Exposing services as RSS feeds
• Using the JMS transport method for asynchronous access to service responses
• Providing load balancing and fault tolerance for requests across multiple servers
• Providing foundation infrastructure for information services
•The extensible architecture of Information Services Director lets it support a broad range of
information management tasks such as data cleansing, data transformation, and data federation services.
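Load balancing with fault tolerance across several nodes can be illustrated with a round-robin dispatcher that skips a node when it fails and tries the next one. This is a generic sketch of the technique, not a description of how Information Services Director implements it. The RoundRobinBalancer class, the ServiceUnavailable exception, and the node names are hypothetical.

```python
class ServiceUnavailable(Exception):
    """Raised by a node that cannot handle a request (hypothetical)."""


class RoundRobinBalancer:
    """Rotate requests across nodes; on failure, fall through to the next node."""

    def __init__(self, nodes):
        if not nodes:
            raise ValueError("at least one node is required")
        self.nodes = list(nodes)
        self._next = 0  # index of the node to try first on the next call

    def call(self, request):
        for attempt in range(len(self.nodes)):
            index = (self._next + attempt) % len(self.nodes)
            try:
                result = self.nodes[index](request)
            except ServiceUnavailable:
                continue  # fault tolerance: try the next node
            self._next = (index + 1) % len(self.nodes)
            return result
        raise ServiceUnavailable(f"all {len(self.nodes)} nodes failed")


def make_node(name, healthy=True):
    """Build a fake service node as a plain callable."""
    def handle(request):
        if not healthy:
            raise ServiceUnavailable(name)
        return f"{name} handled {request}"
    return handle


balancer = RoundRobinBalancer([
    make_node("node-a"),
    make_node("node-b", healthy=False),
    make_node("node-c"),
])
responses = [balancer.call(f"req-{i}") for i in range(3)]
print(responses)
```

In this run the second request skips the unhealthy node-b and is handled by node-c, so the caller never sees the failure.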
•Thank you!