Transcript dama-aug00
Leveraging the Microsoft
Repository in Your Warehouse
Saeed K. Rahimi
Graduate Programs in Software
University of St. Thomas
DWSoft Corporation
Contents
Data Warehousing and Repository
The Open Information Model and Meta
Data Coalition
Microsoft’s Data Warehousing Strategy
Integrating the Repository Into Your
Warehouse
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 2
Data Warehouse Architecture
Operational
Data
Data warehouse Life Cycle
DM
• Extract
• Cleans
• Transform
• Load
DW
DM
DM
Metadata Services
SQL Server 7.5 - Object Persistence Services
Repository
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 3
What is Metadata? (Continued)
Information not just data
What is 122599?
Is it an integer number?
A string?
Christmas day 1999?
Repository provides metadata about this:
mmddyy
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 4
What Is Metadata?
Information about your
Systems
Source systems,
Warehouse systems,
Processes
Etc.
Databases
Source databases
Warehouse databases
Table
Columns
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 5
What Is Metadata?
Information about your
Transformations
From source system to data warehouse
From data warehouse to source systems
Calculations
Total-Sales = amount*quantity – discount +
shipping&handling
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 6
What Is Metadata? (Continued)
OLAP information
Sales cube
TotalSales measure
Time, product, geography dimensions
Other information
Reporting Tools
Modeling Tools
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 7
Metadata Benefits
Roadmap to information in the warehouse
What is in the warehouse
What it means
How to get the information
Who owns it
When it was created
How it was created/generated
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 8
Metadata Benefits
All users that interact with the warehouse
will benefit from enhanced information
Analysts can better understand what data is
available, and how it was calculated
Maintains a history prior to current
processes
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 9
Metadata Benefits (Continued)
Provides a common frame of reference –
“what does total sales mean”
Provides a central storage location of
warehouse knowledge (helps with turnover,
growth)
Allows enterprises to ensure data
marts/warehouses are built using common
terms and goals
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 10
Open Information Model
The schema for the repository
Common set of core definitions to promote sharing
of information among different tools and vendors
Covers basic set of sub-models
Created with help/input from over 20 partners and
reviewed by 300
Microsoft has transferred rights to evolve the OIM
to the meta data coalition (MDC)
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 11
MDC OIM Components
Current components
Unified Modeling Language (UML)
Relational database schema
Data transformations
Multidimensional schema (OLAP)
English semantics
Legacy databases
Component descriptions
Components under review
Report definitions
Entity/relationship diagrams
Business engineering
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 12
Sample OIM Constructs
Transformation – mapping from
Source column to Target column
TfmTransformationHasTargetColumns
TfmTransformationHasSourceColumns
ITfmTransformation
TfmObjectSetConsistsOfObject
ITfmTransformableObjectSet
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
ITfmTransformableObject
(Columns)
Page 13
Virtual Relationship
This
This
A
A
B
VS
B
C
C
D
D
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 14
Virtual Properties
This
This
A
A
a
b
VS
B
c
C
a b c d e
B
D
d
© Copyright 2000 Saeed K. Rahimi
c
C
D
e
d
DAMA Presentation, August 2000
e
Page 15
Microsoft’s Strategy Impact
Prior to SQL server 7.0, DW was primarily
available only to very large companies
Tools were expensive, expertise hard to find
Microsoft has lowered the bar by bundling
components (with SQL server for free) and
making DW available to many more
companies
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 16
Microsoft Warehouse Components
Data transformation services
Extract, transform, load
OLAP services
Multidimensional server and analysis
Repository
Information sharing
English query
English to SQL translation
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 17
Repository Maintenance
Importing information into the repository has
always been a challenging task
Information needs to be mapped to the model
components
Information capture needs to be automated
Once in the repository, maintaining versions
of the information is needed
New information load can not destroy the
information already in the repository
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 18
Loading the Repository
Data Transformation
Services
OLAP
Services
Star Schema
Database
OLTP
Database
English Query
Application
Transform
Other
Sources
OLAP
DB Schema
DB Schema
Semantics
Other
Repository
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 19
Loading the Repository
DTS can save to repository
OLAP information can be saved to repository
with SQL Server 2000
OLE DB and ODBC data sources and related
information can be imported into repository
with SQL Server 2000
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 20
Implementation Issues
DTS doesn’t provide complete source to target
mappings for certain transformations
For example
When a query used as source,
Interfaces to many third party tools incomplete or
not available
Modeling tools
ERWin
Visio
PowerDesigner
Warehouse tools
Business Objects
Information Advantage
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 21
Repository Features
Extensible metamodel
Version management
Dynamic COM based interface
Batch XIF file based interface
Workspace concept
Some pre-built interfaces to data warehouse
tools
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 22
Keys to Success
Ensure repository information is current
Provide access to all users of the warehouse
Tackle implementation in well defined, small
steps
Designate someone with the responsibility
and authority to manage the repository
Maintain long term focus
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 23
Access to All Users
Show users what they want to see
Provide simple, easy to understand information
Make warehouse information easily available “in
context”
Use familiar interface and UI concepts
Web based access
Client/server based access
Provide custom integration to existing tools
and interfaces when necessary
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 24
Modes of Access
Administrative Access
Management
Meta-model (schema) changes
Meta-data loading and maintenance
End-user Access
Browsing
Navigation
Impact analysis
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 25
Types of Access
Local and over the LAN access
Metamodel management
Repository bulk loading
Interface from other tools (loading meta-data)
Remote and over the WEB access
Browser based
accessible from anywhere on the globe
Navigation oriented
Interface from/to other tools
May be minor editing capabilities
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 26
Access Control
User Classification
Different Class of Users Need to Access Different
Objects in the Repository
Different User Profiles
Different Read/write Access Control
Possible User Classes
Business Executive
Casual User
Database Administrator
Data Warehouse Administrator
Repository Administrator
© Copyright 2000 Saeed K. Rahimi
DAMA Presentation, August 2000
Page 27
Questions?
Comments or discussions?
Emails: [email protected]
[email protected]