Phil Bernstein: Repository Architecture and Benefits

Download Report

Transcript Phil Bernstein: Repository Architecture and Benefits

Microsoft Repository
Phil Bernstein
Microsoft Corp.
© 1999 Microsoft Corp..
1
Introduction
 Model – a labeled directed graph that
describes a software artifact




SQL Schema
OO interface
XML DTD
UML model
 Workflow definition
 Web site map
 Computer system
description
 A repository is a system to manage
models
© 1999 Microsoft Corp..
2
Microsoft Repository Includes
Customer
Update
Marketing
cust
emp
dept
dno
dna
Order
Product
Authorize
Credit
Order
Entry
Scheduled
Delivery
Bill
Customer
Salesperson
Schedule
Delivery
select all
Information Model
• COM interfaces
• SQL schema
• Highly Extensible
Inventory
Spec
Forms
Customer
ERD
Order
Product
Scheduled
Delivery
Salesperson
Architecture
Tables
VB Code
VC++ Code
Engine API
Repository
Engine
• Objects, properties
• Rich relationships
• Extensibility
• Versioning
© 1999 Microsoft Corp..
Model-Driven Tools
• Browser
• XML import/export
• Scripting language
• Model editor
• Component mgr
Standard DB Engine
Database
System
• SQL Server, Jet
• Others from Platinum
3
The Killer Meta-Data App
 Creating and maintaining a Data Warehouse is hard.
You need tools, which require lots of meta-data



Inconsistent data formats  Data quality & timeliness
Missing or invalid data  Relate tech’l & business models
Semantic inconsistencies  Tracing data lineage
Customer
 Meta-data driven data transformation tools
cust
emp
dept
dno
dna
Update
Marketing
Order
Product
Authorize
Credit
Order
Entry
Bill
Customer
Scheduled
Delivery
Salesperson
select all
Schedule
Delivery
Inventory
Forms
ERD
Spec
Customer
Order
Product
Scheduled
Delivery
Salesperson



Library of schemas and transformations
Generate code for loading a data warehouse
Version schemas & transformations for lineage
© 1999 Microsoft Corp..
Architecture
Tables
VB Code VC++ Code
4
Data Transformation Service
 Tool for building transformation scripts
 Schemas and transformations are stored
(versioned) in MS Repository
Step 1
Source
Step 2
Source
VB
Script
Step 3
© 1999 Microsoft Corp..
SQL Query
Destination
Query
Destination
OLE
Object
JScript
Exec Utility
5
Data Lineage
Repository
Source Table
Col1
Col2
...
Package
Version
Data
Transformation
Services
Package
Executions
Date/Time
User
LineageID
© 1999 Microsoft Corp..
Destination Table
Col1
Col2
...
LineageID
6
Open Information Model
 An OO meta-model to help tools share models
Uml
(Unified Modeling Language)
OO Modeling
Analysis and design
(Visual Modeler)
Cde
Dbm
(Component Description Model)
(Database Model)
Specifications
Sharing and re-use
SQL and OLAP schemas
Transformations
(COM, Visual Component Mgr)
(SQL Srv, DB2, Oracle, Plato)
© 1999 Microsoft Corp..
7
Open Info Model Structure
 It’s highly expressive
 Over 250 interfaces, 100 relationship types
 It’s easy to understand
 Industry-standard concepts, separable pieces
 Originally, COM models for MS Repository
 Now a Meta Data Coalition standard
expressed in UML
 New models: business processes,
business rules, knowledge mgmt
© 1999 Microsoft Corp..
8
To find out more
 Web sites
 http://www.research.microsoft.com/~philbe
 http://msdn.microsoft.com/repository
 Email
 [email protected]
© 1999 Microsoft Corp..
9