Transcript UDW - Abb
© ABB Utilities - 1 2002-01; NM-MW-0169/UTUA/NDF
Mats Werner
Utility Data Warehouse
UTUA/NDF
UDW 200
SPIDER Utility Data Warehouse
Data Warehouse for storage of all kinds of data
Presentation in
One-line diagrams
Presentation as Trends &
Historical information
10.25.32
B9
B8
C1
B3
StationA
T1
T2
B1
B2
B4
B5
B6
C2
B7
Data mining and other
analysis tools
C7
Presentation in
Reports & spreadsheets
Vedlikeholdsplaner: Region Syd
Region/Område
Region Syd
Stasjon/LinjeAlle
Status Alle
Type Alle
Spenning300kv
Skala Uker
1996 27 28 29 30 31 32 33
8
15 22 29 5 August
12
Nr Stasjon/Linje
Anl.DelÅrsak Status 1
Juli
96S432-1
Hasle
10 G2 Ombygging
I Gang
96S530-2
Flesaker G1
Årlig Revisjon
Utsatt
96S532-1
Lysebotn F1
MekaniskOK Reg.
96S541-1
Rendalen G3
UtskiftingOK Land
Lukk
Detaljplan
Ny plan
Utskrifter
Hjelp
© ABB Utilities - 2 -
UDW 200
SPIDER
SCADA/EMS/DMS
Data Warehouse
Based on...
other Utility
Information Systems
UDW – Utility Data Warehouse
© ABB Utilities - 3 -
Oracle-based Data Warehouse for Utility data
Able to store the continuous inflow of data from the power process
Able to store information from any other Utility Information System
Makes all this data available for Data Mining.
High User Performance
High Availability
High Redundancy
UDW – Utility Data Warehouse
Both Utility-wide Data Warehouse and on-line Historian!
Decision Support
Historical Analysis
in Office Environment
in Control Room Environment
Real-Time Network Management Systems
Non-real-time Information Systems
AMS
ERP
BMS
© ABB Utilities - 4 -
CIS
…
Oracle
RDB
EMS
DMS
SCADA
Utility Data Warehouse
Utility Data Warehouse – Original Requirements
RDBMS supporting the latest SQL standard
”Ad Hoc” queries, ”Data Mining”, etc.
Continuous operation
Availability > 99.96 %
Redundancy
Continuous inflow of data (every 10 sec, 24h/day)
Response times (incl. picture presentation): 3-5 seconds
Extremely large database tables: > 500 million rows
Continuous calculations
sum, max, average etc.
Complex calculations using MatlabTM
© ABB Utilities - 5 -
Automatic archiving of data that should not be retained on-line
On-line backup
Automatic maintenance of the database
eliminates the need for a DBA at the customer
Relational Analysis
The ability to analyze relations between data is the basis for Decision
Support and Data Mining
Relational Database Design makes this easy
Object
ID
Time
Stamp
Value
Object
ID
Object
Name
Station
123456
12:34:10
123.456
789123
12:34:10
456789
1
123456
TRF1
2.234123
1
789123
12:34:10
213.4324
1
123456
12:34:20
8.456
789123
12:34:20
2.2455
456789
12:34:20
212.143
Station
Station
ID
Name
446
446
City23
Downtown
TRF2
226
226
FOO
WestRing
456789
BAY1
791
…
2
987654
BAY2
446
…
1
…
Quality
ID
…
1
© ABB Utilities - 6 -
…
Example:
”Give me the name and the location of all lines that has been in alarmed state
during the weekend ”
Location
…
UDW - Open but integrated in SPIDER concept
UDW is a completely open system
UDW is also a well integrated product in the SPIDER concept
© ABB Utilities - 7 -
… as shown in the following areas:
Open Commercial Tools and Interfaces
Application Programming Interface, API
Well Integrated in SPIDER
Powerful Database Structure
UDW as an integration platform
UDW - Open Commercial Tools and Interfaces
Based on Oracle
Oracle is an open commercial relational database
© ABB Utilities - 8 -
delivering ”the industry's highest performance, reliability and security for thousands
of popular applications”
All tools applicable for Oracle is also applicable for UDW
Possibility to access UDW data directly in Oracle, using SQL or Oracle tools
New development in Oracle means increased value of UDW
Open interfaces
ODBC/JDBC
Microsoft COM
SQL*Net
UDW API (see below)
UDW Application Programming Interface, API
Simplifies access to UDW data
From C, C++, Java or Microsoft COM-based applications
Simplifies the creation of custom user applications
Functions for inserting and reading many data points at once
General browser routines, to access data without knowing the data model
An Oracle-independent general database server service,
Implements Additional functionality
User-friendly access routines for setting and getting a range of time-series
data
When data is stored by event, the data is retrieved as if it was stored cyclically
Implements support for Daylight Savings Time, Seamless Data Retrieval and
Data Sets (as described later in the presentation)
The UDW API is used when:
© ABB Utilities - 9 -
for remote SQL access without SQL*Net
accessing UDW data in the SPIDER environment
accessing UDW data through the Microsoft COM interface
accessing UDW data in dedicated programs (including UDW-internal)
UDW - Well Integrated in SPIDER
UDW data can be presented in SPIDER pictures,
on-line diagrams
the report and trend tools used in SPIDER
The object oriented Data Engineering for the UDW is integrated with
SPIDER Data Engineering.
The UDW programs are monitored by SPIDER
Alarms and warnings from UDW are sent to SPIDER alarm list
© ABB Utilities - 10 -
e.g. if the data sampling is not functioning properly due to network or
hardware problems
UDW - Powerful Database Structure
Data is stored in a user-friendly way
Uncompressed in open Oracle tables
Database views to simplify common queries
Data model is temporal
history of each object exist in the database
calculation algorithms related to certain objects are time-stamped
Network relations and calculations are therefore correct even when studying
old data, e.g. data restored from off-line archives
Very good performance regardless of system and database size,
© ABB Utilities - 11 -
By the use of the Oracle partitioning technique
Sampling schemes are easily defined using Object Types and Patterns
UDW – Object Types and Patterns
Sampling schemes defined in Object Types
Each Object Type corresponds to one Oracle Table
Objects to be sampled
Object Types
UDW patterns
sampled Object Types
calculated Object Types
10 sec measurands
1 min average measurands
Measurement pattern 2
indications by event
hourly minimum measurands
Indication pattern
state estimator result
daily maximum measurands
Measurement pattern 1
For each Object Type is defined:
Sampling
method
cyclically – including sampling frequency
by event – with our without integrated dead band
© ABB Utilities - 12 -
Retention
An object to be sampled refers to a Pattern
A Pattern refers to a number of Object Types
If
period
it shall be archived when retention period is over
Enables easy set-up of sampling schemes for sampled objects
UDW as and Integration Platform (example)
Platform for application development and cross-platform integration
Developing a simple
application in MS Excel,
using the UDW COM
interface
Combine the data in the
Utility Data Warehouse
with the functionality of
Excel:
© ABB Utilities - 13 -
• Visual Basic programs
• Excel spreadsheet functions
• Excel Charts
UDW - Data Model
Sampled data is stored in Object Types
A simple Power System model is also provided
including e.g. stations, sub systems, point classes
To enable queries and selections based on relational criteria
e.g. ”show all measured values included in station named AMHERST”
Automatically populated from SPIDER Data Engineering
Temporality is provided
Objects
Stations
10.25.32
B9
B8
Station A
C1
T1
C2
T2
B1
Subsystems
Information and relations
between real-world objects
B2
B7
B3
B4
B5
B6
C7
© ABB Utilities - 14 -
Real-world objects
10 sec measurands
1 hour average
daily maximum
Object Types: values related to
sampled real-world objects
Additional data relations possible to define easily
Since an open relational database technology is used
UDW - Data Sampling
Virtually any type of information can be stored in the UDW
SPIDER SCADA/EMS/DMS Sampling
All types of data in the SPIDER real-time database Avanti can be sampled and
stored in the UDW database.
Cyclically, according to a user-defined sampling rate.
“By Event" (upon change), with or without an integrated dead band
Sampling from other Information Systems
© ABB Utilities - 15 -
The actual sampling is highly dependent on the source system from which the
data is extracted.
There are several ways to store data into UDW:
Direct SQL insert, e.g. through database links (assuming an RDB source)
Direct access through open interfaces (e.g. ODBC, JDBC, Microsoft COM)
Dedicated program that uses the UDW API
Dedicated program that uses the ”bulk data inserter” (as used in SCADA-sampling)
Access through Avanti (e.g. by first inserting data into Avanti)
UDW – By Event sampling, integrated dead band
Sampling data By Event with an integrated dead band
a threshold value in percent of the measurement range is defined
measured value changes are
accumulated between samples
© ABB Utilities - 16 -
if the actual value differs only
slightly from the latest stored,
it will thus eventually be sampled
anyway
UDW - Quality Marking
UDW handles quality information together with all sampled values.
propagates automatically through all calculations
validity of calculation result depends on quality of included values
according to what is defined in Data Engineering
Mapping of quality states is defined in Data Engineering
For SPIDER sampling these mappings are set by default, but are configurable.
Possible to distinguish quality information generated in UDW and quality
information originating from the SPIDER Avanti database.
© ABB Utilities - 17 -
Original quality value
Quality value in UDW
Quality value in UDW
after manual correction in UDW
after calculation (a+b)
Object
Value
Quality
Object
Time
Stamp
Value
Quality
Object
Time
Stamp
Value
Quality
123456
123.46
Invalid
123456
12:34:10
123.46
Invalid
22222
12:34:10
125.46
Invalid
789123
2.000
Corrected
789123
12:34:10
2.000
Corrected in
SPIDER
33333
12:34:10
7.000
OK
456789
213.434
OK
456789
12:34:10
5.00
Corrected in
UDW
UDW - Post Disturbance Review
The Post Disturbance Review, PDR, function allows the user to monitor
the process behavior before, during, and after a disturbance
T rigging
event
(discarded)
T rigging
event
In the DE tool it is specified:
T rigging event
(prolonging)
time
© ABB Utilities - 18 -
period
before
event
Deadzone
which data is to be monitored,
which events that will trigger a PDR
time periods
the number of areas available for
storage of disturbance information
period after
event
A disturbance recording can be manually triggered by an operator
It is possible to playback the recorded disturbance data in a Single Line
diagram.
Upon operator request, data for completed disturbances can be saved on
the off-line medium used for archiving
UDW - Data Archiving
For each Object Type it is defined (in Data Engineering):
how long it shall be retained on-line in the database
if it shall be archived
When retention period has passed, data is automatically exported to the
chosen off-line medium
UDW maintains a library system for the archived data.
Seamless presentation is supported
If restored data and on-line data connects in time
Standard software can be used for the Archiving function.
© ABB Utilities - 19 -
When data shall be restored the user specifies the type of data and for which
time interval the data shall be restored
such as Legato NetworkerTM
UDW - Advanced Data Processing
Powerful and versatile data processing capabilities
Times series calculations:
Snapshot calculations
All power of the Matlab tool is available for utilization
Calculation wizard to ease simple calculation definitions
Complete ROLAP1 aggregation support (configurable)
Data can be automatically aggregated in all dimensions
© ABB Utilities - 20 -
Raw data and processed data are stored and handled the same way,
i.e. both can be used for e.g. data mining
Both during sampling and at manual entry
Quality coding of all values
E.g. summary for e.g. a station or the whole power network
Automatic recalculation of all2 formulas
Any formula/calculation on objects for many points of time
Matlab used as calculation engine
Any formula/calculation on objects for a certain point of time
Custom calculations
Sum, Average, Max, Min
Propagates through calculations
Calculation patterns (“templates”)
For easy connection of sampled objects to a set of pre-defined calculations
and sampling profiles
1ROLAP
= Relational
On Line Analytic
Processing
2Restrictions
apply on
custom calculations to
prevent recursive
deadlocks
UDW - Report and user interaction functions
UDW data is easily accessed from various client tools
There are four main access routes:
Through SPIDER (historical data originating from SPIDER can be presented
in SPIDER trends and spreadsheet reports, and in one-line diagrams).
Direct SQL read (through SQL*Net, Oracle tools, Microsoft Query etc.)
Direct access through open interfaces - ODBC, JDBC, COM, (e.g. Microsoft
Office programs).
Through the API or Java API (dedicated programs accessing UDW data)
Presentation in
Reports & spreadsheets
Presentation in
One-line diagrams
Presentation as Trends &
Historical information
Vedlikeholdsplaner: Region Syd
10.25.32
Station
A
T1
B4
C2
T2
B1
B3
Region/Område
Region Syd TypeAlle
StatusAlle
Stasjon/Linje
Spenning
Skala Uker
Alle
300kv
1996 27 28 29 30 31 32 33
15 22 29 5August
12
Nr Stasjon/Linje
Anl.Del
Årsak Status1 8 Juli
96S432-1
Hasle
10 G2Impinging
I Gang
96S530-2
Flesaker G1 Årlig Revisjon
Utsatt
96S532-1
LysebotnF1 Mekanisk
OK Reg.
96S541-1
RendalenG3 Utskifting
OK Land
B9
B8
C1
B2
B5
B6
Data mining and other
analysis tools
B7
C7
© ABB Utilities - 21 -
Lukk
UDW 200
Database
(Oracle)
Detaljplan Ny plan
Utskrifter
Hjelp
UDW – Client example: SPIDER Time Tagged Trends
The Time Tagged Trend (TTT) function is used for presentation of
historical data in curve diagrams.
Up to 16 curves in each diagram
Automatic scaling of value axis
depending on curve values.
The values can be fetched from
different object types
© ABB Utilities - 22 -
each with its own value axis.
Can have time offsets relative to each other.
Quality coding
UDW – Client example: SPIDER Single Line Diagram
UDW data can be shown in the Single Line Diagram
© ABB Utilities - 23 -
by selecting a historic time and an Object Type or a Data Set.
Possible to start a replay of the data
using tape-recorder like controls: play, advance, rewind, stop
the selected time is automatically changed with a given speed.
UDW – Client example: Microsoft Excel
© ABB Utilities - 24 -
There are several ways to get UDW data into Excel:
via the SPIDER Excel add-in
directly through the
Oracle ODBC driver
copy/paste or export/
import data retrieved
with another tool,
e.g. Oracle™ Browser
use the UDW COMinterface in an Excel
Visual Basic program,
that reads the appropriate information from
UDW
Once the data is in Excel, its built-in functions can be used
i.e. data can be calculated on, charts can be created etc.
UDW – Client example: Microsoft Query
© ABB Utilities - 25 -
Using the Microsoft Query tool it is possible to create custom queries to
the UDW in a user-friendly way, from the Windows environment
© ABB Utilities - 26 -
UDW – Client example: MATLABTM Graphics
Using the integrated MatlabTM server it is possible to define
advanced formulas and calculations
Using a MatlabTM client tool it is also possible to benefit from
the advanced graphical user
interface of MatlabTM, and that
way visualize data and correlations found in UDW data.
UDW - Data Sets
The Data Set function facilitates presentation of UDW data
Groups a set of Object Types
A Data Set may include object groups from different SPIDER
concepts: Measurands, Process values, Indications etc.
© ABB Utilities - 27 -
Can thereafter be referenced as one identity
Practical e.g. when presenting historical data in Single Line diagram
UDW - Seamless Retrieval – Data Set example
Measured values are often sampled with several frequencies, where higher
frequency normally corresponds to a shorter retention period
Each sampling frequency corresponds to one Object Type
For some points of time there may exist data in more than one Object Type
By creating a Data Set of these Object Types, it is possible to
address the sampled values as one entity
get best possible resolution for all points of time
Data that are retrieved when asking for data from
t1 to tn:
10 second
t1-t2 1 hour values
t2-t3 1 minute values
t3-tn 10 second values
1 minute
© ABB Utilities - 28 -
1 hour
t1
t2
t3
tn
UDW - Daylight Saving Time
Calendar time is normal time
with Daylight Savings offset
The UDW has full support for Daylight Savings Time.
It is possible to store both normal time and calendar time in UDW.
It is recommended for most usages to only store normal time and let the UDW
API take care of the DST conversions.
If the times are retrieved through the UDW API,
the UDW takes care of the conversion
both when it comes to individual points of time and
larger time spans.
The UDW API is used when
accessing UDW data in the
SPIDER environment, by the
MS COM interface and when
accessing UDW data in
dedicated programs
Example: If the user wants to get data for a time span that includes a switch between
normal time and Daylight Savings Time, there will be the correct number of times
retrieved (e.g. an extra hour if going from daylight savings to normal time).
© ABB Utilities - 29 -
The calendar time is used in display pictures and reports
UDW - Authority
UDW uses standard Oracle™ authority routines
Users are assigned to a Profile and one or more Roles
Roles provides specific access rights
Profiles are used to set session parameters
Such as the maximum CPU time for a session
Examples of Roles defined in UDW:
system administrator, privileged user, common user, etc.
Number of sessions
Max CPU per session
Connect time during session
etc.
© ABB Utilities - 30 -
Profiles are used to limit the risk for
low priority users or user groups to
impair the performance of the UDW
database by using badly formed
SQL-statements, for instance, a full
table scan.
Profile
User
Database creator
System administrator
Role
UDW - Audit Trail
The Audit Trail function in UDW tracks all changes made in the UDW
database
For all manual updates the following data is stored:
old and new value
the affected time and object identity
the point of time of the modification
an operator identification
The standard Oracle™ audit trail function can also be used
Tracking of insert is not recommended
© ABB Utilities - 31 -
providing tracking of user actions on a table level.
the insert rate is very high in the system, and auditing insert would degrade
performance.
UDW – Planned Values
The UDW supports storage of planned values
A ”window” of future data points are a opened for storage
Can be used to store e.g. forecasts or schedules
Defined as an Object Type in Data Engineering
© ABB Utilities - 32 -
Handling of retention period, archiving etc. just like other Object Types
A number of parameters are possible to define:
if the values shall be overwritten by actual sampled values or not
if the values shall initially be set to blank or ”invalid”
the size of the ”window” – the time span to be opened in the future
initial time period of the plan, i.e. including historical times
UDW - Data Engineering
The Data Engineering of the UDW is object oriented and dialog driven.
Is totally integrated with the Oracle-based SPIDER data engineering tool.
© ABB Utilities - 33 -
All data engineering logic for UDW is contained within the UDW product.
UDW – Specification of Object Types
For sampled Object Types:
sampling method: by event (including dead band) or cyclically (including
frequency)
retention period, i.e. how long the object shall be retained on-line in the
database
mapping between the source system (SCADA) and UDW quality codes
properties to be sampled from source system (SCADA) to UDW
© ABB Utilities - 34 -
for instance the actual value and updated flag for a measured value
For calculated Object Types:
source Object Type (a sampled Object Type or another calculated Object
Type)
type of calculation (e.g. Average, Sum, Max, Min)
required percentage factor of valid source data to get a valid result
if time stamping shall be in the beginning or end of the calculated period
UDW - Backup and recovery
For safety copy reasons the on-line data in UDW can be backed up, using
standard Oracle™ functionality.
The backup is done in a way so that all data, including calculated values,
are consistent.
The interval between backup activities is configurable
UDW Safety Backup and UDW Archiving are complementary functions!
© ABB Utilities - 36 -
Oracle
RDB
Utility Data Warehouse
UDW on-line system
UDW
Safety
Backup
UDW
Archiving
Archiving of a safety
backup of on-line data
Archiving of data older
than retention period
UDW - Configuration
The UDW can be configured in different ways to provide the required
degree of redundancy and data security.
Data Redundancy
The data in the UDW database is stored redundantly so that if one disk fails,
no data is lost
To get full data redundancy, RAID disks of type 0+1 are used
Server Redundancy
© ABB Utilities - 37 -
striping of data and disk mirroring is applied
The UDW servers are redundant so that if one UDW server becomes
inoperable (e.g. due to hardware errors) another server is taking over.
Different configurations are applicable depending on the requirements on
allowable downtime and loss of data during switch-over.
Redundant Server solutions, with automatic switchover
Emergency Center solutions, with manual switchover
Single Server solutions
UDW - Redundant Server solution
Redundant Server solution
No single point of failure
Automatic switchover when the on-line server malfunctions
Separate server for Safety Backup and Archiving
Backup
Server
DLT Automated Tape Library with duplicate
drives for archiving and restoring
SCSI
SCSI
Backup LAN
SCADA/EMS/DMS
The two UDW servers are connected in a
cluster configuration
Both servers receive data to be stored
from the sampled system.
One of the servers is on-line while the
other is hot standby and
ready to take over the processing.
The on-line server updates the UDW
database with the sampled data and
calculation results
Memory Channel
© ABB Utilities - 38 -
UDW
Server 1
Server-RAID
Server-RAID
Connection
Connection
SCSI
SCSI
UDW
Server 2
RAID Disk
LAN
UDW – Emergency Center solution
Emergency Center solution
No single point of failure
Manual switchover when the on-line server malfunctions
Original UDW rebuilt by restoring a safety backup
SCADA/EMS/DMS
HIS Server, online
RAID Disk
Server-RAID
Connection
SCSI
The standby server is kept up-to-date with
the on-line server, using Oracle Standby
It is possible to restart the standby database
as the primary UDW database with minimal
loss of time and data.
LAN
SCADA/EMS/DMS
HIS Server, standby
Emergency Control Center
© ABB Utilities - 39 -
RAID Disk
W
A
N
Server-RAID
Connection
SCSI
LAN
UDW – Single Server solution
Single Server solution
With either RAID or disk shelf
SCADA/EMS/DMS
Disk or RAID
Shelf
UDW Server
SCSI
© ABB Utilities - 40 -
LAN
If a disk shelf is used instead of RAID, the
single disk controller is located in the UDW
Server