Business Intelligence

Download Report

Transcript Business Intelligence

Business Intelligence
BI Fundamentals

Business Transactions

Data Bases

Data Warehouses

Data Marts

Data Mining
Data bases (DB or DBMS)



“A collection of information organized in such
a way that a computer program can quickly
select desired pieces of data.”
An electronic filing system
Organized by



Fields: a single piece of information
Records: one complete set of fields
Files: a collection of record
Data warehouse (DW)



“Contain a wide variety of data that present a
coherent picture of business conditions at a
single point in time.”
“A database system which contains
periodically collected samples or summarized
(aggregated) transactional data; e.g., daily
totals, or monthly averages”
Typically a compilation of information from
multiple transactional databases
Data mart


“A database, or collection of databases,
designed to help managers make strategic
decisions about their business.”
A smaller and more focused form of a data
warehouse.


Usually created for a particular department or position
A data mart created as a subset of data warehouse
data are referred to as a “dependent data mart”.
Data mining

“A class of database applications that look for
patterns in data to be used to predict and
direct future behavior.”

Increasingly being used by marketers to find
consumer data through the web and store
purchases.
What is BI?


The new technology for understanding the past and predicting the
future
A broad category of technologies that allows for



Gathering, storing, accessing and analyzing the data business users
make better decisions
Analyzing business performance through data-driven insight
A broad category of applications, which includes the activities of




Decision support systems
Query and reporting
OLAP
Statistical, forecasting and data mining
7
BI vs. AI



AI systems make decisions for the users
BI systems help users make the right
decisions, based on the available data
However, many BI techniques have roots in
AI
8
BI Processes
9
Business Intelligence
10
Data-Information–Knowledge–Decision Making Cycle
11
What does BI seek to find?

Patterns

What kind of patterns?



Sales
Stocks
Anything useful
Techniques for Finding
Patterns

Statistics


Trends
Correlation
(searching for a
best fit)
Patterns continued

Combinatorial


If-then relationships
Example

If we put chips on sale on a Friday, then
we also sell more soda.
Leading the Industry

Cognos


BI software company
Software

Used for reporting, analysis, scorecarding,
dashboards, business event management, and
data integration
Cognos

Multiple Solutions

Industry
 Banking
 Education
 Defense
 Government

Department
 Executive Management
 Finance
 Marketing
Open Source Tools for BI




ETL (Extract, Transform, Load) tools
OLAP (Online Analytical Processing) servers
OLAP clients
DBMSs (Data Base Management System)
17
ETL Tools

Bee


CloverETL




ROLAP (Relational OLAP) oriented ETL tool
ROLAP oriented ETL tool
Implemented in Java and uses JDBC to transfer
data
cloveretl.berlios.de
Octopus


ROLAP oriented ETL tool
Implemented in Java and uses JDBC
18
OLAP Servers

Bee




Lemur



ROLAP oriented server
Uses mySQL to manage the DB
sourceforge.net/products/bee/
HOLAP oriented server
www.nongnu.org/lemur
Mondrian


ROLAP oriented server
Implemented in Java
19
OLAP Client

Bee




Web-based, used with Bee OLAP server
Generates pie, bar, chat, etc. (in 2D & 3D)
Export data to Excel, PDF, PNG, Powerpoint,
XML
Jpivot




Web-based, used with Mondrian OLAP server
Generates 2D & 3D graphics
Export data to PDF
jvipot.sourceforge.net
20
DBMSs

MonetDB



MySQL



Run on Linux, Windows, Mac OS, etc.
monetdb.cwi.nl
Run on Linux, Windows
www.mysql.com/products/mysql
MaxDB



Formely SAP DB (by SAP AG)
Run on Linux, Windows
www.mysql.com/products/maxdb
21

PostgreSQL


www.postgresql.org
Run on Linux, Unix, Windows (versi > 8.0)
22
PALO OLAP

Palo OLAP Server




Palo ETL Server


http://www.jedox.com/
Open source MOLAP server
be installed locally or in a company network
enables the efficient extraction of mass data from
heterogeneous data sources, ie. all common
relational database systems and flat files
Palo OLAP Client

http://www.jpalo.com/en/
23
Data Mining Softwares

Open sources





Borgelt data mining suite
Gnome data mine
Weka
RapidMiner
Commercials






See5 (Rulequest)
Clementine (SPSS)
Enterprise Miner (SAS)
GhostMiner (Fujitsu)
Statistica Data Miner (StatSoft)
Oracle Data Miner (Oracle)
24
Borgelt Data Mining Suite

Tasks:






Association: apriori, eclat
Classification: bayesian networks, decision
trees, naive bayes
Regression: neural networks
Clustering: self-organizing maps (SOM)
Platforms: Linux, Unix, MS Windows
Website:

http://fuzzy.cs.unimagdeburg.de/~borgelt/software.html
25
Genome Data Mine

Tasks:




Platforms: Linux, Unix, MS Windows
Website:


Association: apriori
Classification: decision trees
http://www.togaware.com/datamining/gdatamine
Owner: Togaware, Canberra, Australia.
26
WEKA

Tasks:


Association: apriori
Classification: decision trees, support vector machines,
conjunctive rules



Platforms: Linux, Unix, MS Windows
Website:


Clustering: k-means
http://www.cs.waikato.ac.nz/ml/
Owner: University of Waikato, Hamilton,
New Zealand
27
RapidMiner




http://rapid-i.com/
The world-leading open-source system for
knowledge discovery and data mining
Multiplaftorm: implemented in Java
Supports about 400 operators data mining
28
Who uses BI?





Businesses
The Government
?
?
What are some ethical implications of the use
of BI?