PowerPoint 簡報

Download Report

Transcript PowerPoint 簡報

Light-path Monitor System of
TWAREN Optical Network
National Center for High-Performance Computing
Speaker: Ming-Chang Liang
1
INTRODUCTION
2
TWAREN phase 2





TWAREN was adapted for more protection methods and
better availability at the end of 2006, called TWAREN
phase 2.
Tens of optical switches and hundreds of lightpaths were
then served as the foundation of the layer 2 VLAN
services and the layer 3 IP routing services.
In 2008, tens of VPLS switches were further incorporated
to provide additional Multi-point VPLS VPN service.
The layer 1 lightpaths can be protected by SNCP, layer 2
VLAN by spanning tree recalculation and layer 2 VPLS
by fast reroute technology.
All these improvements transform TWAREN phase 2 into
a true hybrid network capable of providing multiple
layers of services and high availability .
3
Architecture of Optical Network
4
Architecture of TWAREN phase 2
NCCU
ASCC
NTU
NIU
15454
6509
7609
7609
7609
6509
15454
15454
6509
15454
6509
NDHU
3750
7609
15600
NCU 15454
12816
7609
12816
15454
MOEcc
NCNU
7609C
NHLTC
Taipei
7609
12816
12816
3750
6509
NCTU 15454
15600
15454
7609C
12816
NCHC
Hsinchu Taichung
6509
15454
7609C
7609
NCHC
12816
6509
Tainan
15454
NCHC
NCHU
7609C
7609
NTTU
6509
12816
15454
12816
NTHU 15454
3750
6509
15600
7609
15454
6509
7609
NSYSU
15454
6509
7609
NCKU
15454
6509
7609
CCU
STM64
STM16
10GE
GE
5
CTC is not enough for us
 TWAREN phase 2 inherently has the
ability to guard against a single point of
hardware or circuit failure, so the
failure is less likely to affect the actual
service provisioning.
 When port or circuit is fault, we must
determine which lightpaths are affected
and then correlate with services of
upper layers.
6
DESIGN OF NMS
7
1st Stage Architecture of NMS
Monitor Objs
Control API
Traps
GUI &
Ticket System
Fault Detection
Data Collectors
Fault Location
MIBs
Syslogs
Current Status
DB
Threshold
DB
Net flows
Telnet/SSH
Long Term
DB
Case/Action
DB
TL1
Mirror
Interactive
Auto Action
Threshold Analyzer
Report System
Passive
8
Lightpaths Monitor System
Monitor System in NOC
Alarm trigger
Alarm Email
Database
Trap parser
Light-path data Parser
ONS
TWAREN Optical Network
9
Frequent incident case 1
10
Frequent incident case 2
11
Important information in a trap
Value index
Value
1
Trap agent host name
The hostname which sent this trap
2
Trap agent IP address
The IP address of host which sent this trap
3
sysUpTime
The system uptime of host which sent this trap
4
snmpTrapOID
The mapped OID of this trap
5
Cerent454NodeTime
The ONS clock time
(YYYYMMDDhhmmss)
cerent454AlarmState
1.3.6.1.4.1.3607.6.10.20.30.20.1.80
The severity level of this warning
(defined by Cisco CerentNotificationClass)
6
Description
12
cerent454AlarmState
Alarm State
Meaning
Number
Traps that use this alarm
stats
31
diagnostic
All traps of ONS
40
cleared
All traps of ONS will sent this
state when fault be solved
50
minorNonServiceAffecting
1. carrierLossOnTheLAN
2. transportLayerFailure
80
minorServiceAffecting
All traps of ONS
90
majorServiceAffecting
1. carrierLossOnTheLAN
2. transportLayerFailure
100
criticalServiceAffecting
1. lossOfSignal
2. lossOfFrame
13
The OID of ONS Trap
ONS-15600
Trap Name
lossOfSignal
lossOfFrame
carrierLossOnTheLAN
transportLayerFailure
MIB OID
.1.3.6.1.4.1.3607.2.20.0.430
.1.3.6.1.4.1.3607.2.20.0.390
.1.3.6.1.4.1.3607.2.20.0.220
.1.3.6.1.4.1.3607.2.20.0.3540
ONS-15454
Trap Name
lossOfSignal
lossOfFrame
carrierLossOnTheLAN
transportLayerFailure
MIB OID
.1.3.6.1.4.1.3607.6.10.30.0.430
.1.3.6.1.4.1.3607.6.10.30.0.390
.1.3.6.1.4.1.3607.6.10.30.0.220
.1.3.6.1.4.1.3607.6.10.30.0.3540
14
DESIGN OF DATABASE
15
Relationship of Data Tables
Basic Data Tables
Relationship Tables
Component
Circuit
People
VLAN Services
Location
VPLS Services
Unit
ONS
Light Path
Vendor
ONS
Cross Connection
…., etc
…., etc
16
Basic Data Tables
Component Data Table
Component_ID
Parent_C_ID
Name
1
0
TN7609P
ID
Name
12
1
Slot_1
1
CHT
2
0
TP15454
2
APBT
16
2
Slot_3
3
RingLine
135
12
Port_9
Vendor Data Table
People Data Table
ID
Name
Phone
Address
Service_Time
Service_WeekDay
1
John
0939123123
xxxxxxx
8-17
1,3,5
2
Mary
0958123123
xxxxxxx
ALL
ALL
Location Data Table
Unit Data Table
ID
Name
Address
ID
Name
1
MOEcc
xxxxx
1
NCKU
2
NTU
xxxxx
18
THU
17
Port Table
Field
Type
PortID
int
Type
CardName
Bandwidth
Status
Description
Port Component ID
Port type:
int
Ethernet(0)
SDH(1)
varchar Line Card type name
int
Bandwidth
int
Defined by us
18
Topology Link Table
Field
NodeA
Type
Description
int
NodeB
int
PortA
int
PortB
int
The component IDs of the equipments
connected with the link.
The component IDs of the ports
connected with the link.
19
Cross Connection Table
Field
Type
Description
CRS
int
Cross Connection unique number
SNCP
int
If SNCP protection? (0: No, 1:Yes)
PortFrom1
int
Port Component ID - From 1
PortFrom2
int
Port Component ID - From 2
PortTo1
int
Port Component ID - To 1
PortTo2
int
Port Component ID - To 2
ChannelFrom1
int
Port From 1 – Channel ID
ChannelFrom2
int
Port From 2 – Channel ID
ChannelTo1
int
Port To 1 - Channel ID
ChannelTo2
int
Port To 2 - Channel ID
SNCPPathFrom
int
Current SNCP selector uses WORKING(1) or PROTECTION(2)
SNCPPathTo
int
Size
int
VC bandwidth size, unit is VC1 (155mbps)
Status
int
Defined by us
CKTID
varchar
Circuit Identification string
20
Light-Path Table
Field
LightPath
Name
Type
Description
int
Light Path unique number
varchar
Light Path name (CKTID)
PortFrom
int
Port – From
PortTo
int
Port - To
SNCP
int
If SNCP protection? (0: No, 1:Yes)
Size
int
VC bandwidth size, unit is VC1 (155mbps)
TraceCRS
varchar
Cross-connection path string that a light-path
passed through
TraceConfigured
varchar
Configured ports path (port Component ID string)
that a light-path should pass through.
TraceCurrent
varchar
Actual ports path (port Component ID string) that
a light-path pass through currently.
Status
int
Defined by us
21
Alarm Table
Field
Id
Type
Int
Description
Alarm serial number
EventName
Varchar Alarm identify name
HostName
Varchar The name of the host which sent this alarm
AgentIP
Varchar The IP address of the host which sent this alarm
Category
Varchar The class of this alarm
Severity
Varchar The severity level of this alarm (defined by us)
UpTime
Varchar
TrapTime
Varchar The time that this alarm be generated
Interface
Varchar The port be affected
The uptime of the equipment which sent this
alarm
AlarmStatus Varchar The severity level that be defined by CISCO
LightPath
Varchar The light-path names be affected
22
IMPLEMENTATION
23
Working steps
 Build Port table by reading from
Component table.
 Send some TL1 commands to all ONS.
 Build TopologyLink and CrossConnection
tables by parsing the responses of TL1.
 Build LightPath table by aggregating Port,
TopologyLink, CrossConnection tables.
 Determine the affected ports When
receive traps from ONS, and then
correlate the affected services with
database.
24
TL1 commands
Command
Description
ACT-USER::username:123::password;
Login
RTRV-NE-IPMAP:::123;
Get information about
topology link neighbors
RTRV-CRS::ALL:123;
Get information about cross
connections
RTRV-VC::ALL:123;
Get information about all VC
statuses, including SNCP
selector status
CANC-USER::username:123:;
logout
25
Partial SNCP (1)
ONS-A
ONS-B
ONS-C
ONS-D
RA
RB
Configured working path
Configured protection path
26
Partial SNCP (2)
ONS-A
ONS-B
ONS-C
ONS-D
RA
RB
Actual working path
Configured protection path
27
28
WEB-BASED ALARM LOG
SYSTEM
29
Example 1
30
Example 1
31
Example 2
32
INTEGRATED VISUAL
INTERFACE
33
34
35
36