IP over InfiniBand: Connected Mode

Download Report

Transcript IP over InfiniBand: Connected Mode

IP over InfiniBand: Connected Mode
Vivek Kashyap
[email protected]
© 2002 IBM Corporation
IETF-55: IPoIB Connected Mode
IP over InfiniBand Modes
 Unreliable Datagram
AD review comments
IBTA 1.1 updates
 Reliable Connected
 Unreliable Connected
 Reliable Datagram
2
Vivek Kashyap, IBM
IETF-55: IPoIB Connected Mode
IPoIB non-UD options
 RC
 RD
Large MTU Chart Title 2^31 bytes
 2-4K links
 2^31 bytes
 2-4K links
 2^31 bytes
 2-4K links
Automatic Path
Migration
 Yes
 Yes
 Yes
Reliability
 Yes
 No
 Message lost
 Yes
RDMA
 Yes
 RDMA Write
 Yes
No. of
messages
outstanding
3
 UC
 One
Vivek Kashyap, IBM
IPoIB over Connected Mode
Reliable Connected
Unreliable Connected
© 2002 IBM Corporation
IETF-55: IPoIB Connected Mode
IPoIB Subnet
 IPoIB-Connected subnet is formed over a Non-Broadcast Multiple
Access network
IPoIB-UD is over an abstract broadcast link
Node A A
IPa
IPp
IPs
Node D
IPb
IPd
IPq
Node B
IPr IPc
Node C
IB switched network
IPoIB UD broadcast domain
IPoIB Connected network
5
Vivek Kashyap, IBM
IETF-55: IPoIB Connected Mode
Connection Setup: Address Resolution
 The Connection can be setup only when the peer’s GID is known
– Manually configured (or Permanent Virtual Circuit)
• Inverse ARP
– Dynamically resolved
• ARP Server as in Classical IP over ATM (RFC2225)
or
•
IPoIB address resolution
– Use a UD QP
• may reuse IPoIB-UD QP
– Define ARP server link address same as IPoIB-UD Broadcast GID
• FF12:<401B or 601B>:<P_Key>::< 255.255.255.255 or 1>
6
Vivek Kashyap, IBM
IETF-55: IPoIB Connected Mode
Connection Setup: Option 1
 SendFrom IPb to IPa
IPb creates ARP/ND packet with local hardware address of :
[RC|UC]:FFFFFF:GIDb
A bit indicates RC or UC to let the peer know of the transmitter’s view of
the subnet.
IPa receives the packet
Determine PathRecord to GIDb
Send reply over UD-QP with GIDa
IPb now creates the IB connection to IPa
Service ID of:
1:Protocol or FF:0:0:0:0:0:0
– Protocol indicates IPv4 or IPv6
– FF indicates both IPv4 and IPv6 packets will use this connection
– Private Data includes the IP address that was resolved
7
Vivek Kashyap, IBM
IETF-55: IPoIB Connected Mode
Address Resolution: Option 2
 SendFrom IPb to IPa
IPb creates ARP/ND packet with local hardware address of :
[RC|UC]:FFFFFF:GIDb
A bit indicates RC or UC to let the peer know of the transmitter’s view of
the subnet.
IPa receives the packet
Determine PathRecord to GIDb (same as for IPoIB-UD)
Creates QPc of type RC/UC as indicated
– may reuse existing QP
Send reply with QPc and GIDa
IPb now creates the IB connection to the peer
Service ID of:
1:QPc:0:0:0:0:0:0
IPa updates ARP/ND entry with QP value received in IB connection
8
Vivek Kashyap, IBM
IETF-55: IPoIB Connected Mode
Encapsulation
 Connection can carry IPv4 or IPv6 or both packets. Therefore
– Type of packet is known
– Type of packet is distinguishable from the IP header
 Two options:
– IP packets directly encapsulated in the IB connection mode payload
– IP packets encapsulated with the ‘Ethertype’ indication as in IPoIB-UD
 Address resolution over UD QP needs to follow IPoIB-UD
encapsulation
9
Vivek Kashyap, IBM
IETF-55: IPoIB Connected Mode
MTU
 MTU Negotiated per IB connection
Value requested sent in private data of CM REQ message to peer B
If not acceptable counter value in CM REP message to peer A
If not acceptable REJ with desired value in private data to peer B
10
Vivek Kashyap, IBM
IPoIB Connected Mode
Vivek Kashyap
[email protected]
© 2002 IBM Corporation