Transcript chapter1
Client/Server Networking
Protocol Stack Summary
“turtles all the way down”
each layer uses the services of the layer below and provides a
service to the layer above
Python lets you work at the layer of your choice
programs are “cleaner” the higher the layer you use
layers work by “hiding” the layers below behind function calls
What lies under Socket?
TCP/UDP
Internet Protocol Stack
IP
“link layer”
Networking
About “sharing” resources.
Compare to sharing of disk, IO devices, etc done by programs
running on a computer
Computer Example: OS is the master controller
Network Example: Each participant “plays by the rules” but no
“controller”
Networking
All network cards share the same ethernet cable
all wireless transmitters share the same frequency channels
fundamental unit of sharing is “packet”
individual packets carry addressing info sufficient to arrive at
final destination.
Addressing
two layers: one hop at a time and end-to-end
single hop addressing performed by link layer
end-to-end addressing is IP
process addressing is TCP or UDP
IP Addresses
a.b.c.d: 0 <= a,b,c,d <= 255
New Paltz: 137.140.*.*
localhost: 127.0.0.1
private networks: 10.*.*.*, 172.16-31.*.*, 192.168.*.*
Domain Name Service
DNS converts host names into host IP addresses.
corresponds to directory assistance
address = socket.gethostbyname(name);
How DNS Works
gethostbyname() first looks in /etc/hosts
if this fails then it looks in /etc/resolv.conf for the address of
“directory assistance”, also called a DNS Server.
sends the request to this address
Observation: If your DNS server is down, you won't get
anywhere on the Internet.
Routing
Each time a packet arrives at a new node a decision must be
made at that node as to where to send the packet next.
Guiding principle of routing on the Internet is that each time a
packet “hops” from one node to another it is always one hop
closer to its final destination.
Exercise: Difference between host and node.
Lots of Reading
The classic text is TCP/IP Illustrated: Vol I by Richard Stevens.
PDF file available on the web at books.google.com among
other places
We will concentrate on Chapters 1-4, 9, 11, 14, 17-19.
EXAMPLE:
Given:
207 N. Defiance St, Archbold, OH
Find longitude and latitude
Protocol Stack:
GoogleMaps
?????
TCP
IP
Ethernet
GoogleMaps
googlemaps library (3rd party) uses
urllib, uses
httplib, uses
Socket, uses
TCP, IP,
Ethernet
GoogleMaps
URL
protocol stack
inside the actual
program itself
HTTP
Socket
TCP, IP and
Ethernet make up
the OS part of the
protocol stack
APIs vs Sockets:
well-tested
written by experts
common practice to use them
we still need to understand Sockets to
appreciate things that depend upon them
Wireshark:
lets you look at packets crossing the wire
needs root permissions
easy to filter out unneeded traffic
I saved some traffic and you can view it with Wireshark (see
course web page).
Highest Level API Example:
Fetch a JSON document without realizing it:
#!/usr/bin/env python
# Foundations of Python Network Programming - Chapter 1 - search1.py
# Not even clear you are using a web service
from googlemaps import GoogleMaps
address = '207 N. Defiance St, Archbold, OH'
print GoogleMaps().address_to_latlng(address)
GET Syntax:
#!/usr/bin/env python
# Foundations of Python Network Programming - Chapter 1 - search2.py
# HTML-level abstraction
import urllib, urllib2
try:
import json
except ImportError: # for Python 2.5
import simplejson as json
params = {'q': '207 N. Defiance St, Archbold, OH',
'output': 'json', 'oe': 'utf8'}
url = 'http://maps.google.com/maps/geo?' + urllib.urlencode(params)
rawreply = urllib2.urlopen(url).read()
reply = json.loads(rawreply)
print reply['Placemark'][0]['Point']['coordinates'][:-1]
GET Syntax:
#!/usr/bin/env python
# Foundations of Python Network Programming - Chapter 1 - search3.py
# HTTP level abstraction
import httplib
try:
import json
# json built in with Python 2.6
except ImportError: # for Python 2.5
import simplejson as json
path = ('/maps/geo?q=207+N.+Defiance+St%2C+Archbold%2C+OH'
'&output=json&oe=utf8')
connection = httplib.HTTPConnection('maps.google.com')
connection.request('GET', path)
rawreply = connection.getresponse().read()
reply = json.loads(rawreply)
print reply['Placemark'][0]['Point']['coordinates'][:-1]
GET Syntax:
#!/usr/bin/env python
# Foundations of Python Network Programming - Chapter 1 - search4.py
import socket
sock = socket.socket()
# OS functionality
sock.connect(('maps.google.com', 80))
sock.sendall(
'GET /maps/geo?q=207+N.+Defiance+St%2C+Archbold%2C+OH'
'&output=json&oe=utf8&sensor=false HTTP/1.1\r\n'
'Host: maps.google.com:80\r\n'
'User-Agent: search4.py\r\n'
'Connection: close\r\n'
'\r\n')
rawreply = sock.recv(4096)
print rawreply
GET Syntax:
# search4.py output
HTTP/1.1 200 OK
Content-Type: text/javascript; charset=UTF-8
Vary: Accept-Language
Date: Wed, 21 Jul 2010 16:10:38 GMT
Server: mafe
Cache-Control: private, x-gzip-ok=""
X-XSS-Protection: 1; mode=block
Connection: close
data transmitted by
web server
{
"name": "207 N. Defiance St, Archbold, OH",
"Status": {
"code": 200,
"request": "geocode"
},
"Placemark": [ {
...
"Point": {
"coordinates": [ -84.3063479, 41.5228242, 0 ]
}
}]
}
data read into
program variable
Things We've Seen:
protocols stacked on top of one another
higher level protocols using services of lower levels
programs get more specific and harder to maintain the lower
down you go
the idea behind high-level protocols is precisely to hide lower
levels
there's a whole lot going on below Socket.
The Stack:
Fundamental unit of shared information is the packet.
Typical packet structure:
ethernet
header
IP
header
TCP/UDP
header
Transmitted as a single unit (but serially)
Routing is generally at the packet level
program
data
Things packets contain: data, addresses, layering,
sequencing, protocol bytes, checksums
ethernet packets are called frames.
Ethernet:
14-byte header
addresses: two 6-byte addresses – source and destination
type: 2 bytes – 0800 == IP datagram
the two network cards involved can process the header
without using the CPU, RAM, etc.
cable length (100m) and MTU
CSMA/CD
Some of the details:
http://serverfault.com/questions/422158/
what-is-the-in-the-wire-size-of-a-ethernet-frame-1518-or-1542
IP Addresses:
32 bits: a.b.c.d
network address – n bits; host id – (32-n) bits
some times the network part has a subnet component; some
times the subnet component is carved out of the hostID bits
a.b == 137.140 == New Paltz network address
a.b.c == 137.140.8 == CS subnet at New Paltz
the part of the network address that is not subnet identifies an
organization like New Paltz.
IP Address Classes:
IP Special Addresses:
127.*.*.*: local to the current machine
10.*.*.*, 172.16-31.*.*, 192.168.*.*: private subnets.
none of these address found on the larger Internet.
IP Routing:
Guiding principle: after each hop you are one step closer to
your destination
typical local routing table contains a default entry pointing to
the Internet together with one entry for each local subnet the
host belongs to.
[pletcha@archimedes PPT]$ netstat -nr
Kernel IP routing table
Destination
Gateway
Genmask
Flags
0.0.0.0
192.168.1.1 0.0.0.0
UG
192.168.1.0
0.0.0.0
255.255.255.0 U
192.168.122.0 0.0.0.0
255.255.255.0 U
Iface
wlan0
wlan0
virbr0
IP Routing Next Hop Algorithm:
Search Destination column of table entries with H-flag set
which is an exact match to Destination IP in packet
If found and Flag is G or H then Gateway is next hop;
otherwise Destination IP is next hop.
If not found then calculate Dest IP && Genmask for each entry
that is not the default. If Dest IP && Genmask == Destination
column entry then if Flag is G or H then Gateway is next hop;
otherwise Destination IP is next hop.
Otherwise use the default entry. Flag is almost always G so
Gateway is next hop IP.
IP Routing Next Hop Algorithm:
Once you have the next hop IP you need to determine the
next hop ethernet.
The Address Resolution Protocol (ARP) converts the next hop
IP into a next hop ethernet. More recently replaced by the ip
neigh command
Exercise: Read up on ARP in TCP/IP Illustrated.
[pletcha@archimedes PPT]$ ip neigh
137.140.39.139 dev enp0s25 lladdr 00:c0:17:c2:14:f3 STALE
137.140.193.250 dev wlp3s0 lladdr 00:1f:29:07:e4:6a STALE
137.140.39.250 dev enp0s25 lladdr 00:21:a0:39:65:00 DELAY
ARP Example
●
From my laptop (137.140.8.104) I try to locate joyous
(137.140.8.101)
[pletcha@archimedes PPT]$ ping 137.140.8.101
PING 137.140.8.101 (137.140.8.101) 56(84) bytes of data.
64 bytes from 137.140.8.101: icmp_seq=1 ttl=64 time=0.266 ms
^C
●
Because of my routing table I know it is locally connected so
137.140.8.101 is “next hop”.
[pletcha@archimedes PPT]$ netstat -nr
Kernel IP routing table
Destination Gateway
Genmask
Flags MSS Window irtt Iface
0.0.0.0
137.140.8.250 0.0.0.0
UG
00
0 enp0s25
137.140.8.0 0.0.0.0
255.255.255.0 U
00
0 enp0s25
137.140.192.0 0.0.0.0
255.255.254.0 U
00
0 wlp3s0
ARP Request
ARP Reply
Packet Fragmentation:
The Internet Protocol Suite supports 64k packets but specific
IP networks support much smaller packets.
Ethernet networks support 1500 byte packets.
IP headers contain a Don't Fragment (DF) flag, set by sender.
–
DF not set, then a router can fragment a packet too
large to be forwarded on a particular interface.
–
DF set, router sends an ICMP message to original
sender so sender can fragment the original message
and try again.
UDP: DF unset by OS
TCP: DF set by OS
Packet Fragmentation (continued):
Each subnet has an MTU – Maximum Transmission Unit.
Path MTU = min hop MTU over all hops in a path
DSL providers make MTU = 1492.
–
Initially many service providers used MTU = 1500 and
disabled ICMP so never knew their “large” traffic was
being dropped.
TCP/IP Illustrated discusses how fragmentation actually
happens (Read Section 11.5).
TCP/IP Illustrated:
Pages to Look at
25, 38, 43, 44, 48, 58, 61, 63