Everything Else You Should Know

Download Report

Transcript Everything Else You Should Know

Lecture 23:
Everything
Else You Should
Know (but won’t see
on Exam 2)
CS201j: Engineering Software
University of Virginia
Computer Science
David Evans
http://www.cs.virginia.edu/evans
Menu
• Course Evaluations
• 101 Things Every Computer Scientist
should know
• Course Charge
20 November 2003
CS 201J Fall 2003
2
Course Pledge You Signed
I will provide useful feedback. I realize this
is an experimental course and it is important
that I let the course staff know what they need
to improve the course. I will not wait until the
end of the course to make the course staff
aware of any problems. I will provide feedback
either anonymously (using the course feedback
form) or by contacting the course staff directly.
I will fill out all course evaluation surveys
honestly and thoroughly.
20 November 2003
CS 201J Fall 2003
3
Two Surveys Required
• Official SEAS Survey
– You receive email about it
– Administrators read it to determine whether or not
to fire me
• CS201J Specific Survey: handed out today
– Specific questions to help improve future editions
of CS201
• I do read all of the surveys completely
• Department head and curriculum committee
will read SEAS survey
20 November 2003
CS 201J Fall 2003
4
101 Questions
000 What is Computer Science?
001 What problem did the first electronic programmable
computer solve?
010 What are the world’s most complex programs?
011 How do Computer Scientists manage complexity?
100 What is and is not computable?
101 Who invented the Internet?
20 November 2003
CS 201J Fall 2003
5
0. What is
Computer Science?
20 November 2003
CS 201J Fall 2003
6
Let AB and CD be the two given numbers
not relatively prime. It is required to find the
greatest common measure of AB and CD.
If now CD measures AB, since it also
measures itself, then CD is a common
measure of CD and AB. And it is manifest
that it is also the greatest, for no greater
number than CD measures CD.
Euclid’s Elements, Book VII, Proposition 2 (300BC)
20 November 2003
CS 201J Fall 2003
7
The note on the inflected line is
only difficult to you, because it is so
easy. There is in fact nothing in it, but
you think there must be some grand
mystery hidden under that word
inflected!
Whenever from any point without
a given line, you draw a long to any
point in the given line, you have
inflected a line upon a given line.
Ada Byron (age 19), letter to Annabella
Acheson (explaining Euclid), 1834
20 November 2003
CS 201J Fall 2003
8
What is the
difference
between
Euclid and
Ada?
“It depends on what your
definition of ‘is’ is.”
Bill Gates (at Microsoft’s antitrust trial)
20 November 2003
CS 201J Fall 2003
9
Geometry vs. Computer Science
• Geometry (mathematics) is about
declarative knowledge: “what is”
• Computer Science is about
imperative knowledge: “how to”
– Ways of describing imperative
Language
processes (computations)
– Ways of reasoning about (predicting)
Logic
what imperative processes will do
20 November 2003
CS 201J Fall 2003
10
Was CS201J a Computer
Science Course?
• No!
– Only about 30% of CS201J is computer
science.
– Most of it is software engineering.
If you want to take a Computer Science course,
Consider taking CS200 in the Spring
20 November 2003
CS 201J Fall 2003
11
1. What problem did the first
electronic programmable
computer solve?
20 November 2003
CS 201J Fall 2003
12
Colossus
First Programmable Computer
• Bletchley Park, 1943
• Designed by Tommy
Flowers
• 10 Colossi in operation at
end of WWII
• Destroyed in 1960, kept
secret until 1970s
• 2 years before ENIAC –
calculating artillery tables
20 November 2003
CS 201J Fall 2003
13
Colossus’ Problem
• Decode Nazi high
command messages
from Lorenz Machine
• XOR encoding:
Ci = Mi  Ki
– Perfect cipher, if K is
random and secret
20 November 2003
CS 201J Fall 2003
14
Why perfectly secure?
For any given ciphertext, all plaintexts are
equally possible.
Ciphertext:
Key:
Plaintext:
20 November 2003
0100111110101
1
1100000100110
B
1000111010011
= “CS”
0
CS 201J Fall 2003
15
Breaking Lorenz
• Operator and receiver need
same keys
• Generate key bits using rotor
machine, start with same
configuration
• One operator retransmitted a message (but
abbreviated message header the second
time!)
• Enough for Bletchley Park to figure out key –
and structure of machine that generated it!
• But still had to try all configurations
20 November 2003
CS 201J Fall 2003
16
Colossus
• Read ciphertext and Lorenz wheel
patterns from tapes
• Tried each alignment, calculated
correlation with German
• Decoded messages (63M letters by 10
Colossus machines) that enabled Allies
to know German troop locations to plan
D-Day
20 November 2003
CS 201J Fall 2003
17
2. What are the world’s most
complex programs?
20 November 2003
CS 201J Fall 2003
18
Complex Programs
• Apollo Guidance Software
– ~36K instructions
• F-22 Steath Fighter Avionics Software
– 1.5M lines of code (Ada)
• 5EEE (phone switching software)
– 18M lines
• Windows XP
– ~50M lines (1 error per kloc ~ 50,000 bugs)
• Anything more complex?
20 November 2003
CS 201J Fall 2003
19
Human Genome
Produces
60 Trillion Cells (6 * 1013)
50 Million die every
second!
20 November 2003
CS 201J Fall 2003
20
DNA
• Sequence of
nucleotides: adenine
(A), guanine (G),
cytosine (C), and
thymine (T)
• Two strands, A must
attach to T and G must
attach to C
20 November 2003
CS 201J Fall 2003
G
C
T
A
21
Codons
• Three nucleotides
encode an amino acid
• Sequence of amino
acids makes a protein
• But, there are only 20
amino acids, so there
may be several
different ways to
encode the same one
From http://web.mit.edu/esgbio/www/dogma/dogma.html
20 November 2003
CS 201J Fall 2003
22
Central Dogma of Biology
(Crick, 1957)
Translation
Transcription
DNA
RNA
Protein
Image from http://www.umich.edu/~protein/
• DNA makes RNA
• RNA makes proteins
• Proteins make us
20 November 2003
CS 201J Fall 2003
23
Shortest (Known) Life Program
• Nanoarchaeum equitans
– 490,885 bases (522 genes)
= 490,885 * ¼ * 21/64 = 40,268 bytes
– Parasite: no metabolic capacity,
must steal from host
– Complete components for information processing:
transcription, replication, enzymes for DNA repair
http://www.mediscover.net/Extremophiles.cfm
KO Stetter and Dr Rachel Reinhard
• Size of compiling C++ “Hello World”:
Windows (bcc32): 112,640 bytes
Linux (g++):
11,358 bytes
20 November 2003
CS 201J Fall 2003
24
The Make-Human Program
• 3 Billion Base Pairs
– Each nucleotide is 2 bits (4 possibilities)
– 3B bases * 1 byte/4 pairs = 750 MB CD = 650 MB
– Highly redundant encoding (21/64) ~ 250 MB
• Only 5% is transcribed (exons) ~ 12 MB
– 95% junk (intons): genomes from viruses
reverse transcribed into human genome, but
inactive
Wal-Mart’s database
is 280 Terabytes
20 November 2003
CS 201J Fall 2003
25
Expressiveness of DNA
• Genetic sequence for 2 humans differs
in only 2 million bases
– 4 million bits = 0.5 MB
1/3 of a floppy disk
<1% of Windows 2000
20 November 2003
CS 201J Fall 2003
26
3. How do Computer
Scientists manage
complexity?
20 November 2003
CS 201J Fall 2003
27
Abstraction
Adapted from Gerard Holzmann’s FSE Slides
20 November 2003
CS 201J Fall 2003
28
Abstraction in CS201J
• Abstraction by Specification
– Abstract away how details by saying what a procedure
does
• Data Abstraction
– Abstract away representation details by specifying
what you can do with something
• Subtyping
– Abstract away actual type details by allowing many
types to be used in the same way (?)
• Concurrency
– Abstract away when details (?)
20 November 2003
CS 201J Fall 2003
29
4. What is and is not
computable?
20 November 2003
CS 201J Fall 2003
30
Halting Problem
Input: a procedure P
Output: true if P halts (finishes
execution), false otherwise.
Is it possible it implement a procedure that
correctly implements halts and always
terminates?
20 November 2003
CS 201J Fall 2003
31
Halts is not Computable
boolean contradictHalts (Program P)
if (halts “contradictHalts (P);”)
while (true) ;
else
return true;
If contradictHalts halts, the if test is true if enters
the while loop - it doesn’t halt!
If contradictHalts doesn’t halt, the if test if false,
and it evaluates to true. It halts!
20 November 2003
CS 201J Fall 2003
32
Learned Discussion on
Computability
(Video)
Ali G Multiplication Problem
• Input: a list of n numbers
• Output: the product of all the numbers
Is it computable?
Yes – a straightforward algorithm
solves it.
Can real computers solve it?
20 November 2003
CS 201J Fall 2003
34
Ali G was Right!
• Theory assumes ideal computers:
– Unlimited memory
– Unlimited power
– Unlimited (finite) time
• Real computers have:
– Limited memory, time, power outages, flaky
programming languages, etc.
– There are many decidable problems we cannot
solve with real computer: the numbers do matter
20 November 2003
CS 201J Fall 2003
36
5. Who Invented the Internet?
skip
20 November 2003
CS 201J Fall 2003
37
What is a Network?
A group of three or more connected
entities communicating indirectly
Ancient Greeks had beacon chain
networks on Greek island mountaintops
20 November 2003
CS 201J Fall 2003
38
Chappe’s Semaphore Network
First Line (Paris to Lille), 1794
20 November 2003
Mobile Semaphore Telegraph
Used in the Crimean War 1853-1856
CS 201J Fall 2003
39
internetwork
A collection of multiple networks
connected together, so messages can be
transmitted between nodes on different
networks.
20 November 2003
CS 201J Fall 2003
40
The First Internetwork
• 1800: Sweden and Denmark worried about
Britain invading
• Edelcrantz proposes link across strait
separating Sweden and Denmark to connect
their (signaling) telegraph networks
• 1801: British attack Copenhagen, transmit
message to Sweden, but they don’t help.
• Denmark signs treaty with Britain, and stops
communications with Sweden
20 November 2003
CS 201J Fall 2003
41
First Use of The Internet
• October 1969: First packets on the
ARPANet from UCLA to Stanford.
Starts to send "LOGIN", but it crashes
on the G.
• 20 July 1969:
Live video (b/w) and
audio transmitted from
moon to Earth, and to
several hundred million
televisions worldwide.
20 November 2003
CS 201J Fall 2003
42
Licklider and Taylor’s Vision
Available within the network will be functions and services to
which you subscribe on a regular basis and others that you
call for when you need them. In the former group will be
investment guidance, tax counseling, selective dissemination
of information in your field of specialization, announcement of
cultural, sport, and entertainment events that fit your
interests, etc. In the latter group will be dictionaries,
encyclopedias, indexes, catalogues, editing programs,
teaching programs, testing programs, programming systems,
data bases, and – most important – communication, display,
and modeling programs. All these will be – at some late
date in the history of networking - systematized and
coherent; you will be able to get along in one basic
language up to the point at which you choose a
specialized language for its power or terseness.
20 November 2003
J. C. R. Licklider and Robert W. Taylor, The Computer
as a Communication Device, April 1968
CS 201J Fall 2003
43
The Modern Internet
• Packet Switching: Leonard Kleinrock (UCLA),
Donald Davies and Paul Baran, Edelcrantz’s
signaling network (1809)
• Internet Protocol: Vint Cerf, Bob Kahn
• Vision, Funding: J.C.R. Licklider, Bob Taylor
• Government: Al Gore (first politician to promote
Internet, 1986; act to connect government networks
to form “Interagency Network”)
20 November 2003
CS 201J Fall 2003
44
Charge 1
• Exam 2
– Out today
– Due Friday at 3:55PM
– Turn in to folder outside my office
20 November 2003
CS 201J Fall 2003
45
Charge 2
• Course Evaluations
– SEAS Survey: follow the email instructions
– Course Specific survey
• Handed out today
• Turn in to folder outside my office when you turn in
your exam
20 November 2003
CS 201J Fall 2003
46
Charge 3
Change the
World!
Caveat: before worrying about changing the
world, make sure you turn in your exam Friday!
20 November 2003
CS 201J Fall 2003
47
Small, Simple Programs
that Changed the World
eBay (P. Omidyar)
WorldWideWeb
VisiCalc
(Tim Berners-Lee)
(Dan Bricklin and Bob Frankston)
Smalltalk
(Adele Goldberg, Alan
Kay, Dan Ignalls)
Altair BASIC
(Bill Gates and Paul Allen)
Napster
(Shawn Fanning
and Sean Parker)
Tetris
(Alexey Pazhitnov)
20 November 2003
CS 201J Fall 2003
?
(CS201J Student)
48
Lawn Lighting:
7pm Thursday
Never doubt that a small group of
thoughtful, committed people can
change the world.
Indeed, it is the only thing that ever has.
Margaret Mead