CMPT 880: Internet Architectures and Protocols
Download
Report
Transcript CMPT 880: Internet Architectures and Protocols
School of Computing Science
Simon Fraser University
CMPT 300: Operating Systems I
Ch 5: CPU Scheduling
Dr. Mohamed Hefeeda
1
Chapter 5: Objectives
Understand
Scheduling Criteria
Scheduling Algorithms
Multiple-Processor Scheduling
2
Basic Concepts
Maximum CPU utilization
obtained with
multiprogramming
CPU–I/O Burst Cycle
Process execution consists of a
cycle of CPU execution and I/O
wait
How long is the CPU burst?
3
CPU Burst Distribution
CPU bursts vary greatly from process to
process and from computer to computer
But, in general, they tend to have the following
distribution (exponential)
Many short
bursts
Few long bursts
4
CPU Scheduler
Selects one process from ready queue to run on
CPU
Scheduling can be
Nonpreemptive
•
Once a process is allocated the CPU, it does not leave
unless:
1. it has to wait, e.g., for I/O request or for a child to
terminate
2. it terminates
Preemptive
•
OS can force (preempt) a process from CPU at anytime
– Say, to allocate CPU to another higher-priority process
Which is harder to implement? and why?
Preemptive is harder: Need to maintain consistency of
•
•
data shared between processes, and more importantly,
kernel data structures (e.g., I/O queues)
– Think of a preemption while kernel is executing a sys
5
Dispatcher
Scheduler: selects one process to run on CPU
Dispatcher: allocates CPU to the selected
process, which involves:
switching context
switching to user mode
jumping to the proper location (in the selected
process) and restarting it
Dispatch latency – time it takes for the dispatcher
to stop one process and start another
How does scheduler select a process to run?
6
Scheduling Criteria
CPU utilization – keep the CPU as busy as possible
Maximize
Throughput – # of processes that complete their execution
per time unit
Maximize
Turnaround time – amount of time to execute a particular
process (time from submission to termination)
Minimize
Waiting time – amount of time a process has been waiting in
ready queue
Minimize
Response time – amount of time it takes from when a
request is submitted until the first response is produced
Minimize
7
Scheduling Algorithms
First Come, First Served
Shortest Job First
Priority
Round Robin
Multilevel queues
Note: A process may have many CPU bursts, but
in the following examples we show only one for
simplicity
8
First-Come, First-Served (FCFS) Scheduling
Process
Burst Time
P1
P2
P3
24
3
3
Suppose that the processes arrive in the order: P1 , P2 , P3
The Gantt Chart for the schedule is:
P1
0
P2
24
P3
27
30
Waiting time for P1 = 0; P2 = 24; P3 = 27
Average waiting time: (0 + 24 + 27)/3 = 17
9
FCFS Scheduling (cont’d)
Suppose that the processes arrive in the order
P2 , P3 , P1
3, 3, 24
The Gantt chart for the schedule is:
P2
0
P3
3
P1
6
30
Waiting time for P1 = 6; P2 = 0; P3 = 3
Average waiting time: (6 + 0 + 3)/3 = 3
Much better than previous case
Convoy effect: short process behind long process
10
Shortest-Job-First (SJF) Scheduling
Associate with each process the length of its
next CPU burst. Use these lengths to schedule
the process with the shortest time
Two schemes:
nonpreemptive – once CPU given to the process it
cannot be preempted until completes its CPU burst
preemptive – if a new process arrives with CPU burst
length less than remaining time of current executing
process, preempt. This scheme is know as the
Shortest-Remaining-Time-First (SRTF)
SJF is optimal – gives minimum average waiting
time for a given set of processes
11
Example of Non-Preemptive SJF
Process Arrival Time Burst Time
P1
P2
P3
P4
0.0
7
2.0
4
4.0
1
5.0
4
SJF (non-preemptive)
P1
0
3
P3
7
P2
8
P4
12
16
Average waiting time = (0 + 6 + 3 + 7)/4 = 4
12
Example of Preemptive SJF
Process Arrival Time Burst Time
P1
P2
P3
P4
0.0
7
2.0
4
4.0
1
5.0
4
SJF (preemptive, SRJF)
P1
0
P2
2
P3
4
P2
5
P4
7
P1
11
16
Average waiting time = (9 + 1 + 0 +2)/4 = 3
13
Estimating Length of Next CPU Burst
Can only estimate the length
Can be done by using the length of previous CPU
bursts, using exponential averaging
1. t n actual lenght of n th CPU burst
2. n 1 predicted value for the next CPU burst
3. , 0 1
4. Define :
n 1 t n 1 n
14
Exponential Averaging
If we expand the formula, we get:
n+1 = tn + (1 - ) tn -1 + (1 - )2 tn -2 + … + (1 - )n +1 0
Examples:
= 0 ==> n+1 = n ==> Last CPU burst does not count
(transient value)
=1 ==> n+1 = tn ==> Only last CPU burst counts (history
is stale)
15
Prediction of CPU Burst Lengths: Expo
Average
Assume = 0.5, 0 = 10
16
Priority Scheduling
A priority number (integer) is associated with each process
CPU is allocated to the process with the highest priority
(smallest integer highest priority)
Preemptive
nonpreemptive
SJF is a priority scheduling where priority is the predicted
next CPU burst time
Problem: Starvation – low priority processes may never
execute
Solution?
Aging: Increase the priority of a process as it waits in the
system
17
Round Robin (RR)
Each process gets a small unit of CPU time
(time quantum), usually 10-100 milliseconds
After this time elapses, the process is
preempted and added to end of ready queue
If there are n processes in ready queue and
time quantum is q, then each process gets 1/n
of the CPU time in chunks of at most q time
units at once
No process waits more than (n-1)q time units
Performance
q large FCFS
q small q must be large with respect to context
switch, otherwise overhead is too high
18
Example of RR with Time Quantum = 20
Process
Burst Time
53
17
68
24
P1
P2
P3
P4
The Gantt chart is:
P1
0
P2
20
37
P3
P4
57
P1
77
P3
97 117
P4
P1
P3
P3
121 134 154 162
Typically, higher average turnaround than SJF,
but better response
19
Time Quantum and Context Switch Time
Smaller q more responsive but more context
switches (overhead)
20
Turnaround Time Varies With Time Quantum
Turnaround time varies with quantum, then
stabilizes
Rule of thumb for good performance:
80% of CPU bursts should be shorter than time quantum
21
Multilevel Queue
Ready queue is partitioned into separate queues:
foreground (interactive)
background (batch)
Each queue has its own scheduling algorithm
foreground – RR
background – FCFS
Scheduling must be done between queues
Fixed priority scheduling; (i.e., serve all from
foreground then from background). Possibility of
starvation
Time slice – each queue gets a certain amount of CPU
time which it can schedule amongst its processes; e.g.,
• 80% to foreground in RR
• 20% to background in FCFS
22
Multilevel Queue Scheduling
23
Multilevel Feedback Queue
A process can move between various queues
aging can be implemented this way
Multilevel-feedback-queue scheduler defined by
the following parameters:
number of queues
scheduling algorithms for each queue
method to determine when to upgrade a process
method to determine when to demote a process
method to determine which queue a process will
enter when that process needs service
24
Example of Multilevel Feedback Queue
Three queues:
Q0 – RR with time quantum 8 milliseconds
Q1 – RR time quantum 16 milliseconds
Q2 – FCFS
Scheduling
A new job enters queue Q0 which is served FCFS.
When it gains CPU, job receives 8 milliseconds. If it
does not finish in 8 milliseconds, job is moved to
queue Q1.
At Q1 job is again served FCFS and receives 16
additional milliseconds. If it still does not complete, it
is preempted and moved to queue Q2
25
Multilevel Feedback Queues
Notes:
Short processes get served faster (higher prio) more
responsive
Long processes (CPU bound) sink to bottom served
FCFS more throughput
26
Multiple-Processor Scheduling
Multiple processors ==> divide load among them
More complex than single CPU scheduling
How to divide load?
Asymmetric multiprocessor
• One master processor does the scheduling for others
Symmetric multiprocessor (SMP)
• Each processor runs its own scheduler
• One common ready queue for all processors, or one ready
queue for each
• Win XP, Linux, Solaris, Mac OS X support SMP
27
SMP Issues
Processor affinity
When a process runs on a processor, some data is cached in
that processor’s cache
Process migrates to another processor ==>
• Cache of new processor has to be re-populated
• Cache of old processor has to be invalidated ==>
• Performance penalty
Load balancing
One processor has too much load and another is idle
Balance load using
• Push migration: A specific task periodically checks load on
all processors and evenly distributes it by moving (pushing)
tasks
• Pull migration: Idle processor pulls a waiting task from a
busy processor
• Some systems (e.g., Linux) implement both
28
SMP Issues (cont’d)
Tradeoff between load balancing and processor
affinity: what would you do?
May be, invoke load balancer when imbalance
exceeds threshold
29
Real-time Scheduling
Hard-real time systems
A task must be finished within a deadline
Ex: Control of spacecraft
Soft-real time systems
A task is given higher priority over others
Ex: Multimedia systems
30
Operating System Examples
Windows XP scheduling
Linux scheduling
31
Windows XP Scheduler
Priority-based, preemptive scheduler
The highest-priority thread will always run
32 levels of priorities, each has a separate queue
Scheduler traverses queues from highest to
lowest till it finds a thread that is ready to run
Priorities are divided into classes:
Real-time class (fixed): Levels 16 to 31
Other classes (variable): Levels 1 to 15
• Priority may change (decrease or increase)
32
Windows XP Scheduler (cont’d)
Win 32 API defines
several priority classes, and
within each class several relative priorities
Priority class
33
Windows XP Scheduler (cont’d)
Processes are typically created as members
NORMAL_PRIORITY_CLASS
It gets base (NORMAL) priority of that class
Priority decreases
After thread’s quantum time runs out
• but never goes below the base (normal) value of its class
Limit CPU consumption of CPU-bound threads
Priority increases
After a thread is released from a wait operation
Bigger increase if thread was waiting for mouse or
keyboard
Moderate increase if it was waiting for disk
Also, active window gets a priority boost
Yield good response time
34
Linux Scheduler
Priority-based, preemptive scheduler with two separate ranges
Real-time: 0 to 99
Nice: 100 to 140
Higher priority tasks get larger quanta (unlike Win XP,
Solaris)
35
Linux Scheduler (cont’d)
A task is initially assigned a time slice (quantum)
Runqueue has two arrays: active and expired
A runnable task is eligible for CPU if it has time left in its time slice
If time slice runs out, the task is moved to the expired array
Priority increase/decrease may occur before adding to expired
array
When there are no tasks in the active array, the expired array
becomes the active array and vice versa (change of pointers)
36
Algorithm Evaluation
Deterministic modeling
Takes a particular predetermined workload and defines
the performance of each algorithm for that workload
Not general
Queuing models
Use queuing theory to analyze algorithms
Many (unrealistic) assumptions to facilitate analysis
Simulation
Build a simulator and test
• synthetic workload (e.g., generated randomly), or
• Traces collected from running systems
Implementation
Code it up and test!
37
Summary
Process execution: cycle of CPU bursts and I/O bursts
CPU bursts lengths: many short bursts, and few long
ones
Scheduler selects one process from ready queue
Dispatcher performs the switching
Scheduling criteria (usually conflicting)
CPU utilization, waiting time, response time, throughput,
…
Scheduling Algorithms
FCFS, SJF, Priority, RR, Multilevel Queues, …
Multiprocessor Scheduling
Processor affinity vs. load balancing
Evaluation of Algorithms
Modeling, simulation, implementation
Examples: Win XP, Linux
38