Scribes: Lei Jin, Vasishta Jayanti, Soma Putera
This technique can fail when newly forked processes are placed at the start of the queue. One possible fix is to limit how many processes may be forked, but that fails if the new processes fork and then exit immediately. The simplest solution is to put the newly forked process at the end of the queue. As a side note, forking fails when the queue is full.
First, the lower the priority number, the higher the process sits in the queue: a priority-1 job runs ahead of a priority-2 job. In Unix, priority is expressed as niceness: a niceness-1 job has higher priority than a niceness-2 job. The default niceness for most processes is 0, but some processes can have a default niceness below 0, for example an X server that interacts with the user. A process with high niceness would be a background computation, like UFO searching. "Mean" programs take a negative niceness, i.e., very high priority; examples are music programs like iTunes, or really any program that interacts with the user.
Actual priority: a combination of external (niceness) and internal priority. In Bash, `ps -l` shows the niceness in the NI column and the actual priority in the PRI column.
#include <sys/resource.h>
#include <unistd.h>

int main(int argc, char *argv[])
{
    setpriority(PRIO_PROCESS, 0, 10); /* raise our niceness to 10 */
    execvp(argv[1], &argv[1]);
    return 1; /* only reached if execvp fails */
}
The whole point of distinguishing different types of users is to apply different scheduling algorithms to different user sets, with each user type placed in a separate queue. This idea is not popular on a single-CPU machine, since one CPU cannot be partitioned this way, but on a 1000-node cluster we need this queue structure to organize priorities. It is the operator's job to control which job gets to execute and when, moving a job from one queue to another.
There are two major types of real time schedulers.
First: hard real time scheduler: every job arrives with a deadline, and every deadline must be met. For example, a program that must shut down a nuclear power plant within the next 10 ms needs a hard real time scheduler. Here predictability trumps performance.
Second: soft real time scheduler: some deadlines can be missed, at a cost. For example, a media player playing a huge video uses a soft real time scheduler; under load it skips frames (the cost), which is 'ok'. To implement this scheduler, we run the job with the soonest deadline first. If a job's deadline cannot be met, we discard that job and work on the next soonest one.
The most common error in synchronization is a race condition. The expression comes from racing cars: each fast car takes its own path, and when the paths of two cars cross, they crash and explode.
Coding example: let's say we have a shared bank account, with one person depositing into the account while another withdraws money from it. A short code sample:
#include <stdbool.h>

long balance; /* shared account balance, in pennies */

void deposit(unsigned int amt)
{
    /* should check for overflow here */
    balance += amt;
}

bool withdraw(unsigned int amt)
{
    if (amt <= balance) { /* if we do not check, balance can go negative */
        balance -= amt;
        return true;
    }
    return false;
}
In x86, each of these one-line statements compiles down to three instructions:

Deposit:              Withdraw:
  ld  balance, r2       ld  balance, r2
  add r2, r1, r2        sub r2, r1, r2
  st  r2, balance       st  r2, balance
A race condition happens with this interleaving (the account holds 10000 pennies, i.e., $100.00; B deposits $10 while A withdraws $50):

1. B loads the balance: r2 = 100.00
2. Scheduler switches to process A (B's state is saved): A loads r2 = 100.00
3. A computes r2 = 50.00
4. A stores balance = 50.00
5. A is finished; back to B: B computes r2 = 110.00
6. B stores balance = 110.00

End result? The $50 withdrawal vanished (food for free), while the deposit of $10 worked.
Only observable events, meaning what the high-level code can see, count toward correctness. If things get interleaved at the low level, that is acceptable as long as we can later account for it with high-level checks. If we can explain the colliding executions using events that do not collide and still get the same observable results, then the executions are serializable, and serialization is all we need.
Question: we can solve the bank account problem above using observability or serialization, but what about the harder cases? What if we also have a function that can transfer money between different accounts? Now any combination of two or three functions can cause a race condition. How do we solve this? We can use locks, which may work for a small number of accounts, but what if we are doing an audit and need to look at 10 or more accounts? Should we grab locks on 10 or even 20 accounts? When solving the easy case we should keep the hard case in mind.
Partial Answers: