Today we’ll first look at a longer example program, starting with some C code and translating it into our assembly language from last time.

Next we discuss some alternative instruction set architectures.
- There are different ways of specifying memory addresses.
- Instructions can also have different numbers and types of operands.

We will use last week’s datapath and instruction set for the rest of the lectures and the last machine problem, but it’s important to see some other possible designs.
Representing characters

- All computer data must be stored in a numeric (binary) format—including alphabetic characters!
- Each character can be represented by a one-byte **ASCII code**. The codes for some letters, digits and symbols are shown below.

<table>
<thead>
<tr>
<th>ASCII Code</th>
<th>Character</th>
</tr>
</thead>
<tbody>
<tr>
<td>32</td>
<td>space</td>
</tr>
<tr>
<td>33</td>
<td>!</td>
</tr>
<tr>
<td>34</td>
<td>&quot;</td>
</tr>
<tr>
<td>35</td>
<td>#</td>
</tr>
<tr>
<td>36</td>
<td>$</td>
</tr>
<tr>
<td>37</td>
<td>%</td>
</tr>
<tr>
<td>38</td>
<td>&amp;</td>
</tr>
<tr>
<td>39</td>
<td>'</td>
</tr>
<tr>
<td>40</td>
<td>(</td>
</tr>
<tr>
<td>41</td>
<td>)</td>
</tr>
<tr>
<td>42</td>
<td>*</td>
</tr>
<tr>
<td>43</td>
<td>+</td>
</tr>
<tr>
<td>44</td>
<td>,</td>
</tr>
<tr>
<td>45</td>
<td>-</td>
</tr>
<tr>
<td>46</td>
<td>.</td>
</tr>
<tr>
<td>47</td>
<td>/</td>
</tr>
<tr>
<td>48</td>
<td>0</td>
</tr>
<tr>
<td>49</td>
<td>1</td>
</tr>
<tr>
<td>50</td>
<td>2</td>
</tr>
<tr>
<td>51</td>
<td>3</td>
</tr>
<tr>
<td>52</td>
<td>4</td>
</tr>
<tr>
<td>53</td>
<td>5</td>
</tr>
<tr>
<td>54</td>
<td>6</td>
</tr>
<tr>
<td>55</td>
<td>7</td>
</tr>
<tr>
<td>56</td>
<td>8</td>
</tr>
<tr>
<td>57</td>
<td>9</td>
</tr>
<tr>
<td>58</td>
<td>:</td>
</tr>
<tr>
<td>59</td>
<td>;</td>
</tr>
<tr>
<td>60</td>
<td>&lt;</td>
</tr>
<tr>
<td>61</td>
<td>=</td>
</tr>
<tr>
<td>62</td>
<td>&gt;</td>
</tr>
<tr>
<td>63</td>
<td>?</td>
</tr>
<tr>
<td>64</td>
<td>@</td>
</tr>
<tr>
<td>65</td>
<td>A</td>
</tr>
<tr>
<td>66</td>
<td>B</td>
</tr>
<tr>
<td>67</td>
<td>C</td>
</tr>
<tr>
<td>68</td>
<td>D</td>
</tr>
<tr>
<td>69</td>
<td>E</td>
</tr>
<tr>
<td>70</td>
<td>F</td>
</tr>
<tr>
<td>71</td>
<td>G</td>
</tr>
<tr>
<td>72</td>
<td>H</td>
</tr>
<tr>
<td>73</td>
<td>I</td>
</tr>
<tr>
<td>74</td>
<td>J</td>
</tr>
<tr>
<td>75</td>
<td>K</td>
</tr>
<tr>
<td>76</td>
<td>L</td>
</tr>
<tr>
<td>77</td>
<td>M</td>
</tr>
<tr>
<td>78</td>
<td>N</td>
</tr>
<tr>
<td>79</td>
<td>O</td>
</tr>
<tr>
<td>80</td>
<td>P</td>
</tr>
<tr>
<td>81</td>
<td>Q</td>
</tr>
<tr>
<td>82</td>
<td>R</td>
</tr>
<tr>
<td>83</td>
<td>S</td>
</tr>
<tr>
<td>84</td>
<td>T</td>
</tr>
<tr>
<td>85</td>
<td>U</td>
</tr>
<tr>
<td>86</td>
<td>V</td>
</tr>
<tr>
<td>87</td>
<td>W</td>
</tr>
<tr>
<td>88</td>
<td>X</td>
</tr>
<tr>
<td>89</td>
<td>Y</td>
</tr>
<tr>
<td>90</td>
<td>Z</td>
</tr>
<tr>
<td>91</td>
<td>[</td>
</tr>
<tr>
<td>92</td>
<td>\</td>
</tr>
<tr>
<td>93</td>
<td>]</td>
</tr>
<tr>
<td>94</td>
<td>^</td>
</tr>
<tr>
<td>95</td>
<td>_</td>
</tr>
<tr>
<td>96</td>
<td>`</td>
</tr>
<tr>
<td>97</td>
<td>a</td>
</tr>
<tr>
<td>98</td>
<td>b</td>
</tr>
<tr>
<td>99</td>
<td>c</td>
</tr>
<tr>
<td>100</td>
<td>d</td>
</tr>
<tr>
<td>101</td>
<td>e</td>
</tr>
<tr>
<td>102</td>
<td>f</td>
</tr>
<tr>
<td>103</td>
<td>g</td>
</tr>
<tr>
<td>104</td>
<td>h</td>
</tr>
<tr>
<td>105</td>
<td>i</td>
</tr>
<tr>
<td>106</td>
<td>j</td>
</tr>
<tr>
<td>107</td>
<td>k</td>
</tr>
<tr>
<td>108</td>
<td>l</td>
</tr>
<tr>
<td>109</td>
<td>m</td>
</tr>
<tr>
<td>110</td>
<td>n</td>
</tr>
<tr>
<td>111</td>
<td>o</td>
</tr>
<tr>
<td>112</td>
<td>p</td>
</tr>
<tr>
<td>113</td>
<td>q</td>
</tr>
<tr>
<td>114</td>
<td>r</td>
</tr>
<tr>
<td>115</td>
<td>s</td>
</tr>
<tr>
<td>116</td>
<td>t</td>
</tr>
<tr>
<td>117</td>
<td>u</td>
</tr>
<tr>
<td>118</td>
<td>v</td>
</tr>
<tr>
<td>119</td>
<td>w</td>
</tr>
<tr>
<td>120</td>
<td>x</td>
</tr>
<tr>
<td>121</td>
<td>y</td>
</tr>
<tr>
<td>122</td>
<td>z</td>
</tr>
<tr>
<td>123</td>
<td>{</td>
</tr>
<tr>
<td>124</td>
<td></td>
</tr>
<tr>
<td>125</td>
<td>}</td>
</tr>
<tr>
<td>126</td>
<td>~</td>
</tr>
<tr>
<td>127</td>
<td>del</td>
</tr>
</tbody>
</table>
Representing strings

- In C and C++, a string is stored as an array of bytes.
  - Each array element is an ASCII code representing one character.
  - A 0 value marks the end of the array or string.
- For example, “The Godfather” can be stored as a 14-byte array.

<table>
<thead>
<tr>
<th>84</th>
<th>104</th>
<th>101</th>
<th>32</th>
<th>71</th>
<th>111</th>
<th>100</th>
<th>102</th>
<th>97</th>
<th>116</th>
<th>104</th>
<th>101</th>
<th>114</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>T</td>
<td>h</td>
<td>e</td>
<td>G</td>
<td>o</td>
<td>d</td>
<td>f</td>
<td>a</td>
<td>t</td>
<td>h</td>
<td>e</td>
<td>r</td>
<td></td>
<td>\0</td>
</tr>
</tbody>
</table>

- Array elements are stored in contiguous memory locations. For example, if the first letter of “The Godfather” is at address 1000, the terminating zero will be at address 1013.

<table>
<thead>
<tr>
<th>1000</th>
<th>1001</th>
<th>1002</th>
<th>1003</th>
<th>1004</th>
<th>1005</th>
<th>1006</th>
<th>1007</th>
<th>1008</th>
<th>1009</th>
<th>1010</th>
<th>1011</th>
<th>1012</th>
<th>1013</th>
</tr>
</thead>
<tbody>
<tr>
<td>84</td>
<td>104</td>
<td>101</td>
<td>32</td>
<td>71</td>
<td>111</td>
<td>100</td>
<td>102</td>
<td>97</td>
<td>116</td>
<td>104</td>
<td>101</td>
<td>114</td>
<td>0</td>
</tr>
<tr>
<td>T h e</td>
<td>G o d</td>
<td>f a t h e r</td>
<td>\0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
String manipulation example

- Let’s write a short program to count the number of space characters that appear in a string.
- We’ll start with the higher-level C/C++ code given below.
  - Assume the starting address of the string is stored in variable R0, and the number of spaces found will be placed in variable R3.
  - We just loop over the character array, counting the number of times that ASCII code 32 appears. The loop stops when the array element 0, denoting the end of the string, is found.

```c
char R0[];
int i, R3;

R3 = 0;
i = 0;
while (R0[i] != 0) {
    if (R0[i] == 32)
        R3++;
    i++;
}
```
Assembly language translation

- Here is a direct translation of the array version.
  - **R0** contains the string’s starting address.
  - **R1** contains the loop index, which was “i” in the C code.
  - **R3** contains R0[i].
- We also need the *address* of R0[i] to load the data; this is stored in **R2**.

```
LD   R3, #0      // R3 counts number of spaces
LD   R1, #0      // Use R1 as index i
LOOP: ADD  R2, R0, R1  // R2 = address of R0[i]
       LD   R2, (R2)  // R2 = R0[i]
       BZ   R2, DONE  // Exit if R2 = 0 (end of string)
       SUB  R2, R2, #32 // If R2 != 32 then...
       BNZ  R2, NEXT   // ...go on to next character
       ADD  R3, R3, #1 // Increment space counter R3
NEXT: ADD  R1, R1, #1  // Increment string index
       JMP  LOOP      // Repeat
DONE: ...         // Rest of program
```
Translation notes

- Even though this is such a simple program, it illustrates many differences between high-level and low-level languages!

- Our assembly code uses registers to represent C variables. However, the string itself is stored in RAM, since it could be quite long and we wouldn’t necessarily have enough registers.

- The string characters in RAM must be loaded into a register before they can be manipulated.
  - We must specify the exact location of each element that we access. If the RAM word size is one byte and the string starts at address R0, then character R1 will be at address R0 + R1.
  - The C code includes \texttt{R0[i]} twice, but we don’t have to do two load operations. Instead, we can just re-use register R2.

- Our assembly language has only a limited set of branch instructions, so testing if R2 = 32 takes some extra thought. The code here works since if R2 - 32 = 0, then R2 = 32.

- The \texttt{DONE} label represents the rest of the program—what’s shown here is just a fragment.
The loop index \texttt{R1} isn’t really necessary.

- We increment \texttt{R1} on each loop iteration, but all we do is add it to the string address \texttt{R0}, to access the next character of the array.
- We can make our program a bit shorter by incrementing \texttt{R0} itself, and dispensing with \texttt{R1} completely.

\begin{verbatim}
LD   R3, #0
LD   R1, #0
LOOP: ADD  R2, R0, R1
LD   R2, (R2)
BZ   R2, DONE
SUB  R2, R2, #32
BNZ  R2, NEXT
ADD  R3, R3, #1
NEXT: ADD  R1, R1, #1
JMP  LOOP
DONE: ...
\end{verbatim}

\begin{verbatim}
LD   R3, #0
LOOP:
LD   R2, (R0)
BZ   R2, DONE
SUB  R2, R2, #32
BNZ  R2, NEXT
ADD  R3, R3, #1
NEXT: ADD  R0, R0, #1
JMP  LOOP
DONE: ...
\end{verbatim}
Pointers

- This new version corresponds closely to C or C++ code that uses pointers, which you may have worked with before.

```plaintext
LD   R3, #0
LOOP: LD   R2, (R0)
      BZ    R2, DONE
      SUB  R2, R2, #32
      BNZ  R2, NEXT
      ADD  R3, R3, #1
NEXT: ADD  R0, R0, #1
      JMP  LOOP
DONE: ...

char *R0;
int R3;

R3 = 0;
while (*R0 != 0) {
    if (*R0 == 32)
        R3++;
    R0++;
}
```
Addressing modes

- The first ISA design issue we’ll see are different **addressing modes**, which let you specify memory addresses in various ways.
  - Different modes may be useful in different situations.
  - Each mode has its own notation in assembly language.
  - The location that is actually accessed is called the **effective address**.
- The addressing modes that are available will depend on the datapath.
  - Our simple datapath only supports two forms of addressing.
  - Older processors like the 8086 have zillions of addressing modes.
- We’ll introduce some of the more common ones.
Immediate addressing

- One of the simplest modes is immediate addressing, where the operand itself is accessed.

  \[ \text{LD R1, #1999} \quad \text{R1} \leftarrow 1999 \]

- This mode is a good way to specify initial values for registers.
- We’ve already used immediate addressing several times.
  - We introduced it last Wednesday with some short examples.
  - It appears in the string conversion program you just saw.
Direct addressing

- A second mode is **direct addressing**, where the operand is a constant that represents a memory address.

\[
LD \ R1, \ 500 \quad R1 \leftarrow M[500]
\]

- Here the effective address is 500, which is just the operand itself.
- This is useful for working with pointers.
  - You can think of the constant as a pointer.
  - The register gets loaded with the data at that address.
Register indirect addressing

- We already saw register indirect mode, where the operand is a register that contains a memory address.

  \[ \text{LD R1, (R0)} \quad \text{R1} \leftarrow M[R0] \]

- The effective address would be the value in R0.
- This is also useful for working with pointers.
  - R0 might be a pointer, and R1 is loaded with the data at that address.
  - This is similar to \( R1 = *R0 \) in C or C++.
- What’s the difference between this register indirect and direct modes?
  - In direct mode, the address is a constant that is hard-coded into the program and cannot be changed.
  - Here the contents of R0, and hence the address being accessed, can easily be changed.
Stepping through arrays

- Register indirect mode makes it easy to access contiguous locations in memory, such as elements of an array.
- If R0 is the address of the first element in an array, we can easily access the second element too.

```
LD  R1, (R0)  // R1 contains the first element
ADD R0, R0, #1
LD  R2, (R0)  // R2 contains the second element
```

- This is so common that some instruction sets can automatically increment the register for you.

```
LD  R1, (R0)+  // R1 contains the first element
LD  R2, (R0)+  // R2 contains the second element
```

- Such instructions can be used within loops to access an entire array.
Indexed addressing

- Operands with **indexed addressing** include a constant *and* a register.

\[
\text{LD } R1, \ 500(R0) \quad R1 \leftarrow M[R0 + 500]
\]

- The effective address is the register data plus the constant. For instance, if \( R0 \) contains 25, the effective address here would be 525.
- We can use this addressing mode to access arrays also.
  - The constant is the array address, while the register contains an index into the array.
  - For instance, the instruction above might load the 25th element of an array that starts at memory location 500.
- This form of addressing combines the ideas of direct and register-indirect modes, so it’s more complex but also more flexible.
- This is the only mode supported in the MIPS instruction set, which you’ll work with more if you go on to take CS232.
PC-relative addressing

- In **PC-relative addressing**, the operand is a constant that is added to the program counter to produce the effective memory address.

  $$\text{LD } R1, \$30 \quad R1 \leftarrow M[PC + 30]$$

- This is similar to indexed addressing, except the PC is used instead of a regular register.
  - For example, if this instruction was stored at memory address 200, then the effective address would be 230.
  - In some systems the PC actually points to the *next* instruction, in which case the effective address here would be 231.

- Relative addressing is often used in jump and branch instructions.
  - For instance, **JMP $30** lets you skip the next 30 instructions.
  - A negative constant lets you jump backwards, which is common in writing loops as we’ve seen already.
Indirect addressing

- The most complicated mode that we’ll look at is **indirect addressing**. The operand is a constant that represents a memory location, which refers to another location, whose contents are then accessed.

\[
\text{LD R1, [360] \quad R1 \leftarrow M[M[360]]}
\]

- The effective address here is \(M[360]\).
- Indirect addressing is useful for working with pointers to pointers, which are sometimes also called “handles.”
  - The constant represents a pointer to a pointer.
  - In C, we might write something like \(R1 = **\text{ptr}\).
## Addressing mode summary

<table>
<thead>
<tr>
<th>Mode</th>
<th>Notation</th>
<th>Register transfer equivalent</th>
</tr>
</thead>
<tbody>
<tr>
<td>Immediate</td>
<td>LD R1, #CONST</td>
<td>R1 ← CONST</td>
</tr>
<tr>
<td>Direct</td>
<td>LD R1, CONST</td>
<td>R1 ← M[CONST]</td>
</tr>
<tr>
<td>Register indirect</td>
<td>LD R1, (R0)</td>
<td>R1 ← M[R0]</td>
</tr>
<tr>
<td>Indexed</td>
<td>LD R1, CONST(R0)</td>
<td>R1 ← M[R0 + CONST]</td>
</tr>
<tr>
<td>Relative</td>
<td>LD R1, $CONST</td>
<td>R1 ← M[PC + CONST]</td>
</tr>
<tr>
<td>Indirect</td>
<td>LD R1, [CONST]</td>
<td>R1 ← M[M[CONST]]</td>
</tr>
</tbody>
</table>
Another way to classify instruction sets is by the number of operands that each data manipulation instruction can have.

Our examples last week were all three-address instructions, because each one had up to three operands—two sources and one destination.

ADD R0, R1, R2

operation
ADD

operands
R0, R1, R2

Register transfer instruction:

R0 ← R1 + R2

This provides the most flexibility, but some instruction sets allow fewer than three operands.
Two-address instructions

- In two-address instructions, the first operand acts as both the destination and one of the sources.

Here are some other examples, with the corresponding register transfer operation and C code.

- ADD  R3, #1  \[ R3 \leftarrow R3 + 1 \quad R3++; \]
- MUL  R1, #5  \[ R1 \leftarrow R1 \times 5 \quad R1 *= 5; \]
- NOT  R1  \[ R1 \leftarrow \neg R1 \quad R1 = \neg R1; \]
Some computers, like my old Apple II, have one-address instructions.

The CPU has a special register called the accumulator which implicitly serves as the destination and one of the sources—it “accumulates” the result of a computation.

- Here is some code to increment M[R0], using register-indirect addressing.
The ultimate: zero addresses

- If the destination and sources are all implicit, then you wouldn’t have to specify any operands at all!
- This is possible with processors that use a stack architecture.
  - Operands are pushed on a stack, which is just a reserved area in RAM. The most recently pushed element is at the top of the stack, or TOS.
  - Operations use the topmost stack elements as their operands. Those values are then replaced with the operation’s result.
- Examples of stack-based systems include Hewlett-Packard calculators, Java intermediate bytecode, and Intel’s 8087 floating-point architecture.
Stack architecture example

- From left to right, here are three stack instructions, and what the stack looks like after each example instruction is executed.

```
PUSH R1
  R1
  ... stuff 1 ...
  ... stuff 2 ...

PUSH R2
  R2
  R1
  ... stuff 1 ...
  ... stuff 2 ...

ADD
  R1 + R2
  ... stuff 1 ...
  ... stuff 2 ...
```

- This sequence of stack operations corresponds to one register transfer instruction.

\[ \text{TOS} \leftarrow R1 + R2 \]
Data movement instructions

- Finally, the *types* of operands allowed in data manipulation instructions is another way of characterizing instruction sets.
  - So far, we’ve assumed that ALU operations can have only register and constant operands.
  - Many real instruction sets allow memory-based operands as well.
- We’ll use the book’s example and illustrate how the following operation can be translated into some different assembly languages.

\[ X = (A + B) \times (C + D) \]

- Assume that A, B, C, D and X are really memory addresses.
Register-to-register architectures

- Up until now our programs have used a register-to-register or load/store architecture, which matches our datapath from last week nicely.
  - Operands in data manipulation instructions must be registers.
  - Data transfer instructions move data between memory and registers.
- In a register-to-register, three-address instruction set, we can translate the operation $X = (A + B)(C + D)$ into the following code.

```
LD R1, A  R1 ← M[A]  // Use direct addressing
LD R2, B  R2 ← M[B]
ADD R3, R1, R2 R3 ← R1 + R2  // R3 = M[A] + M[B]

LD R1, C  R1 ← M[C]
LD R2, D  R2 ← M[D]
ADD R1, R1, R2 R1 ← R1 + R2  // R1 = M[C] + M[D]

MUL R1, R1, R3 R1 ← R1 * R3  // R1 has the result
ST X, R1  M[X] ← R1  // Store that into M[X]
```
Memory-to-memory architectures

- In a **memory-to-memory** architecture, all data manipulation instructions use memory addresses as operands.
- With a memory-to-memory, three-address instruction set, we could do the operation $X = (A + B)(C + D)$ a little more simply.

```
```

- Here’s the same operation but with a **two-address** instruction set.

```
```
Register-to-memory architectures

- In a register-to-memory architecture, data manipulation instructions can access both registers and memory.
- With two-address instructions, we might do the following.

```plaintext
LD R1, C  R1 ← M[C]  // Load M[C] into R1
ADD R1, D  R1 ← R1 + M[D]  // Add M[D]
MUL X, R1  M[X] ← M[X] * R1  // Multiply
```
Size and speed

- There are many factors to consider when deciding how many and what kind of operands and addressing modes to support in a processor.
- These decisions can affect the size of machine language programs.
  - Permitting more operands leads to longer instructions.
  - Memory addresses are long compared to register file addresses (since memories have larger capacities), so instructions with memory-based operands are typically longer than those with register operands.
- There is also an impact on the speed of the program.
  - Memory accesses are much slower than register accesses.
  - Longer programs access memory more, just to load the program!
- Most newer processors use register-to-register designs.
  - Reading from registers is faster than reading from RAM.
  - Using register operands also leads to shorter instructions.
Summary

- **Addressing modes** specify how instructions access memory.
  - Our sample ISA supports only immediate and register indirect modes.
  - Other processors may support many other addressing modes.
- Data manipulation instructions may have from 0 to 3 operands.
  - We will mostly work with a **three-address** instruction set in this class.
  - **Two-address, one-address** and **zero-address** instructions are possible.
- Instruction sets may permit different types of operands.
  - Our datapath supports a **register-to-register** architecture, where data operands must be registers or constants.
  - In **register-to-memory** or **memory-to-memory** ISAs, the operands could be memory addresses or a mix of addresses and registers.
- “The Godfather” is a 14-byte C string.