Project

General

Profile

MMU description » History » Version 68

Alexander Kamkin, 12/01/2014 02:23 PM

1 24 Alexander Kamkin
h1. MMU Description
2 1 Taya Sergeeva
3 66 Alexander Kamkin
_~By Alexander Kamkin and Taya Sergeeva~_
4 62 Alexander Kamkin
5 65 Alexander Kamkin
*UNDER CONSTRUCTION*
6
7 63 Alexander Kamkin
{{toc}}
8
9 35 Alexander Kamkin
A _memory management unit_ (_MMU_) is known to be one of the most complex and error-prone components of a modern microprocessor. MicroTESK has a special subsystem, called _MMU subsystem_, intended for (1) specifying memory devices and (2) deriving testing knowledge from such specifications. The subsystem provides unified facilities for describing memory buffers (like _L1_ and _L2 caches_, _translation look-aside buffers_ (_TLBs_), etc.) as well as a means for connecting several buffers into a memory hierarchy.
10 1 Taya Sergeeva
11 66 Alexander Kamkin
The grammar is as follows.
12
13
<pre>
14
startRule 
15
    : bufferOrAddress*
16
    ;
17
18
bufferOrAddress
19
    : address
20
    | buffer
21
    ;
22
</pre>
23
24 1 Taya Sergeeva
h2. Address Description
25 56 Taya Sergeeva
26 1 Taya Sergeeva
A buffer is accessed by an _address_, which is typically a _bit vector_ of a fixed length (width). Different buffers are allowed to have a common address space (e.g., L1 and L2 are usually both addressed by physical addresses). However, in general case, each buffer has its own domain.
27
28 68 Alexander Kamkin
An address space is described using a keyword *address*. The @width@ and @format@ optional parameters can be specified inside the address space definition.
29 1 Taya Sergeeva
30 68 Alexander Kamkin
h3. Address Width Parameter
31 1 Taya Sergeeva
32 68 Alexander Kamkin
The @width@ parameter specifies the address width. The parameter is optional. The default value is @0@.
33
34
h3. Address Format Parameter
35
36
The @format@ parameter specifies the address format (a number of named fields). The parameter is optional. By default, the address is unstructured.
37
38
h3. Address Description Examples
39
40 1 Taya Sergeeva
<pre>
41
// The singleton.
42 66 Alexander Kamkin
address Void {
43
  width = 0;
44
}
45 1 Taya Sergeeva
</pre>
46
47 61 Alexander Kamkin
<pre>
48 66 Alexander Kamkin
// An unstructured 64-bit virtual addresses.
49
address VA {
50
  width = 64;
51
}
52 1 Taya Sergeeva
</pre>
53 61 Alexander Kamkin
54 1 Taya Sergeeva
<pre>
55 66 Alexander Kamkin
// A stuctured 40-bit physical addresses.
56
address PA {
57
  width = 40;
58
  format = (tag:24, l1Index:7, dwPosition:2, bytePosition:3);
59 1 Taya Sergeeva
}
60
</pre>
61
62 66 Alexander Kamkin
</pre>
63
64
The code above defines three address spaces: (1) a singleton @Void@; (2) a space @VA@ consisting of 64-bit addresses (_virtual addresses_) and (3) a space @PA@ consisting of 40-bit addresses (_physical addresses_), each being divided into for fields: @tag@ (24 bits), @l1Index@ (7 bits), @dwPosition@ (2 bits) and @bytePosition@ (3 bits).
65
66 68 Alexander Kamkin
h3. Address Description Grammar
67 66 Alexander Kamkin
68
<pre>
69
address
70
    : ''address'' ID ''{''
71
        (addressParameter '';'')*
72
      ''}''
73
    ;
74
75
addressParameter
76
    : width
77
    | format
78
    ;
79
80
width
81
    : ''width'' ''='' expr
82
    ;
83
84
format
85
    : ''format'' ''='' ''(''
86
        field ('','' field)*
87
      '')''
88
    ;
89
90
field
91
    : ID '':'' expr (''='' expr)?
92
    ;
93
</pre>
94 10 Alexander Kamkin
95 2 Taya Sergeeva
h2. Buffer Description
96 1 Taya Sergeeva
97 57 Taya Sergeeva
Buffer is described by the construct *buffer*. Buffer can have different parameters, such as an associativity, a number of lines, the tag computing function, the index computing function, and the structure of data unit displacement, the controlling bits, the strategies of data changing when ''miss'' occurs, and so on. 
98 1 Taya Sergeeva
99 64 Taya Sergeeva
Let as consider a simple buffer which has only 2 attributes, such as the associativity, *associativity*, i.e. the set''s size, and the number of sets in the buffer, *sets*. 
100 57 Taya Sergeeva
101 56 Taya Sergeeva
<pre>
102
buffer TLB 
103
{ 
104 64 Taya Sergeeva
  associativity=8;
105
  sets=64;
106 56 Taya Sergeeva
} 
107 1 Taya Sergeeva
</pre>
108
109 57 Taya Sergeeva
The example above describes translation lookaside buffer (_TLB_), which has an associativity being equal to 8, (i.e. the number of lines in one set in this TLB buffer is equal to 8), and has the number of lines being equal to 64.   
110 56 Taya Sergeeva
111 57 Taya Sergeeva
Each *line* of the buffer can be described optionally by _tag_ and _data_ parameters. 
112 56 Taya Sergeeva
For example, 
113
114 1 Taya Sergeeva
<pre>
115 56 Taya Sergeeva
line = (tag:22, data:1024);
116 1 Taya Sergeeva
</pre>
117 56 Taya Sergeeva
118 1 Taya Sergeeva
describes lines of the cache, each of them containing a 22-bit tag and 1024-bit data.
119 56 Taya Sergeeva
120 57 Taya Sergeeva
In a MMU buffer also can have the *index* computing function. When accessing data, the cache determines a set by calculating a x-bit index. For example,
121 56 Taya Sergeeva
122 1 Taya Sergeeva
<pre>
123 57 Taya Sergeeva
index(addr:PA) = addr<14..13>;
124 1 Taya Sergeeva
</pre>
125
126 57 Taya Sergeeva
The cache calculates a 2-bit index. _index_ returns the initial and the final points of the field kept in bytes.
127 1 Taya Sergeeva
128 57 Taya Sergeeva
Each device stores some data which can be accessed (read from or written into) by their address. If a device contains a line with a given address, this situation is called a ''hit''; the opposite situation referes to as a ''miss''. If a ''miss'' occurs, the device usually displaces one of the set''s line with the line associated with the address given. The predicate which determines if there is a ''miss'' or ''hit'' situation is called *match*. There is the example below:
129
130 56 Taya Sergeeva
<pre>
131 57 Taya Sergeeva
line = (tag:22, data:1024);
132
match(addr:VA) = line.tag == addr<14..1>;
133 56 Taya Sergeeva
</pre>
134
135 57 Taya Sergeeva
If the set contains a line with the tag equal to the 22 upper bits of the physical address, this is a ''hit''. _match_ returns ''true'' if there is a ''hit'' in the line, and returns ''false'' otherwise.
136 56 Taya Sergeeva
137 57 Taya Sergeeva
The strategy which will be used for the lines displacement is specified by *policy*. 
138 56 Taya Sergeeva
139 57 Taya Sergeeva
<pre>
140
policy = LRU;
141
</pre>
142 56 Taya Sergeeva
143 57 Taya Sergeeva
Example above sets the strategy of data replacement to be _Last_ _Recently_ _Used_ policy, i.e. if the ''miss'' occured, the cache displaces the least-recently-used line of the set.
144 56 Taya Sergeeva
145 57 Taya Sergeeva
There is the example below, describing a real ''lower-level'' cache L1: 
146 2 Taya Sergeeva
147 53 Taya Sergeeva
<pre>
148
buffer L1 
149
{
150 64 Taya Sergeeva
	associativity = 4;
151
	sets = 128;
152 53 Taya Sergeeva
	line = (tag:30, data:256);
153 10 Alexander Kamkin
	index(addr:PA) = addr<9..8>;
154 1 Taya Sergeeva
	match(addr:PA) = line.tag == addr<39..10>;
155
	policy = lru;
156
}
157
</pre>
158 19 Taya Sergeeva
159
_Description of each constructor_ in the buffer example is below:
160 49 Taya Sergeeva
161 21 Taya Sergeeva
h3. buffer
162 55 Taya Sergeeva
163 21 Taya Sergeeva
<pre>
164 1 Taya Sergeeva
  has a name, ''L1'' in our example; it can have names ''L2'' and ''TLB'' also;
165 64 Taya Sergeeva
  _buffer_ can be described by different parameters, such _associativity_, _sets_, _index_, _match_, _policy_, and so on, which number is infixed;
166 16 Taya Sergeeva
</pre>
167 15 Taya Sergeeva
168 64 Taya Sergeeva
h3.  associativity 
169 15 Taya Sergeeva
170 1 Taya Sergeeva
<pre>
171 64 Taya Sergeeva
  _associativity_ is an associativity of a buffer; it returns the number of lines in a one set;
172 17 Taya Sergeeva
</pre>
173 15 Taya Sergeeva
174 64 Taya Sergeeva
h3.  sets
175 15 Taya Sergeeva
176 13 Taya Sergeeva
<pre>
177 64 Taya Sergeeva
  _sets_ is the number of sets in a given buffer;
178 1 Taya Sergeeva
</pre>
179 17 Taya Sergeeva
180 54 Taya Sergeeva
h3.  line
181
182 1 Taya Sergeeva
<pre>
183 52 Taya Sergeeva
  _line_ is an optional description of line''s fields;
184 54 Taya Sergeeva
  it designates each line of the cache; 
185 14 Taya Sergeeva
  _line_ includes its own parameters in the braces: _tag_ and _data_, each of them has an appropriate width of the fields kept in bytes;
186 1 Taya Sergeeva
  in our example _line_ has only two parameters, but in general case it can include more;
187
  it contains a 30-bit tag and a 256-bit data;
188 49 Taya Sergeeva
</pre>
189 17 Taya Sergeeva
190 54 Taya Sergeeva
h3.  index
191
192 1 Taya Sergeeva
<pre>
193
   _index_ is the function for index calculation;
194
   returns the initial and the final points of the field kept in bytes; they are marked in a three-cornered brackets, after _addr_; in our case index has 2 bits;
195
  _index_ depends on an _address_, which is ''physical'' (PA) in our case; the type of an address is set in the braces after _index_; 
196 49 Taya Sergeeva
</pre>
197 17 Taya Sergeeva
198 54 Taya Sergeeva
h3.  match 
199
200
<pre>
201 1 Taya Sergeeva
  _match_ is a predicate checking whether the line and the address match each other or not;
202
  it returns ''true'' or ''false'' depending on if the data required is in the given line or not; 
203 52 Taya Sergeeva
  it returns ''true'' if there is a ''hit'' in the line, and returns ''false'' otherwise; if the set contains a line with the tag equal to the 30 upper bits of the physical address, this is a ''hit''; if the set does not contain the line, this is a ''miss'' situation;
204 1 Taya Sergeeva
  _match_ description contains the the initial and the final points of the address field in the triangle brackets after _addr_; 
205
  as _index_ in the round braces _match_ also has the type of the address used; ''PA'' in our case;
206
</pre>
207 49 Taya Sergeeva
208 1 Taya Sergeeva
h3.  policy
209 54 Taya Sergeeva
210 56 Taya Sergeeva
<pre>
211 52 Taya Sergeeva
  _policy_ is the strategy of data displacement; 
212 1 Taya Sergeeva
  sets a policy which will be applied to our buffer, ''lru'' (Least Recently Used) in our example; 
213 25 Alexander Kamkin
  policy also can be ''plru'' (Pseudo LRU) and ''fifo'' (First Input First Out).
214
</pre>
215
216
h2. Code Structure
217
218
The MMU grammar is in ru.ispras.microtesk.translator.mmu.grammar folder. It contains Lexer, Parser and TreeWalker files. These files can be compiled by build.xml file (microtesk++/build.xml). The files generated (MMULexer.java, MMUParser.java, MMUTreeWalker.java) are in microtesk++.gen.ru.ispras.microtesk.translator.mmu.grammar folder. 
219
220
The folders ru.ispras.microtesk.translator.mmu.ir.* contain the inner representation of the MMU hierarchy of one buffer.  
221 1 Taya Sergeeva
222
MMU translator is in the ru.ispras.microtesk.translator.mmu.translator folder. 
223 26 Alexander Kamkin
224
Files in ru.ispras.microtesk.model.api.mmu folder contain different policies of cache. Folder ru.ispras.microtesk.model.api.mmu.buffer contains the model of MMU - the files which describe Buffer, Set, Line, Address expressions.  
225 1 Taya Sergeeva
226
After grammar files being generated the file ''BufferExample'' can be loaded to the translator.