MMU description » History » Version 78

Alexander Kamkin, 12/01/2014 02:56 PM

h1. MMU Description

_~By Alexander Kamkin and Taya Sergeeva~_

*UNDER CONSTRUCTION*

{{toc}}

A _memory management unit_ (_MMU_) is known to be one of the most complex and error-prone components of a modern microprocessor. MicroTESK has a special subsystem, called _MMU subsystem_, intended for (1) specifying memory devices and (2) deriving testing knowledge from such specifications. The subsystem provides unified facilities for describing memory buffers (like _L1_ and _L2 caches_, _translation look-aside buffers_ (_TLBs_), etc.) as well as a means for connecting several buffers into a memory hierarchy.

h2. Grammar

<pre>
startRule
    : bufferOrAddress*
    ;

bufferOrAddress
    : address
    | buffer
    ;
</pre>

h2. Address Description

A buffer is accessed by an _address_, which is typically a _bit vector_ of a fixed length (width). Different buffers may share an address space (e.g., L1 and L2 are usually both addressed by physical addresses). In the general case, however, each buffer has its own domain.

An address space is described using the keyword @address@. The description can specify two parameters: @width@ (obligatory) and @format@ (optional).

h3. Grammar

<pre>
address
    : 'address' ID '{'
        (addressParameter ';')*
      '}'
    ;

addressParameter
    : width
    | format
    ;
</pre>

h3. Address Width Parameter

The @width@ parameter specifies the address width. The parameter is obligatory.

h4. Grammar

<pre>
width
    : 'width' '=' expr
    ;
</pre>

h3. Address Format Parameter

The @format@ parameter specifies the address format (a number of named fields). The parameter is optional. By default, the address is unstructured.

h4. Grammar

<pre>
format
    : 'format' '=' '('
        field (',' field)*
      ')'
    ;

field
    : ID ':' expr ('=' expr)?
    ;
</pre>

h2. Examples

<pre>
// The singleton.
address Void {
  width = 0;
}
</pre>

<pre>
// Unstructured 64-bit virtual addresses.
address VA {
  width = 64;
}
</pre>

<pre>
// Structured 40-bit physical addresses.
address PA {
  width = 40;
  format = (tag:24, l1Index:7, dwPosition:2, bytePosition:3);
}
</pre>

The code above defines three address spaces: (1) a singleton @Void@; (2) a space @VA@ consisting of 64-bit addresses (_virtual addresses_); and (3) a space @PA@ consisting of 40-bit addresses (_physical addresses_), each divided into four fields: @tag@ (24 bits), @l1Index@ (7 bits), @dwPosition@ (2 bits) and @bytePosition@ (3 bits).
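
To make the idea of a structured address more concrete, the following Python sketch splits an integer address into named fields. It is only an illustration, not part of MicroTESK: the helper @split_fields@ and the 12-bit layout are hypothetical, and the sketch assumes that the first field in the list occupies the most significant bits, which the description above does not state.

```python
def split_fields(value, fields):
    """Split an integer address into named fields.

    `fields` is a list of (name, width) pairs. Assumption of this sketch:
    the first field occupies the most significant bits.
    """
    total = sum(width for _, width in fields)
    result = {}
    shift = total
    for name, width in fields:
        shift -= width                      # position of the field's lowest bit
        result[name] = (value >> shift) & ((1 << width) - 1)
    return result


# Hypothetical 12-bit layout, not taken from the MMU description above.
LAYOUT = [("tag", 6), ("index", 4), ("offset", 2)]
parts = split_fields(0b101101_0011_10, LAYOUT)
print(parts)
```

Keeping the layout as plain data mirrors the declarative style of the @format@ parameter: the same helper works for any list of field widths.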

h2. Buffer Description

A buffer is described using the keyword @buffer@. The description specifies a set of parameters, including @ways@, @sets@, @format@, @index@, @match@ and @policy@. All of the parameters are obligatory except @policy@ and, if @sets = 1@, @index@.

h3. Grammar

<pre>
buffer
    : 'buffer' ID '{'
        (bufferParameter ';')*
      '}'
    ;

bufferParameter
    : ways
    | sets
    | format
    | index
    | match
    | policy
    ;
</pre>

h3. Buffer Associativity (ways)

h4. Grammar

h3. Buffer Size (sets)

h4. Grammar

h3. Buffer Line Format (format)

h4. Grammar

h3. Buffer Index Function (index)

h4. Grammar

h3. Buffer Match Predicate (match)

h3. Buffer Data Replacement Policy (policy)

h4. Grammar

h3. Examples

Let us consider a simple buffer that has only two parameters: the associativity, *ways* (i.e., the number of lines in a set), and the number of sets in the buffer, *sets*.

<pre>
buffer TLB {
  ways = 8;
  sets = 64;
}
</pre>

The example above describes a translation look-aside buffer (_TLB_) with an associativity of 8 (i.e., each set of this TLB contains 8 lines) and 64 sets.

Each *line* of the buffer can optionally be described by _tag_ and _data_ fields. For example,

<pre>
line = (tag:22, data:1024);
</pre>

describes the lines of the cache, each containing a 22-bit tag and 1024 bits of data.

A buffer can also have an *index* function. When accessing data, the cache selects a set by calculating an index from the address. For example,

<pre>
index(addr:PA) = addr<14..13>;
</pre>

Here the cache calculates a 2-bit index. The angle brackets after _addr_ give the upper and lower bit positions of the address field used as the index.
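
The bit-range selection used by _index_ can be sketched in Python as follows. This is a minimal illustration rather than MicroTESK code: the helper @bits@ is hypothetical, and it assumes that bit 0 is the least significant bit and that both bounds are inclusive, which agrees with @addr<14..13>@ yielding a 2-bit value.

```python
def bits(value, high, low):
    """Return bits high..low of value (inclusive; bit 0 is least significant)."""
    if high < low:
        raise ValueError("high must not be less than low")
    width = high - low + 1
    return (value >> low) & ((1 << width) - 1)


def index(addr):
    """Python counterpart of: index(addr:PA) = addr<14..13>"""
    return bits(addr, 14, 13)


print(index(0x6000))  # bits 14 and 13 are both set
```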

Each device stores data that can be accessed (read or written) by address. If a device contains a line with a given address, the situation is called a 'hit'; the opposite situation is referred to as a 'miss'. If a 'miss' occurs, the device usually replaces one of the set's lines with the line associated with the given address. The predicate that determines whether there is a 'hit' or a 'miss' is called *match*. For example:

<pre>
line = (tag:22, data:1024);
match(addr:PA) = line.tag == addr<39..18>;
</pre>

If the set contains a line with the tag equal to the 22 upper bits of the physical address, this is a 'hit'. _match_ returns 'true' if there is a 'hit' in the line, and returns 'false' otherwise.
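
The hit and miss check described above can be sketched in Python as follows. The sketch is illustrative only: the names @Line@, @upper_bits@ and @lookup@ and the @valid@ flag are assumptions, and the widths are taken from the examples in this document (a 40-bit physical address and a 22-bit tag).

```python
from dataclasses import dataclass

ADDR_WIDTH = 40  # physical address width, as in the PA example
TAG_WIDTH = 22   # tag width, as in: line = (tag:22, data:1024)


@dataclass
class Line:
    tag: int
    data: int
    valid: bool = True  # validity flag is an assumption of this sketch


def upper_bits(addr, count, width=ADDR_WIDTH):
    """Return the `count` most significant bits of a `width`-bit address."""
    return (addr >> (width - count)) & ((1 << count) - 1)


def match(line, addr):
    """True if the line's tag equals the upper TAG_WIDTH bits of addr."""
    return line.valid and line.tag == upper_bits(addr, TAG_WIDTH)


def lookup(lines, addr):
    """Return the first matching line of a set (a hit), or None (a miss)."""
    for line in lines:
        if match(line, addr):
            return line
    return None


cache_set = [Line(tag=0x1, data=111), Line(tag=0x2ABCDE, data=222)]
hit = lookup(cache_set, (0x2ABCDE << 18) | 0x3FFFF)
miss = lookup(cache_set, 0)
```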

The strategy used for line replacement is specified by *policy*.

<pre>
policy = LRU;
</pre>

The example above sets the data replacement strategy to the _Least Recently Used_ (_LRU_) policy, i.e., when a 'miss' occurs, the cache replaces the least recently used line of the set.
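
The behaviour of an LRU policy within a single set can be sketched in Python as follows. The class @LruSet@ is hypothetical and is not the MicroTESK implementation; it only tracks tags and assumes that every access, hit or miss, makes the line the most recently used one.

```python
from collections import OrderedDict


class LruSet:
    """One set of a buffer with least-recently-used replacement."""

    def __init__(self, ways):
        self.ways = ways
        self.lines = OrderedDict()  # tag -> data, least recently used first

    def access(self, tag, data=None):
        """Return True on a hit; on a miss, insert the line and return False."""
        if tag in self.lines:
            self.lines.move_to_end(tag)     # mark as most recently used
            return True
        if len(self.lines) >= self.ways:
            self.lines.popitem(last=False)  # evict the least recently used line
        self.lines[tag] = data
        return False


lru = LruSet(ways=2)
results = [lru.access(tag) for tag in (1, 2, 1, 3)]
```

In this run the access to tag 3 evicts tag 2, because tag 1 was used more recently.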

The example below describes a real 'lower-level' cache, L1:

<pre>
buffer L1
{
	associativity = 4;
	sets = 128;
	line = (tag:30, data:256);
	index(addr:PA) = addr<9..8>;
	match(addr:PA) = line.tag == addr<39..10>;
	policy = lru;
}
</pre>

_Each construct_ of the buffer example is described below:

h3. buffer

<pre>
  has a name, 'L1' in our example; other buffers can be named 'L2' or 'TLB', for instance;
  _buffer_ can be described by various parameters, such as _associativity_, _sets_, _index_, _match_ and _policy_; the number of parameters is not fixed;
</pre>

h3. associativity

<pre>
  _associativity_ is the associativity of a buffer, i.e. the number of lines in one set;
</pre>

h3. sets

<pre>
  _sets_ is the number of sets in a given buffer;
</pre>

h3. line

<pre>
  _line_ is an optional description of the line's fields;
  it describes each line of the cache;
  _line_ lists its fields in parentheses: _tag_ and _data_, each with its width in bits;
  in our example _line_ has only two fields, but in the general case it can include more;
  in our example a line contains a 30-bit tag and 256 bits of data;
</pre>

h3. index

<pre>
  _index_ is the function for index calculation;
  it selects an address field by its upper and lower bit positions, which are written in angle brackets after _addr_; in our case the index has 2 bits;
  _index_ depends on an _address_, which is 'physical' (PA) in our case; the type of the address is given in the parentheses after _index_;
</pre>

h3. match

<pre>
  _match_ is a predicate that checks whether a line and an address match each other;
  it returns 'true' if the required data is in the given line (a 'hit') and 'false' otherwise;
  if the set contains a line with the tag equal to the 30 upper bits of the physical address, this is a 'hit'; if the set contains no such line, this is a 'miss';
  the _match_ description gives the upper and lower bit positions of the address field in angle brackets after _addr_;
  like _index_, _match_ specifies the type of the address in parentheses; 'PA' in our case;
</pre>

h3. policy

<pre>
  _policy_ is the data replacement strategy;
  it sets the policy applied to the buffer, 'lru' (Least Recently Used) in our example;
  the policy can also be 'plru' (Pseudo-LRU) or 'fifo' (First In, First Out).
</pre>
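
For comparison with 'lru', the 'fifo' policy can be sketched in Python as follows. The class @FifoSet@ is hypothetical and is not taken from MicroTESK; it assumes that a hit leaves the eviction order unchanged, so the line that was inserted first is always the one replaced.

```python
from collections import deque


class FifoSet:
    """One set of a buffer with first-in, first-out replacement."""

    def __init__(self, ways):
        self.ways = ways
        self.tags = deque()  # oldest inserted tag first

    def access(self, tag):
        """Return True on a hit; on a miss, insert the tag and return False."""
        if tag in self.tags:
            return True          # a hit does not change the eviction order
        if len(self.tags) >= self.ways:
            self.tags.popleft()  # evict the line that was inserted first
        self.tags.append(tag)
        return False


fifo = FifoSet(ways=2)
outcomes = [fifo.access(tag) for tag in (1, 2, 1, 3)]
```

Here the access to tag 3 evicts tag 1, the oldest line, even though tag 1 has just been hit; a least-recently-used policy would evict tag 2 instead.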

h2. Code Structure

The MMU grammar is in the ru.ispras.microtesk.translator.mmu.grammar folder. It contains the Lexer, Parser and TreeWalker files. These files can be compiled using the build.xml file (microtesk++/build.xml). The generated files (MMULexer.java, MMUParser.java, MMUTreeWalker.java) are in the microtesk++.gen.ru.ispras.microtesk.translator.mmu.grammar folder.

The folders ru.ispras.microtesk.translator.mmu.ir.* contain the internal representation of the MMU hierarchy of one buffer.

The MMU translator is in the ru.ispras.microtesk.translator.mmu.translator folder.

Files in the ru.ispras.microtesk.model.api.mmu folder contain the different cache policies. The folder ru.ispras.microtesk.model.api.mmu.buffer contains the model of the MMU: the files that describe Buffer, Set, Line, Address expressions.

After the grammar files have been generated, the file 'BufferExample' can be loaded into the translator.