1 |
106 |
markom |
\input texinfo
|
2 |
|
|
@c %**start of header
|
3 |
|
|
@setfilename agentexpr.info
|
4 |
|
|
@settitle GDB Agent Expressions
|
5 |
|
|
@setchapternewpage off
|
6 |
|
|
@c %**end of header
|
7 |
|
|
|
8 |
|
|
Revision: $Id: agentexpr.texi,v 1.1.1.1 2001-05-18 11:14:09 markom Exp $
|
9 |
|
|
|
10 |
|
|
@node The GDB Agent Expression Mechanism
|
11 |
|
|
@chapter The GDB Agent Expression Mechanism
|
12 |
|
|
|
13 |
|
|
In some applications, it is not feasable for the debugger to interrupt
|
14 |
|
|
the program's execution long enough for the developer to learn anything
|
15 |
|
|
helpful about its behavior. If the program's correctness depends on its
|
16 |
|
|
real-time behavior, delays introduced by a debugger might cause the
|
17 |
|
|
program to fail, even when the code itself is correct. It is useful to
|
18 |
|
|
be able to observe the program's behavior without interrupting it.
|
19 |
|
|
|
20 |
|
|
Using GDB's @code{trace} and @code{collect} commands, the user can
|
21 |
|
|
specify locations in the program, and arbitrary expressions to evaluate
|
22 |
|
|
when those locations are reached. Later, using the @code{tfind}
|
23 |
|
|
command, she can examine the values those expressions had when the
|
24 |
|
|
program hit the trace points. The expressions may also denote objects
|
25 |
|
|
in memory --- structures or arrays, for example --- whose values GDB
|
26 |
|
|
should record; while visiting a particular tracepoint, the user may
|
27 |
|
|
inspect those objects as if they were in memory at that moment.
|
28 |
|
|
However, because GDB records these values without interacting with the
|
29 |
|
|
user, it can do so quickly and unobtrusively, hopefully not disturbing
|
30 |
|
|
the program's behavior.
|
31 |
|
|
|
32 |
|
|
When GDB is debugging a remote target, the GDB @dfn{agent} code running
|
33 |
|
|
on the target computes the values of the expressions itself. To avoid
|
34 |
|
|
having a full symbolic expression evaluator on the agent, GDB translates
|
35 |
|
|
expressions in the source language into a simpler bytecode language, and
|
36 |
|
|
then sends the bytecode to the agent; the agent then executes the
|
37 |
|
|
bytecode, and records the values for GDB to retrieve later.
|
38 |
|
|
|
39 |
|
|
The bytecode language is simple; there are forty-odd opcodes, the bulk
|
40 |
|
|
of which are the usual vocabulary of C operands (addition, subtraction,
|
41 |
|
|
shifts, and so on) and various sizes of literals and memory reference
|
42 |
|
|
operations. The bytecode interpreter operates strictly on machine-level
|
43 |
|
|
values --- various sizes of integers and floating point numbers --- and
|
44 |
|
|
requires no information about types or symbols; thus, the interpreter's
|
45 |
|
|
internal data structures are simple, and each bytecode requires only a
|
46 |
|
|
few native machine instructions to implement it. The interpreter is
|
47 |
|
|
small, and strict limits on the memory and time required to evaluate an
|
48 |
|
|
expression are easy to determine, making it suitable for use by the
|
49 |
|
|
debugging agent in real-time applications.
|
50 |
|
|
|
51 |
|
|
@menu
|
52 |
|
|
* General Bytecode Design:: Overview of the interpreter.
|
53 |
|
|
* Bytecode Descriptions:: What each one does.
|
54 |
|
|
* Using Agent Expressions:: How agent expressions fit into the big picture.
|
55 |
|
|
* Varying Target Capabilities:: How to discover what the target can do.
|
56 |
|
|
* Tracing on Symmetrix:: Special info for implementation on EMC's
|
57 |
|
|
boxes.
|
58 |
|
|
* Rationale:: Why we did it this way.
|
59 |
|
|
@end menu
|
60 |
|
|
|
61 |
|
|
|
62 |
|
|
@c @node Rationale
|
63 |
|
|
@c @section Rationale
|
64 |
|
|
|
65 |
|
|
|
66 |
|
|
@node General Bytecode Design
|
67 |
|
|
@section General Bytecode Design
|
68 |
|
|
|
69 |
|
|
The agent represents bytecode expressions as an array of bytes. Each
|
70 |
|
|
instruction is one byte long (thus the term @dfn{bytecode}). Some
|
71 |
|
|
instructions are followed by operand bytes; for example, the @code{goto}
|
72 |
|
|
instruction is followed by a destination for the jump.
|
73 |
|
|
|
74 |
|
|
The bytecode interpreter is a stack-based machine; most instructions pop
|
75 |
|
|
their operands off the stack, perform some operation, and push the
|
76 |
|
|
result back on the stack for the next instruction to consume. Each
|
77 |
|
|
element of the stack may contain either a integer or a floating point
|
78 |
|
|
value; these values are as many bits wide as the largest integer that
|
79 |
|
|
can be directly manipulated in the source language. Stack elements
|
80 |
|
|
carry no record of their type; bytecode could push a value as an
|
81 |
|
|
integer, then pop it as a floating point value. However, GDB will not
|
82 |
|
|
generate code which does this. In C, one might define the type of a
|
83 |
|
|
stack element as follows:
|
84 |
|
|
@example
|
85 |
|
|
union agent_val @{
|
86 |
|
|
LONGEST l;
|
87 |
|
|
DOUBLEST d;
|
88 |
|
|
@};
|
89 |
|
|
@end example
|
90 |
|
|
@noindent
|
91 |
|
|
where @code{LONGEST} and @code{DOUBLEST} are @code{typedef} names for
|
92 |
|
|
the largest integer and floating point types on the machine.
|
93 |
|
|
|
94 |
|
|
By the time the bytecode interpreter reaches the end of the expression,
|
95 |
|
|
the value of the expression should be the only value left on the stack.
|
96 |
|
|
For tracing applications, @code{trace} bytecodes in the expression will
|
97 |
|
|
have recorded the necessary data, and the value on the stack may be
|
98 |
|
|
discarded. For other applications, like conditional breakpoints, the
|
99 |
|
|
value may be useful.
|
100 |
|
|
|
101 |
|
|
Separate from the stack, the interpreter has two registers:
|
102 |
|
|
@table @code
|
103 |
|
|
@item pc
|
104 |
|
|
The address of the next bytecode to execute.
|
105 |
|
|
|
106 |
|
|
@item start
|
107 |
|
|
The address of the start of the bytecode expression, necessary for
|
108 |
|
|
interpreting the @code{goto} and @code{if_goto} instructions.
|
109 |
|
|
|
110 |
|
|
@end table
|
111 |
|
|
@noindent
|
112 |
|
|
Neither of these registers is directly visible to the bytecode language
|
113 |
|
|
itself, but they are useful for defining the meanings of the bytecode
|
114 |
|
|
operations.
|
115 |
|
|
|
116 |
|
|
There are no instructions to perform side effects on the running
|
117 |
|
|
program, or call the program's functions; we assume that these
|
118 |
|
|
expressions are only used for unobtrusive debugging, not for patching
|
119 |
|
|
the running code.
|
120 |
|
|
|
121 |
|
|
Most bytecode instructions do not distinguish between the various sizes
|
122 |
|
|
of values, and operate on full-width values; the upper bits of the
|
123 |
|
|
values are simply ignored, since they do not usually make a difference
|
124 |
|
|
to the value computed. The exceptions to this rule are:
|
125 |
|
|
@table @asis
|
126 |
|
|
|
127 |
|
|
@item memory reference instructions (@code{ref}@var{n})
|
128 |
|
|
There are distinct instructions to fetch different word sizes from
|
129 |
|
|
memory. Once on the stack, however, the values are treated as full-size
|
130 |
|
|
integers. They may need to be sign-extended; the @code{ext} instruction
|
131 |
|
|
exists for this purpose.
|
132 |
|
|
|
133 |
|
|
@item the sign-extension instruction (@code{ext} @var{n})
|
134 |
|
|
These clearly need to know which portion of their operand is to be
|
135 |
|
|
extended to occupy the full length of the word.
|
136 |
|
|
|
137 |
|
|
@end table
|
138 |
|
|
|
139 |
|
|
If the interpreter is unable to evaluate an expression completely for
|
140 |
|
|
some reason (a memory location is inaccessible, or a divisor is zero,
|
141 |
|
|
for example), we say that interpretation ``terminates with an error''.
|
142 |
|
|
This means that the problem is reported back to the interpreter's caller
|
143 |
|
|
in some helpful way. In general, code using agent expressions should
|
144 |
|
|
assume that they may attempt to divide by zero, fetch arbitrary memory
|
145 |
|
|
locations, and misbehave in other ways.
|
146 |
|
|
|
147 |
|
|
Even complicated C expressions compile to a few bytecode instructions;
|
148 |
|
|
for example, the expression @code{x + y * z} would typically produce
|
149 |
|
|
code like the following, assuming that @code{x} and @code{y} live in
|
150 |
|
|
registers, and @code{z} is a global variable holding a 32-bit
|
151 |
|
|
@code{int}:
|
152 |
|
|
@example
|
153 |
|
|
reg 1
|
154 |
|
|
reg 2
|
155 |
|
|
const32 @i{address of z}
|
156 |
|
|
ref32
|
157 |
|
|
ext 32
|
158 |
|
|
mul
|
159 |
|
|
add
|
160 |
|
|
end
|
161 |
|
|
@end example
|
162 |
|
|
|
163 |
|
|
In detail, these mean:
|
164 |
|
|
@table @code
|
165 |
|
|
|
166 |
|
|
@item reg 1
|
167 |
|
|
Push the value of register 1 (presumably holding @code{x}) onto the
|
168 |
|
|
stack.
|
169 |
|
|
|
170 |
|
|
@item reg 2
|
171 |
|
|
Push the value of register 2 (holding @code{y}).
|
172 |
|
|
|
173 |
|
|
@item const32 @i{address of z}
|
174 |
|
|
Push the address of @code{z} onto the stack.
|
175 |
|
|
|
176 |
|
|
@item ref32
|
177 |
|
|
Fetch a 32-bit word from the address at the top of the stack; replace
|
178 |
|
|
the address on the stack with the value. Thus, we replace the address
|
179 |
|
|
of @code{z} with @code{z}'s value.
|
180 |
|
|
|
181 |
|
|
@item ext 32
|
182 |
|
|
Sign-extend the value on the top of the stack from 32 bits to full
|
183 |
|
|
length. This is necessary because @code{z} is a signed integer.
|
184 |
|
|
|
185 |
|
|
@item mul
|
186 |
|
|
Pop the top two numbers on the stack, multiply them, and push their
|
187 |
|
|
product. Now the top of the stack contains the value of the expression
|
188 |
|
|
@code{y * z}.
|
189 |
|
|
|
190 |
|
|
@item add
|
191 |
|
|
Pop the top two numbers, add them, and push the sum. Now the top of the
|
192 |
|
|
stack contains the value of @code{x + y * z}.
|
193 |
|
|
|
194 |
|
|
@item end
|
195 |
|
|
Stop executing; the value left on the stack top is the value to be
|
196 |
|
|
recorded.
|
197 |
|
|
|
198 |
|
|
@end table
|
199 |
|
|
|
200 |
|
|
|
201 |
|
|
@node Bytecode Descriptions
|
202 |
|
|
@section Bytecode Descriptions
|
203 |
|
|
|
204 |
|
|
Each bytecode description has the following form:
|
205 |
|
|
|
206 |
|
|
@table @asis
|
207 |
|
|
|
208 |
|
|
@item @code{add} (0x02): @var{a} @var{b} @result{} @var{a+b}
|
209 |
|
|
|
210 |
|
|
Pop the top two stack items, @var{a} and @var{b}, as integers; push
|
211 |
|
|
their sum, as an integer.
|
212 |
|
|
|
213 |
|
|
@end table
|
214 |
|
|
|
215 |
|
|
In this example, @code{add} is the name of the bytecode, and
|
216 |
|
|
@code{(0x02)} is the one-byte value used to encode the bytecode, in
|
217 |
|
|
hexidecimal. The phrase ``@var{a} @var{b} @result{} @var{a+b}'' shows
|
218 |
|
|
the stack before and after the bytecode executes. Beforehand, the stack
|
219 |
|
|
must contain at least two values, @var{a} and @var{b}; since the top of
|
220 |
|
|
the stack is to the right, @var{b} is on the top of the stack, and
|
221 |
|
|
@var{a} is underneath it. After execution, the bytecode will have
|
222 |
|
|
popped @var{a} and @var{b} from the stack, and replaced them with a
|
223 |
|
|
single value, @var{a+b}. There may be other values on the stack below
|
224 |
|
|
those shown, but the bytecode affects only those shown.
|
225 |
|
|
|
226 |
|
|
Here is another example:
|
227 |
|
|
|
228 |
|
|
@table @asis
|
229 |
|
|
|
230 |
|
|
@item @code{const8} (0x22) @var{n}: @result{} @var{n}
|
231 |
|
|
Push the 8-bit integer constant @var{n} on the stack, without sign
|
232 |
|
|
extension.
|
233 |
|
|
|
234 |
|
|
@end table
|
235 |
|
|
|
236 |
|
|
In this example, the bytecode @code{const8} takes an operand @var{n}
|
237 |
|
|
directly from the bytecode stream; the operand follows the @code{const8}
|
238 |
|
|
bytecode itself. We write any such operands immediately after the name
|
239 |
|
|
of the bytecode, before the colon, and describe the exact encoding of
|
240 |
|
|
the operand in the bytecode stream in the body of the bytecode
|
241 |
|
|
description.
|
242 |
|
|
|
243 |
|
|
For the @code{const8} bytecode, there are no stack items given before
|
244 |
|
|
the @result{}; this simply means that the bytecode consumes no values
|
245 |
|
|
from the stack. If a bytecode consumes no values, or produces no
|
246 |
|
|
values, the list on either side of the @result{} may be empty.
|
247 |
|
|
|
248 |
|
|
If a value is written as @var{a}, @var{b}, or @var{n}, then the bytecode
|
249 |
|
|
treats it as an integer. If a value is written is @var{addr}, then the
|
250 |
|
|
bytecode treats it as an address.
|
251 |
|
|
|
252 |
|
|
We do not fully describe the floating point operations here; although
|
253 |
|
|
this design can be extended in a clean way to handle floating point
|
254 |
|
|
values, they are not of immediate interest to the customer, so we avoid
|
255 |
|
|
describing them, to save time.
|
256 |
|
|
|
257 |
|
|
|
258 |
|
|
@table @asis
|
259 |
|
|
|
260 |
|
|
@item @code{float} (0x01): @result{}
|
261 |
|
|
|
262 |
|
|
Prefix for floating-point bytecodes. Not implemented yet.
|
263 |
|
|
|
264 |
|
|
@item @code{add} (0x02): @var{a} @var{b} @result{} @var{a+b}
|
265 |
|
|
Pop two integers from the stack, and push their sum, as an integer.
|
266 |
|
|
|
267 |
|
|
@item @code{sub} (0x03): @var{a} @var{b} @result{} @var{a-b}
|
268 |
|
|
Pop two integers from the stack, subtract the top value from the
|
269 |
|
|
next-to-top value, and push the difference.
|
270 |
|
|
|
271 |
|
|
@item @code{mul} (0x04): @var{a} @var{b} @result{} @var{a*b}
|
272 |
|
|
Pop two integers from the stack, multiply them, and push the product on
|
273 |
|
|
the stack. Note that, when one multiplies two @var{n}-bit numbers
|
274 |
|
|
yielding another @var{n}-bit number, it is irrelevant whether the
|
275 |
|
|
numbers are signed or not; the results are the same.
|
276 |
|
|
|
277 |
|
|
@item @code{div_signed} (0x05): @var{a} @var{b} @result{} @var{a/b}
|
278 |
|
|
Pop two signed integers from the stack; divide the next-to-top value by
|
279 |
|
|
the top value, and push the quotient. If the divisor is zero, terminate
|
280 |
|
|
with an error.
|
281 |
|
|
|
282 |
|
|
@item @code{div_unsigned} (0x06): @var{a} @var{b} @result{} @var{a/b}
|
283 |
|
|
Pop two unsigned integers from the stack; divide the next-to-top value
|
284 |
|
|
by the top value, and push the quotient. If the divisor is zero,
|
285 |
|
|
terminate with an error.
|
286 |
|
|
|
287 |
|
|
@item @code{rem_signed} (0x07): @var{a} @var{b} @result{} @var{a modulo b}
|
288 |
|
|
Pop two signed integers from the stack; divide the next-to-top value by
|
289 |
|
|
the top value, and push the remainder. If the divisor is zero,
|
290 |
|
|
terminate with an error.
|
291 |
|
|
|
292 |
|
|
@item @code{rem_unsigned} (0x08): @var{a} @var{b} @result{} @var{a modulo b}
|
293 |
|
|
Pop two unsigned integers from the stack; divide the next-to-top value
|
294 |
|
|
by the top value, and push the remainder. If the divisor is zero,
|
295 |
|
|
terminate with an error.
|
296 |
|
|
|
297 |
|
|
@item @code{lsh} (0x09): @var{a} @var{b} @result{} @var{a<<b}
|
298 |
|
|
Pop two integers from the stack; let @var{a} be the next-to-top value,
|
299 |
|
|
and @var{b} be the top value. Shift @var{a} left by @var{b} bits, and
|
300 |
|
|
push the result.
|
301 |
|
|
|
302 |
|
|
@item @code{rsh_signed} (0x0a): @var{a} @var{b} @result{} @var{@code{(signed)}a>>b}
|
303 |
|
|
Pop two integers from the stack; let @var{a} be the next-to-top value,
|
304 |
|
|
and @var{b} be the top value. Shift @var{a} right by @var{b} bits,
|
305 |
|
|
inserting copies of the top bit at the high end, and push the result.
|
306 |
|
|
|
307 |
|
|
@item @code{rsh_unsigned} (0x0b): @var{a} @var{b} @result{} @var{a>>b}
|
308 |
|
|
Pop two integers from the stack; let @var{a} be the next-to-top value,
|
309 |
|
|
and @var{b} be the top value. Shift @var{a} right by @var{b} bits,
|
310 |
|
|
inserting zero bits at the high end, and push the result.
|
311 |
|
|
|
312 |
|
|
@item @code{log_not} (0x0e): @var{a} @result{} @var{!a}
|
313 |
|
|
Pop an integer from the stack; if it is zero, push the value one;
|
314 |
|
|
otherwise, push the value zero.
|
315 |
|
|
|
316 |
|
|
@item @code{bit_and} (0x0f): @var{a} @var{b} @result{} @var{a&b}
|
317 |
|
|
Pop two integers from the stack, and push their bitwise @code{and}.
|
318 |
|
|
|
319 |
|
|
@item @code{bit_or} (0x10): @var{a} @var{b} @result{} @var{a|b}
|
320 |
|
|
Pop two integers from the stack, and push their bitwise @code{or}.
|
321 |
|
|
|
322 |
|
|
@item @code{bit_xor} (0x11): @var{a} @var{b} @result{} @var{a^b}
|
323 |
|
|
Pop two integers from the stack, and push their bitwise
|
324 |
|
|
exclusive-@code{or}.
|
325 |
|
|
|
326 |
|
|
@item @code{bit_not} (0x12): @var{a} @result{} @var{~a}
|
327 |
|
|
Pop an integer from the stack, and push its bitwise complement.
|
328 |
|
|
|
329 |
|
|
@item @code{equal} (0x13): @var{a} @var{b} @result{} @var{a=b}
|
330 |
|
|
Pop two integers from the stack; if they are equal, push the value one;
|
331 |
|
|
otherwise, push the value zero.
|
332 |
|
|
|
333 |
|
|
@item @code{less_signed} (0x14): @var{a} @var{b} @result{} @var{a<b}
|
334 |
|
|
Pop two signed integers from the stack; if the next-to-top value is less
|
335 |
|
|
than the top value, push the value one; otherwise, push the value zero.
|
336 |
|
|
|
337 |
|
|
@item @code{less_unsigned} (0x15): @var{a} @var{b} @result{} @var{a<b}
|
338 |
|
|
Pop two unsigned integers from the stack; if the next-to-top value is less
|
339 |
|
|
than the top value, push the value one; otherwise, push the value zero.
|
340 |
|
|
|
341 |
|
|
@item @code{ext} (0x16) @var{n}: @var{a} @result{} @var{a}, sign-extended from @var{n} bits
|
342 |
|
|
Pop an unsigned value from the stack; treating it as an @var{n}-bit
|
343 |
|
|
twos-complement value, extend it to full length. This means that all
|
344 |
|
|
bits to the left of bit @var{n-1} (where the least significant bit is bit
|
345 |
|
|
0) are set to the value of bit @var{n-1}. Note that @var{n} may be
|
346 |
|
|
larger than or equal to the width of the stack elements of the bytecode
|
347 |
|
|
engine; in this case, the bytecode should have no effect.
|
348 |
|
|
|
349 |
|
|
The number of source bits to preserve, @var{n}, is encoded as a single
|
350 |
|
|
byte unsigned integer following the @code{ext} bytecode.
|
351 |
|
|
|
352 |
|
|
@item @code{zero_ext} (0x2a) @var{n}: @var{a} @result{} @var{a}, zero-extended from @var{n} bits
|
353 |
|
|
Pop an unsigned value from the stack; zero all but the bottom @var{n}
|
354 |
|
|
bits. This means that all bits to the left of bit @var{n-1} (where the
|
355 |
|
|
least significant bit is bit 0) are set to the value of bit @var{n-1}.
|
356 |
|
|
|
357 |
|
|
The number of source bits to preserve, @var{n}, is encoded as a single
|
358 |
|
|
byte unsigned integer following the @code{zero_ext} bytecode.
|
359 |
|
|
|
360 |
|
|
@item @code{ref8} (0x17): @var{addr} @result{} @var{a}
|
361 |
|
|
@itemx @code{ref16} (0x18): @var{addr} @result{} @var{a}
|
362 |
|
|
@itemx @code{ref32} (0x19): @var{addr} @result{} @var{a}
|
363 |
|
|
@itemx @code{ref64} (0x1a): @var{addr} @result{} @var{a}
|
364 |
|
|
Pop an address @var{addr} from the stack. For bytecode
|
365 |
|
|
@code{ref}@var{n}, fetch an @var{n}-bit value from @var{addr}, using the
|
366 |
|
|
natural target endianness. Push the fetched value as an unsigned
|
367 |
|
|
integer.
|
368 |
|
|
|
369 |
|
|
Note that @var{addr} may not be aligned in any particular way; the
|
370 |
|
|
@code{ref@var{n}} bytecodes should operate correctly for any address.
|
371 |
|
|
|
372 |
|
|
If attempting to access memory at @var{addr} would cause a processor
|
373 |
|
|
exception of some sort, terminate with an error.
|
374 |
|
|
|
375 |
|
|
@item @code{ref_float} (0x1b): @var{addr} @result{} @var{d}
|
376 |
|
|
@itemx @code{ref_double} (0x1c): @var{addr} @result{} @var{d}
|
377 |
|
|
@itemx @code{ref_long_double} (0x1d): @var{addr} @result{} @var{d}
|
378 |
|
|
@itemx @code{l_to_d} (0x1e): @var{a} @result{} @var{d}
|
379 |
|
|
@itemx @code{d_to_l} (0x1f): @var{d} @result{} @var{a}
|
380 |
|
|
Not implemented yet.
|
381 |
|
|
|
382 |
|
|
@item @code{dup} (0x28): @var{a} => @var{a} @var{a}
|
383 |
|
|
Push another copy of the stack's top element.
|
384 |
|
|
|
385 |
|
|
@item @code{swap} (0x2b): @var{a} @var{b} => @var{b} @var{a}
|
386 |
|
|
Exchange the top two items on the stack.
|
387 |
|
|
|
388 |
|
|
@item @code{pop} (0x29): @var{a} =>
|
389 |
|
|
Discard the top value on the stack.
|
390 |
|
|
|
391 |
|
|
@item @code{if_goto} (0x20) @var{offset}: @var{a} @result{}
|
392 |
|
|
Pop an integer off the stack; if it is non-zero, branch to the given
|
393 |
|
|
offset in the bytecode string. Otherwise, continue to the next
|
394 |
|
|
instruction in the bytecode stream. In other words, if @var{a} is
|
395 |
|
|
non-zero, set the @code{pc} register to @code{start} + @var{offset}.
|
396 |
|
|
Thus, an offset of zero denotes the beginning of the expression.
|
397 |
|
|
|
398 |
|
|
The @var{offset} is stored as a sixteen-bit unsigned value, stored
|
399 |
|
|
immediately following the @code{if_goto} bytecode. It is always stored
|
400 |
|
|
most signficant byte first, regardless of the target's normal
|
401 |
|
|
endianness. The offset is not guaranteed to fall at any particular
|
402 |
|
|
alignment within the bytecode stream; thus, on machines where fetching a
|
403 |
|
|
16-bit on an unaligned address raises an exception, you should fetch the
|
404 |
|
|
offset one byte at a time.
|
405 |
|
|
|
406 |
|
|
@item @code{goto} (0x21) @var{offset}: @result{}
|
407 |
|
|
Branch unconditionally to @var{offset}; in other words, set the
|
408 |
|
|
@code{pc} register to @code{start} + @var{offset}.
|
409 |
|
|
|
410 |
|
|
The offset is stored in the same way as for the @code{if_goto} bytecode.
|
411 |
|
|
|
412 |
|
|
@item @code{const8} (0x22) @var{n}: @result{} @var{n}
|
413 |
|
|
@itemx @code{const16} (0x23) @var{n}: @result{} @var{n}
|
414 |
|
|
@itemx @code{const32} (0x24) @var{n}: @result{} @var{n}
|
415 |
|
|
@itemx @code{const64} (0x25) @var{n}: @result{} @var{n}
|
416 |
|
|
Push the integer constant @var{n} on the stack, without sign extension.
|
417 |
|
|
To produce a small negative value, push a small twos-complement value,
|
418 |
|
|
and then sign-extend it using the @code{ext} bytecode.
|
419 |
|
|
|
420 |
|
|
The constant @var{n} is stored in the appropriate number of bytes
|
421 |
|
|
following the @code{const}@var{b} bytecode. The constant @var{n} is
|
422 |
|
|
always stored most significant byte first, regardless of the target's
|
423 |
|
|
normal endianness. The constant is not guaranteed to fall at any
|
424 |
|
|
particular alignment within the bytecode stream; thus, on machines where
|
425 |
|
|
fetching a 16-bit on an unaligned address raises an exception, you
|
426 |
|
|
should fetch @var{n} one byte at a time.
|
427 |
|
|
|
428 |
|
|
@item @code{reg} (0x26) @var{n}: @result{} @var{a}
|
429 |
|
|
Push the value of register number @var{n}, without sign extension. The
|
430 |
|
|
registers are numbered following GDB's conventions.
|
431 |
|
|
|
432 |
|
|
The register number @var{n} is encoded as a 16-bit unsigned integer
|
433 |
|
|
immediately following the @code{reg} bytecode. It is always stored most
|
434 |
|
|
signficant byte first, regardless of the target's normal endianness.
|
435 |
|
|
The register number is not guaranteed to fall at any particular
|
436 |
|
|
alignment within the bytecode stream; thus, on machines where fetching a
|
437 |
|
|
16-bit on an unaligned address raises an exception, you should fetch the
|
438 |
|
|
register number one byte at a time.
|
439 |
|
|
|
440 |
|
|
@item @code{trace} (0x0c): @var{addr} @var{size} @result{}
|
441 |
|
|
Record the contents of the @var{size} bytes at @var{addr} in a trace
|
442 |
|
|
buffer, for later retrieval by GDB.
|
443 |
|
|
|
444 |
|
|
@item @code{trace_quick} (0x0d) @var{size}: @var{addr} @result{} @var{addr}
|
445 |
|
|
Record the contents of the @var{size} bytes at @var{addr} in a trace
|
446 |
|
|
buffer, for later retrieval by GDB. @var{size} is a single byte
|
447 |
|
|
unsigned integer following the @code{trace} opcode.
|
448 |
|
|
|
449 |
|
|
This bytecode is equivalent to the sequence @code{dup const8 @var{size}
|
450 |
|
|
trace}, but we provide it anyway to save space in bytecode strings.
|
451 |
|
|
|
452 |
|
|
@item @code{trace16} (0x30) @var{size}: @var{addr} @result{} @var{addr}
|
453 |
|
|
Identical to trace_quick, except that @var{size} is a 16-bit big-endian
|
454 |
|
|
unsigned integer, not a single byte. This should probably have been
|
455 |
|
|
named @code{trace_quick16}, for consistency.
|
456 |
|
|
|
457 |
|
|
@item @code{end} (0x27): @result{}
|
458 |
|
|
Stop executing bytecode; the result should be the top element of the
|
459 |
|
|
stack. If the purpose of the expression was to compute an lvalue or a
|
460 |
|
|
range of memory, then the next-to-top of the stack is the lvalue's
|
461 |
|
|
address, and the top of the stack is the lvalue's size, in bytes.
|
462 |
|
|
|
463 |
|
|
@end table
|
464 |
|
|
|
465 |
|
|
|
466 |
|
|
@node Using Agent Expressions
|
467 |
|
|
@section Using Agent Expressions
|
468 |
|
|
|
469 |
|
|
Here is a sketch of a full non-stop debugging cycle, showing how agent
|
470 |
|
|
expressions fit into the process.
|
471 |
|
|
|
472 |
|
|
@itemize @bullet
|
473 |
|
|
|
474 |
|
|
@item
|
475 |
|
|
The user selects trace points in the program's code at which GDB should
|
476 |
|
|
collect data.
|
477 |
|
|
|
478 |
|
|
@item
|
479 |
|
|
The user specifies expressions to evaluate at each trace point. These
|
480 |
|
|
expressions may denote objects in memory, in which case those objects'
|
481 |
|
|
contents are recorded as the program runs, or computed values, in which
|
482 |
|
|
case the values themselves are recorded.
|
483 |
|
|
|
484 |
|
|
@item
|
485 |
|
|
GDB transmits the tracepoints and their associated expressions to the
|
486 |
|
|
GDB agent, running on the debugging target.
|
487 |
|
|
|
488 |
|
|
@item
|
489 |
|
|
The agent arranges to be notified when a trace point is hit. Note that,
|
490 |
|
|
on some systems, the target operating system is completely responsible
|
491 |
|
|
for collecting the data; see @ref{Tracing on Symmetrix}.
|
492 |
|
|
|
493 |
|
|
@item
|
494 |
|
|
When execution on the target reaches a trace point, the agent evaluates
|
495 |
|
|
the expressions associated with that trace point, and records the
|
496 |
|
|
resulting values and memory ranges.
|
497 |
|
|
|
498 |
|
|
@item
|
499 |
|
|
Later, when the user selects a given trace event and inspects the
|
500 |
|
|
objects and expression values recorded, GDB talks to the agent to
|
501 |
|
|
retrieve recorded data as necessary to meet the user's requests. If the
|
502 |
|
|
user asks to see an object whose contents have not been recorded, GDB
|
503 |
|
|
reports an error.
|
504 |
|
|
|
505 |
|
|
@end itemize
|
506 |
|
|
|
507 |
|
|
|
508 |
|
|
@node Varying Target Capabilities
|
509 |
|
|
@section Varying Target Capabilities
|
510 |
|
|
|
511 |
|
|
Some targets don't support floating-point, and some would rather not
|
512 |
|
|
have to deal with @code{long long} operations. Also, different targets
|
513 |
|
|
will have different stack sizes, and different bytecode buffer lengths.
|
514 |
|
|
|
515 |
|
|
Thus, GDB needs a way to ask the target about itself. We haven't worked
|
516 |
|
|
out the details yet, but in general, GDB should be able to send the
|
517 |
|
|
target a packet asking it to describe itself. The reply should be a
|
518 |
|
|
packet whose length is explicit, so we can add new information to the
|
519 |
|
|
packet in future revisions of the agent, without confusing old versions
|
520 |
|
|
of GDB, and it should contain a version number. It should contain at
|
521 |
|
|
least the following information:
|
522 |
|
|
|
523 |
|
|
@itemize @bullet
|
524 |
|
|
|
525 |
|
|
@item
|
526 |
|
|
whether floating point is supported
|
527 |
|
|
|
528 |
|
|
@item
|
529 |
|
|
whether @code{long long} is supported
|
530 |
|
|
|
531 |
|
|
@item
|
532 |
|
|
maximum acceptable size of bytecode stack
|
533 |
|
|
|
534 |
|
|
@item
|
535 |
|
|
maximum acceptable length of bytecode expressions
|
536 |
|
|
|
537 |
|
|
@item
|
538 |
|
|
which registers are actually available for collection
|
539 |
|
|
|
540 |
|
|
@item
|
541 |
|
|
whether the target supports disabled tracepoints
|
542 |
|
|
|
543 |
|
|
@end itemize
|
544 |
|
|
|
545 |
|
|
|
546 |
|
|
|
547 |
|
|
@node Tracing on Symmetrix
|
548 |
|
|
@section Tracing on Symmetrix
|
549 |
|
|
|
550 |
|
|
This section documents the API used by the GDB agent to collect data on
|
551 |
|
|
Symmetrix systems.
|
552 |
|
|
|
553 |
|
|
Cygnus originally implemented these tracing features to help EMC
|
554 |
|
|
Corporation debug their Symmetrix high-availability disk drives. The
|
555 |
|
|
Symmetrix application code already includes substantial tracing
|
556 |
|
|
facilities; the GDB agent for the Symmetrix system uses those facilities
|
557 |
|
|
for its own data collection, via the API described here.
|
558 |
|
|
|
559 |
|
|
@deftypefn Function DTC_RESPONSE adbg_find_memory_in_frame (FRAME_DEF *@var{frame}, char *@var{address}, char **@var{buffer}, unsigned int *@var{size})
|
560 |
|
|
Search the trace frame @var{frame} for memory saved from @var{address}.
|
561 |
|
|
If the memory is available, provide the address of the buffer holding
|
562 |
|
|
it; otherwise, provide the address of the next saved area.
|
563 |
|
|
|
564 |
|
|
@itemize @bullet
|
565 |
|
|
|
566 |
|
|
@item
|
567 |
|
|
If the memory at @var{address} was saved in @var{frame}, set
|
568 |
|
|
@code{*@var{buffer}} to point to the buffer in which that memory was
|
569 |
|
|
saved, set @code{*@var{size}} to the number of bytes from @var{address}
|
570 |
|
|
that are saved at @code{*@var{buffer}}, and return
|
571 |
|
|
@code{OK_TARGET_RESPONSE}. (Clearly, in this case, the function will
|
572 |
|
|
always set @code{*@var{size}} to a value greater than zero.)
|
573 |
|
|
|
574 |
|
|
@item
|
575 |
|
|
If @var{frame} does not record any memory at @var{address}, set
|
576 |
|
|
@code{*@var{size}} to the distance from @var{address} to the start of
|
577 |
|
|
the saved region with the lowest address higher than @var{address}. If
|
578 |
|
|
there is no memory saved from any higher address, set @code{*@var{size}}
|
579 |
|
|
to zero. Return @code{NOT_FOUND_TARGET_RESPONSE}.
|
580 |
|
|
@end itemize
|
581 |
|
|
|
582 |
|
|
These two possibilities allow the caller to either retrieve the data, or
|
583 |
|
|
walk the address space to the next saved area.
|
584 |
|
|
@end deftypefn
|
585 |
|
|
|
586 |
|
|
This function allows the GDB agent to map the regions of memory saved in
|
587 |
|
|
a particular frame, and retrieve their contents efficiently.
|
588 |
|
|
|
589 |
|
|
This function also provides a clean interface between the GDB agent and
|
590 |
|
|
the Symmetrix tracing structures, making it easier to adapt the GDB
|
591 |
|
|
agent to future versions of the Symmetrix system, and vice versa. This
|
592 |
|
|
function searches all data saved in @var{frame}, whether the data is
|
593 |
|
|
there at the request of a bytecode expression, or because it falls in
|
594 |
|
|
one of the format's memory ranges, or because it was saved from the top
|
595 |
|
|
of the stack. EMC can arbitrarily change and enhance the tracing
|
596 |
|
|
mechanism, but as long as this function works properly, all collected
|
597 |
|
|
memory is visible to GDB.
|
598 |
|
|
|
599 |
|
|
The function itself is straightforward to implement. A single pass over
|
600 |
|
|
the trace frame's stack area, memory ranges, and expression blocks can
|
601 |
|
|
yield the address of the buffer (if the requested address was saved),
|
602 |
|
|
and also note the address of the next higher range of memory, to be
|
603 |
|
|
returned when the search fails.
|
604 |
|
|
|
605 |
|
|
As an example, suppose the trace frame @code{f} has saved sixteen bytes
|
606 |
|
|
from address @code{0x8000} in a buffer at @code{0x1000}, and thirty-two
|
607 |
|
|
bytes from address @code{0xc000} in a buffer at @code{0x1010}. Here are
|
608 |
|
|
some sample calls, and the effect each would have:
|
609 |
|
|
|
610 |
|
|
@table @code
|
611 |
|
|
|
612 |
|
|
@item adbg_find_memory_in_frame (f, (char*) 0x8000, &buffer, &size)
|
613 |
|
|
This would set @code{buffer} to @code{0x1000}, set @code{size} to
|
614 |
|
|
sixteen, and return @code{OK_TARGET_RESPONSE}, since @code{f} saves
|
615 |
|
|
sixteen bytes from @code{0x8000} at @code{0x1000}.
|
616 |
|
|
|
617 |
|
|
@item adbg_find_memory_in_frame (f, (char *) 0x8004, &buffer, &size)
|
618 |
|
|
This would set @code{buffer} to @code{0x1004}, set @code{size} to
|
619 |
|
|
twelve, and return @code{OK_TARGET_RESPONSE}, since @file{f} saves the
|
620 |
|
|
twelve bytes from @code{0x8004} starting four bytes into the buffer at
|
621 |
|
|
@code{0x1000}. This shows that request addresses may fall in the middle
|
622 |
|
|
of saved areas; the function should return the address and size of the
|
623 |
|
|
remainder of the buffer.
|
624 |
|
|
|
625 |
|
|
@item adbg_find_memory_in_frame (f, (char *) 0x8100, &buffer, &size)
|
626 |
|
|
This would set @code{size} to @code{0x3f00} and return
|
627 |
|
|
@code{NOT_FOUND_TARGET_RESPONSE}, since there is no memory saved in
|
628 |
|
|
@code{f} from the address @code{0x8100}, and the next memory available
|
629 |
|
|
is at @code{0x8100 + 0x3f00}, or @code{0xc000}. This shows that request
|
630 |
|
|
addresses may fall outside of all saved memory ranges; the function
|
631 |
|
|
should indicate the next saved area, if any.
|
632 |
|
|
|
633 |
|
|
@item adbg_find_memory_in_frame (f, (char *) 0x7000, &buffer, &size)
|
634 |
|
|
This would set @code{size} to @code{0x1000} and return
|
635 |
|
|
@code{NOT_FOUND_TARGET_RESPONSE}, since the next saved memory is at
|
636 |
|
|
@code{0x7000 + 0x1000}, or @code{0x8000}.
|
637 |
|
|
|
638 |
|
|
@item adbg_find_memory_in_frame (f, (char *) 0xf000, &buffer, &size)
|
639 |
|
|
This would set @code{size} to zero, and return
|
640 |
|
|
@code{NOT_FOUND_TARGET_RESPONSE}. This shows how the function tells the
|
641 |
|
|
caller that no further memory ranges have been saved.
|
642 |
|
|
|
643 |
|
|
@end table
|
644 |
|
|
|
645 |
|
|
As another example, here is a function which will print out the
|
646 |
|
|
addresses of all memory saved in the trace frame @code{frame} on the
|
647 |
|
|
Symmetrix INLINES console:
|
648 |
|
|
@example
|
649 |
|
|
void
|
650 |
|
|
print_frame_addresses (FRAME_DEF *frame)
|
651 |
|
|
@{
|
652 |
|
|
char *addr;
|
653 |
|
|
char *buffer;
|
654 |
|
|
unsigned long size;
|
655 |
|
|
|
656 |
|
|
addr = 0;
|
657 |
|
|
for (;;)
|
658 |
|
|
@{
|
659 |
|
|
/* Either find out how much memory we have here, or discover
|
660 |
|
|
where the next saved region is. */
|
661 |
|
|
if (adbg_find_memory_in_frame (frame, addr, &buffer, &size)
|
662 |
|
|
== OK_TARGET_RESPONSE)
|
663 |
|
|
printp ("saved %x to %x\n", addr, addr + size);
|
664 |
|
|
if (size == 0)
|
665 |
|
|
break;
|
666 |
|
|
addr += size;
|
667 |
|
|
@}
|
668 |
|
|
@}
|
669 |
|
|
@end example
|
670 |
|
|
|
671 |
|
|
Note that there is not necessarily any connection between the order in
|
672 |
|
|
which the data is saved in the trace frame, and the order in which
|
673 |
|
|
@code{adbg_find_memory_in_frame} will return those memory ranges. The
|
674 |
|
|
code above will always print the saved memory regions in order of
|
675 |
|
|
increasing address, while the underlying frame structure might store the
|
676 |
|
|
data in a random order.
|
677 |
|
|
|
678 |
|
|
[[This section should cover the rest of the Symmetrix functions the stub
|
679 |
|
|
relies upon, too.]]
|
680 |
|
|
|
681 |
|
|
@node Rationale
|
682 |
|
|
@section Rationale
|
683 |
|
|
|
684 |
|
|
Some of the design decisions apparent above are arguable.
|
685 |
|
|
|
686 |
|
|
@table @b
|
687 |
|
|
|
688 |
|
|
@item What about stack overflow/underflow?
|
689 |
|
|
GDB should be able to query the target to discover its stack size.
|
690 |
|
|
Given that information, GDB can determine at translation time whether a
|
691 |
|
|
given expression will overflow the stack. But this spec isn't about
|
692 |
|
|
what kinds of error-checking GDB ought to do.
|
693 |
|
|
|
694 |
|
|
@item Why are you doing everything in LONGEST?
|
695 |
|
|
|
696 |
|
|
Speed isn't important, but agent code size is; using LONGEST brings in a
|
697 |
|
|
bunch of support code to do things like division, etc. So this is a
|
698 |
|
|
serious concern.
|
699 |
|
|
|
700 |
|
|
First, note that you don't need different bytecodes for different
|
701 |
|
|
operand sizes. You can generate code without @emph{knowing} how big the
|
702 |
|
|
stack elements actually are on the target. If the target only supports
|
703 |
|
|
32-bit ints, and you don't send any 64-bit bytecodes, everything just
|
704 |
|
|
works. The observation here is that the MIPS and the Alpha have only
|
705 |
|
|
fixed-size registers, and you can still get C's semantics even though
|
706 |
|
|
most instructions only operate on full-sized words. You just need to
|
707 |
|
|
make sure everything is properly sign-extended at the right times. So
|
708 |
|
|
there is no need for 32- and 64-bit variants of the bytecodes. Just
|
709 |
|
|
implement everything using the largest size you support.
|
710 |
|
|
|
711 |
|
|
GDB should certainly check to see what sizes the target supports, so the
|
712 |
|
|
user can get an error earlier, rather than later. But this information
|
713 |
|
|
is not necessary for correctness.
|
714 |
|
|
|
715 |
|
|
|
716 |
|
|
@item Why don't you have @code{>} or @code{<=} operators?
|
717 |
|
|
I want to keep the interpreter small, and we don't need them. We can
|
718 |
|
|
combine the @code{less_} opcodes with @code{log_not}, and swap the order
|
719 |
|
|
of the operands, yielding all four asymmetrical comparison operators.
|
720 |
|
|
For example, @code{(x <= y)} is @code{! (x > y)}, which is @code{! (y <
|
721 |
|
|
x)}.
|
722 |
|
|
|
723 |
|
|
@item Why do you have @code{log_not}?
|
724 |
|
|
@itemx Why do you have @code{ext}?
|
725 |
|
|
@itemx Why do you have @code{zero_ext}?
|
726 |
|
|
These are all easily synthesized from other instructions, but I expect
|
727 |
|
|
them to be used frequently, and they're simple, so I include them to
|
728 |
|
|
keep bytecode strings short.
|
729 |
|
|
|
730 |
|
|
@code{log_not} is equivalent to @code{const8 0 equal}; it's used in half
|
731 |
|
|
the relational operators.
|
732 |
|
|
|
733 |
|
|
@code{ext @var{n}} is equivalent to @code{const8 @var{s-n} lsh const8
|
734 |
|
|
@var{s-n} rsh_signed}, where @var{s} is the size of the stack elements;
|
735 |
|
|
it follows @code{ref@var{m}} and @var{reg} bytecodes when the value
|
736 |
|
|
should be signed. See the next bulleted item.
|
737 |
|
|
|
738 |
|
|
@code{zero_ext @var{n}} is equivalent to @code{const@var{m} @var{mask}
|
739 |
|
|
log_and}; it's used whenever we push the value of a register, because we
|
740 |
|
|
can't assume the upper bits of the register aren't garbage.
|
741 |
|
|
|
742 |
|
|
@item Why not have sign-extending variants of the @code{ref} operators?
|
743 |
|
|
Because that would double the number of @code{ref} operators, and we
|
744 |
|
|
need the @code{ext} bytecode anyway for accessing bitfields.
|
745 |
|
|
|
746 |
|
|
@item Why not have constant-address variants of the @code{ref} operators?
|
747 |
|
|
Because that would double the number of @code{ref} operators again, and
|
748 |
|
|
@code{const32 @var{address} ref32} is only one byte longer.
|
749 |
|
|
|
750 |
|
|
@item Why do the @code{ref@var{n}} operators have to support unaligned fetches?
|
751 |
|
|
GDB will generate bytecode that fetches multi-byte values at unaligned
|
752 |
|
|
addresses whenever the executable's debugging information tells it to.
|
753 |
|
|
Furthermore, GDB does not know the value the pointer will have when GDB
|
754 |
|
|
generates the bytecode, so it cannot determine whether a particular
|
755 |
|
|
fetch will be aligned or not.
|
756 |
|
|
|
757 |
|
|
In particular, structure bitfields may be several bytes long, but follow
|
758 |
|
|
no alignment rules; members of packed structures are not necessarily
|
759 |
|
|
aligned either.
|
760 |
|
|
|
761 |
|
|
In general, there are many cases where unaligned references occur in
|
762 |
|
|
correct C code, either at the programmer's explicit request, or at the
|
763 |
|
|
compiler's discretion. Thus, it is simpler to make the GDB agent
|
764 |
|
|
bytecodes work correctly in all circumstances than to make GDB guess in
|
765 |
|
|
each case whether the compiler did the usual thing.
|
766 |
|
|
|
767 |
|
|
@item Why are there no side-effecting operators?
|
768 |
|
|
Because our current client doesn't want them? That's a cheap answer. I
|
769 |
|
|
think the real answer is that I'm afraid of implementing function
|
770 |
|
|
calls. We should re-visit this issue after the present contract is
|
771 |
|
|
delivered.
|
772 |
|
|
|
773 |
|
|
@item Why aren't the @code{goto} ops PC-relative?
|
774 |
|
|
The interpreter has the base address around anyway for PC bounds
|
775 |
|
|
checking, and it seemed simpler.
|
776 |
|
|
|
777 |
|
|
@item Why is there only one offset size for the @code{goto} ops?
|
778 |
|
|
Offsets are currently sixteen bits. I'm not happy with this situation
|
779 |
|
|
either:
|
780 |
|
|
|
781 |
|
|
Suppose we have multiple branch ops with different offset sizes. As I
|
782 |
|
|
generate code left-to-right, all my jumps are forward jumps (there are
|
783 |
|
|
no loops in expressions), so I never know the target when I emit the
|
784 |
|
|
jump opcode. Thus, I have to either always assume the largest offset
|
785 |
|
|
size, or do jump relaxation on the code after I generate it, which seems
|
786 |
|
|
like a big waste of time.
|
787 |
|
|
|
788 |
|
|
I can imagine a reasonable expression being longer than 256 bytes. I
|
789 |
|
|
can't imagine one being longer than 64k. Thus, we need 16-bit offsets.
|
790 |
|
|
This kind of reasoning is so bogus, but relaxation is pathetic.
|
791 |
|
|
|
792 |
|
|
The other approach would be to generate code right-to-left. Then I'd
|
793 |
|
|
always know my offset size. That might be fun.
|
794 |
|
|
|
795 |
|
|
@item Where is the function call bytecode?
|
796 |
|
|
|
797 |
|
|
When we add side-effects, we should add this.
|
798 |
|
|
|
799 |
|
|
@item Why does the @code{reg} bytecode take a 16-bit register number?
|
800 |
|
|
|
801 |
|
|
Intel's IA-64 architecture has 128 general-purpose registers,
|
802 |
|
|
and 128 floating-point registers, and I'm sure it has some random
|
803 |
|
|
control registers.
|
804 |
|
|
|
805 |
|
|
@item Why do we need @code{trace} and @code{trace_quick}?
|
806 |
|
|
Because GDB needs to record all the memory contents and registers an
|
807 |
|
|
expression touches. If the user wants to evaluate an expression
|
808 |
|
|
@code{x->y->z}, the agent must record the values of @code{x} and
|
809 |
|
|
@code{x->y} as well as the value of @code{x->y->z}.
|
810 |
|
|
|
811 |
|
|
@item Don't the @code{trace} bytecodes make the interpreter less general?
|
812 |
|
|
They do mean that the interpreter contains special-purpose code, but
|
813 |
|
|
that doesn't mean the interpreter can only be used for that purpose. If
|
814 |
|
|
an expression doesn't use the @code{trace} bytecodes, they don't get in
|
815 |
|
|
its way.
|
816 |
|
|
|
817 |
|
|
@item Why doesn't @code{trace_quick} consume its arguments the way everything else does?
|
818 |
|
|
In general, you do want your operators to consume their arguments; it's
|
819 |
|
|
consistent, and generally reduces the amount of stack rearrangement
|
820 |
|
|
necessary. However, @code{trace_quick} is a kludge to save space; it
|
821 |
|
|
only exists so we needn't write @code{dup const8 @var{SIZE} trace}
|
822 |
|
|
before every memory reference. Therefore, it's okay for it not to
|
823 |
|
|
consume its arguments; it's meant for a specific context in which we
|
824 |
|
|
know exactly what it should do with the stack. If we're going to have a
|
825 |
|
|
kludge, it should be an effective kludge.
|
826 |
|
|
|
827 |
|
|
@item Why does @code{trace16} exist?
|
828 |
|
|
That opcode was added by the customer that contracted Cygnus for the
|
829 |
|
|
data tracing work. I personally think it is unnecessary; objects that
|
830 |
|
|
large will be quite rare, so it is okay to use @code{dup const16
|
831 |
|
|
@var{size} trace} in those cases.
|
832 |
|
|
|
833 |
|
|
Whatever we decide to do with @code{trace16}, we should at least leave
|
834 |
|
|
opcode 0x30 reserved, to remain compatible with the customer who added
|
835 |
|
|
it.
|
836 |
|
|
|
837 |
|
|
@end table
|
838 |
|
|
|
839 |
|
|
@bye
|