OpenCores
URL https://opencores.org/ocsvn/s6soc/s6soc/trunk

Subversion Repositories s6soc

[/] [s6soc/] [trunk/] [rtl/] [cpu/] [cpudefs.v] - Blame information for rev 51

Details | Compare with Previous | View Log

Line No. Rev Author Line
1 46 dgisselq
////////////////////////////////////////////////////////////////////////////////
2 2 dgisselq
//
3
// Filename:    cpudefs.v
4
//
5
// Project:     Zip CPU -- a small, lightweight, RISC CPU soft core
6
//
7
// Purpose:     Some architectures have some needs, others have other needs.
8
//              Some of my projects need a Zip CPU with pipelining, others
9
//      can't handle the timing required to get the answer from the ALU
10
//      back into the input for the ALU.  As each different projects has
11
//      different needs, I can either 1) reconfigure my entire baseline prior
12
//      to building each project, or 2) host a configuration file which contains
13
//      the information regarding each baseline.  This file is that
14
//      configuration file.  It controls how the CPU (not the system,
15
//      peripherals, or other) is defined and implemented.  Several options
16
//      are available within here, making the Zip CPU pipelined or not,
17
//      able to handle a faster clock with more stalls or a slower clock with
18
//      no stalls, etc.
19
//
20
//      This file encapsulates those control options.
21
//
22
//      The number of LUTs the Zip CPU uses varies dramatically with the
23
//      options defined in this file.
24
//
25
//
26
// Creator:     Dan Gisselquist, Ph.D.
27
//              Gisselquist Technology, LLC
28
//
29 46 dgisselq
////////////////////////////////////////////////////////////////////////////////
30 2 dgisselq
//
31 46 dgisselq
// Copyright (C) 2015-2016, Gisselquist Technology, LLC
32 2 dgisselq
//
33
// This program is free software (firmware): you can redistribute it and/or
34
// modify it under the terms of  the GNU General Public License as published
35
// by the Free Software Foundation, either version 3 of the License, or (at
36
// your option) any later version.
37
//
38
// This program is distributed in the hope that it will be useful, but WITHOUT
39
// ANY WARRANTY; without even the implied warranty of MERCHANTIBILITY or
40
// FITNESS FOR A PARTICULAR PURPOSE.  See the GNU General Public License
41
// for more details.
42
//
43 46 dgisselq
// You should have received a copy of the GNU General Public License along
44 51 dgisselq
// with this program.  (It's in the $(ROOT)/doc directory.  Run make with no
45 46 dgisselq
// target there if the PDF file isn't present.)  If not, see
46
// <http://www.gnu.org/licenses/> for a copy.
47
//
48 2 dgisselq
// License:     GPL, v3, as defined and found on www.gnu.org,
49
//              http://www.gnu.org/licenses/gpl.html
50
//
51
//
52 46 dgisselq
////////////////////////////////////////////////////////////////////////////////
53
//
54
//
55 2 dgisselq
`ifndef CPUDEFS_H
56
`define CPUDEFS_H
57
//
58
//
59
// The first couple options control the Zip CPU instruction set, and how
60
// it handles various instructions within the set:
61
//
62
//
63
// OPT_ILLEGAL_INSTRUCTION is part of a new section of code that is supposed
64
// to recognize illegal instructions and interrupt the CPU whenever one such
65
// instruction is encountered.  The goal is to create a soft floating point
66
// unit via this approach, that can then be replaced with a true floating point
67
// unit.  As I'm not there yet, it just catches illegal instructions and
68
// interrupts the CPU on any such instruction--when defined.  Otherwise,
69
// illegal instructions are quietly ignored and their behaviour is ...
70
// undefined. (Many get treated like NOOPs ...)
71
//
72 46 dgisselq
// I recommend setting this flag so highly, that I'm likely going to remove
73
// the option to turn this off in future versions of this CPU.
74 2 dgisselq
//
75
`define OPT_ILLEGAL_INSTRUCTION
76
//
77
//
78
//
79
// OPT_MULTIPLY controls whether or not the multiply is built and included
80
// in the ALU by default.  Set this option and a parameter will be set that
81
// includes the multiply.  (This parameter may still be overridden, as with
82
// any parameter ...)  If the multiply is not included and
83
// OPT_ILLEGAL_INSTRUCTION is set, then the multiply will create an illegal
84
// instruction that will then trip the illegal instruction trap.
85
//
86 46 dgisselq
// Either not defining this value, or defining it to zero will disable the
87
// hardware multiply.  A value of '1' will cause the multiply to occurr in one
88
// clock cycle only--often at the expense of the rest of the CPUs speed.
89
// A value of 2 will cause the multiply to have a single delay cycle, 3 will
90
// have two delay cycles, and 4 (or more) will have 3 delay cycles.
91 2 dgisselq
//
92
//
93 46 dgisselq
`define OPT_MULTIPLY    4
94 2 dgisselq
//
95
//
96 46 dgisselq
//
97 2 dgisselq
// OPT_DIVIDE controls whether or not the divide instruction is built and
98
// included into the ZipCPU by default.  Set this option and a parameter will
99
// be set that causes the divide unit to be included.  (This parameter may
100
// still be overridden, as with any parameter ...)  If the divide is not
101
// included and OPT_ILLEGAL_INSTRUCTION is set, then the multiply will create
102
// an illegal instruction exception that will send the CPU into supervisor
103
// mode.
104
//
105
//
106 51 dgisselq
`define OPT_DIVIDE
107 2 dgisselq
//
108
//
109
//
110
// OPT_IMPLEMENT_FPU will (one day) control whether or not the floating point
111 51 dgisselq
// unit (once I have one) is built and included into the ZipCPU by default.
112 2 dgisselq
// At that time, if this option is set then a parameter will be set that
113
// causes the floating point unit to be included.  (This parameter may
114
// still be overridden, as with any parameter ...)  If the floating point unit
115
// is not included and OPT_ILLEGAL_INSTRUCTION is set, then as with the
116
// multiply and divide any floating point instruction will result in an illegal
117
// instruction exception that will send the CPU into supervisor mode.
118
//
119
//
120
// `define      OPT_IMPLEMENT_FPU
121
//
122
//
123
//
124
//
125 51 dgisselq
// OPT_SINGLE_FETCH controls whether or not the prefetch has a cache, and
126 2 dgisselq
// whether or not it can issue one instruction per clock.  When set, the
127
// prefetch has no cache, and only one instruction is fetched at a time.
128 51 dgisselq
// This effectively sets the CPU so that only one instruction is ever
129
// in the pipeline at once, and hence you may think of this as a "kill
130 2 dgisselq
// pipeline" option.  However, since the pipelined fetch component uses so
131
// much area on the FPGA, this is an important option to use in trimming down
132
// used area if necessary.  Hence, it needs to be maintained for that purpose.
133
// Be aware, though, it will drop your performance by a factor between 2x and
134
// 3x.
135
//
136
// We can either pipeline our fetches, or issue one fetch at a time.  Pipelined
137
// fetches are more complicated and therefore use more FPGA resources, while
138 51 dgisselq
// single fetches will cause the CPU to stall for about 5 stalls each
139 2 dgisselq
// instruction cycle, effectively reducing the instruction count per clock to
140
// about 0.2.  However, the area cost may be worth it.  Consider:
141
//
142
//      Slice LUTs              ZipSystem       ZipCPU
143
//      Single Fetching         2521            1734
144
//      Pipelined fetching      2796            2046
145
//      (These numbers may be dated, but should still be representative ...)
146
//
147
// I recommend only defining this if you "need" to, if area is tight and
148
// speed isn't as important.  Otherwise, just leave this undefined.
149
//
150 51 dgisselq
//`define       OPT_SINGLE_FETCH // 2047 total LUTs (savings of 181 from before)
151 2 dgisselq
//
152
//
153 51 dgisselq
// OPT_DOUBLE_FETCH is an alternative to OPT_SINGLE_FETCH.  It is designed to
154
// increase performance primarily when using an instruction memory which has
155
// one cost for a random access, and a second (lower) cost for sequential
156
// access.  The driving example behind this implementation was flash memory
157
// with 34 clocks for an initial access and 16 clocks for any subsequent access,
158
// but SDRAM memory with 27 clocks for an initial access and 1 clock for a
159
// subsequent access is also a good example.  Even block RAM might be a good
160
// example, if there were any bus delays in getting to the RAM device.  Using
161
// OPT_DOUBLE_FETCH also increases the pipeline speed, as it allows CIS
162
// instructions and therefore partial pipelining.  (No work is done to resolve
163
// pipeline conflicts past the decode stage, as is the case with full pipeline
164
// mode.
165 2 dgisselq
//
166 51 dgisselq
// Do not define OPT_DOUBLE_FETCH if you wish to fully pipeline the CPU.  Do
167
// not define both OPT_DOUBLE_FETCH and OPT_SINGLE_FETCH (the ifndef below
168
// should prevent that).
169
//
170
//
171
// // COST: about 79 LUTs over and above the SINGLE_FETCH cost [2091 LUTs]
172
`ifndef OPT_SINGLE_FETCH
173
`define OPT_DOUBLE_FETCH
174
`endif
175
//
176
//
177
//
178
// The ZipCPU ISA defines an optional compressed instruction set (CIS)
179
// complement.  This compressed instruction format allows two instructions to
180
// be packed into the same instruction word.  Some instructions can be so
181
// compressed, although not all.  Compressed instructions take the same time to
182
// complete--they are just compressed within memory to spare troubles with the
183
// prefetch.  Set OPT_CIS to include these compressed instructions as part of
184
// the instruction set.
185
//
186
//
187
// // COST: about 87 LUTs
188
//
189
`define OPT_CIS
190
//
191
//
192
//
193
//
194
// OPT_EARLY_BRANCHING is an attempt to execute a BRA statement as early
195
// as possible, to avoid as many pipeline stalls on a branch as possible.
196
// With the OPT_TRADITIONAL_PFCACHE, BRA instructions cost only a single
197
// extra stall cycle, while LJMP instructions cost two (assuming the target is
198
// in the cache).  Indeed, the result is that a BRA instruction can be used as
199
// the compiler's branch prediction optimizer: BRA's barely stall, while
200
// conditional branches will always suffer about 4 stall cycles or so.
201
//
202
// I recommend setting this flag, so as to turn early branching on---if you
203
// have the LUTs available to afford it.
204
//
205
// `define      OPT_EARLY_BRANCHING
206
//
207
//
208
//
209
//
210 2 dgisselq
// The next several options are pipeline optimization options.  They make no
211
// sense in a single instruction fetch mode, hence we #ifndef them so they
212
// are only defined if we are in a full pipelined mode (i.e. OPT_SINGLE_FETCH
213
// is not defined).
214
//
215
`ifndef OPT_SINGLE_FETCH
216 51 dgisselq
`ifndef OPT_DOUBLE_FETCH
217 2 dgisselq
//
218
//
219
//
220 51 dgisselq
// OPT_PIPELINED is the natural result and opposite of using the single
221 2 dgisselq
// instruction fetch unit.  If you are not using that unit, the ZipCPU will
222 51 dgisselq
// be pipelined.  The option is defined here more for readability than
223 2 dgisselq
// anything else, since OPT_PIPELINED makes more sense than OPT_SINGLE_FETCH,
224
// well ... that and it does a better job of explaining what is going on.
225
//
226
// In other words, leave this define alone--lest you break the ZipCPU.
227
//
228
`define OPT_PIPELINED
229
//
230
//
231
//
232
// OPT_TRADITIONAL_PFCACHE allows you to switch between one of two prefetch
233
// caches.  If enabled, a more traditional cache is implemented.  This more
234
// traditional cache (currently) uses many more LUTs, but it also reduces
235
// the stall count tremendously over the alternative hacked pipeline cache.
236
// (The traditional pfcache is also pipelined, whereas the pipeline cache
237
// implements a windowed approach to caching.)
238
//
239
// If you have the fabric to support this option, I recommend including it.
240
//
241
// `define      OPT_TRADITIONAL_PFCACHE
242
//
243
//
244
//
245
//
246
// OPT_PIPELINED_BUS_ACCESS controls whether or not LOD/STO instructions
247
// can take advantaged of pipelined bus instructions.  To be eligible, the
248
// operations must be identical (cannot pipeline loads and stores, just loads
249
// only or stores only), and the addresses must either be identical or one up
250
// from the previous address.  Further, the load/store string must all have
251
// the same conditional.  This approach gains the must use, in my humble
252
// opinion, when saving registers to or restoring registers from the stack
253
// at the beginning/end of a procedure, or when doing a context swap.
254
//
255
// I recommend setting this flag, for performance reasons, especially if your
256
// wishbone bus can handle pipelined bus accesses.
257
//
258
`define OPT_PIPELINED_BUS_ACCESS
259
//
260
//
261
//
262
//
263
//
264 51 dgisselq
`endif  // OPT_DOUBLE_FETCH
265 2 dgisselq
`endif  // OPT_SINGLE_FETCH
266
//
267
//
268
//
269
// Now let's talk about peripherals for a moment.  These next two defines
270
// control whether the DMA controller is included in the Zip System, and
271
// whether or not the 8 accounting timers are also included.  Set these to
272
// include the respective peripherals, comment them out not to.
273
//
274
// `define      INCLUDE_DMA_CONTROLLER
275
// `define      INCLUDE_ACCOUNTING_COUNTERS
276
//
277
//
278 16 dgisselq
`define DEBUG_SCOPE
279 2 dgisselq
//
280
`endif  // CPUDEFS_H

powered by: WebSVN 2.1.0

© copyright 1999-2024 OpenCores.org, equivalent to Oliscience, all rights reserved. OpenCores®, registered trademark.