Line 39... |
Line 39... |
// ANY WARRANTY; without even the implied warranty of MERCHANTIBILITY or
|
// ANY WARRANTY; without even the implied warranty of MERCHANTIBILITY or
|
// FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
|
// FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
|
// for more details.
|
// for more details.
|
//
|
//
|
// You should have received a copy of the GNU General Public License along
|
// You should have received a copy of the GNU General Public License along
|
// with this program. (It's in the $(ROOT)/doc directory, run make with no
|
// with this program. (It's in the $(ROOT)/doc directory. Run make with no
|
// target there if the PDF file isn't present.) If not, see
|
// target there if the PDF file isn't present.) If not, see
|
// <http://www.gnu.org/licenses/> for a copy.
|
// <http://www.gnu.org/licenses/> for a copy.
|
//
|
//
|
// License: GPL, v3, as defined and found on www.gnu.org,
|
// License: GPL, v3, as defined and found on www.gnu.org,
|
// http://www.gnu.org/licenses/gpl.html
|
// http://www.gnu.org/licenses/gpl.html
|
Line 101... |
Line 101... |
// included and OPT_ILLEGAL_INSTRUCTION is set, then the multiply will create
|
// included and OPT_ILLEGAL_INSTRUCTION is set, then the multiply will create
|
// an illegal instruction exception that will send the CPU into supervisor
|
// an illegal instruction exception that will send the CPU into supervisor
|
// mode.
|
// mode.
|
//
|
//
|
//
|
//
|
// `define OPT_DIVIDE
|
`define OPT_DIVIDE
|
//
|
//
|
//
|
//
|
//
|
//
|
// OPT_IMPLEMENT_FPU will (one day) control whether or not the floating point
|
// OPT_IMPLEMENT_FPU will (one day) control whether or not the floating point
|
// unit (once I have one) is built and included into the ZipCPU by default.
|
// unit (once I have one) is built and included into the ZipCPU by default.
|
Line 120... |
Line 120... |
// `define OPT_IMPLEMENT_FPU
|
// `define OPT_IMPLEMENT_FPU
|
//
|
//
|
//
|
//
|
//
|
//
|
//
|
//
|
// The instruction set defines an optional compressed instruction set (CIS)
|
|
// complement. These were at one time erroneously called Very Long Instruction
|
|
// Words. They are more appropriately referred to as compressed instructions.
|
|
// The compressed instruction format allows two instructions to be packed into
|
|
// the same instruction word. Some instructions can be compressed, not all.
|
|
// Compressed instructions take the same time to complete. Set OPT_CIS to
|
|
// include these double instructions as part of the instruction set. These
|
|
// instructions are designed to get more code density from the instruction set,
|
|
// and to hopefully take some pain off of the performance of the pre-fetch and
|
|
// instruction cache.
|
|
//
|
|
// These new instructions, however, also necessitate a change in the Zip
|
|
// CPU--the Zip CPU can no longer execute instructions atomically. It must
|
|
// now execute non-CIS instructions, or CIS instruction pairs, atomically.
|
|
// This logic has been added into the ZipCPU, but it has not (yet) been
|
|
// tested thoroughly.
|
|
//
|
|
// Oh, and the debugger and the simulator also need to be updated as well
|
|
// to properly handle these.
|
|
//
|
|
// `define OPT_CIS // Adds about 80 LUTs on a Spartan 6
|
|
//
|
|
//
|
|
//
|
|
//
|
|
// OPT_SINGLE_FETCH controls whether or not the prefetch has a cache, and
|
// OPT_SINGLE_FETCH controls whether or not the prefetch has a cache, and
|
// whether or not it can issue one instruction per clock. When set, the
|
// whether or not it can issue one instruction per clock. When set, the
|
// prefetch has no cache, and only one instruction is fetched at a time.
|
// prefetch has no cache, and only one instruction is fetched at a time.
|
// This effectively sets the CPU so that only one instruction is ever
|
// This effectively sets the CPU so that only one instruction is ever
|
// in the pipeline at once, and hence you may think of this as a "kill
|
// in the pipeline at once, and hence you may think of this as a "kill
|
Line 170... |
Line 145... |
// (These numbers may be dated, but should still be representative ...)
|
// (These numbers may be dated, but should still be representative ...)
|
//
|
//
|
// I recommend only defining this if you "need" to, if area is tight and
|
// I recommend only defining this if you "need" to, if area is tight and
|
// speed isn't as important. Otherwise, just leave this undefined.
|
// speed isn't as important. Otherwise, just leave this undefined.
|
//
|
//
|
`define OPT_SINGLE_FETCH
|
//`define OPT_SINGLE_FETCH // 2047 total LUTs (savings of 181 from before)
|
|
//
|
|
//
|
|
// OPT_DOUBLE_FETCH is an alternative to OPT_SINGLE_FETCH. It is designed to
|
|
// increase performance primarily when using an instruction memory which has
|
|
// one cost for a random access, and a second (lower) cost for sequential
|
|
// access. The driving example behind this implementation was flash memory
|
|
// with 34 clocks for an initial access and 16 clocks for any subsequent access,
|
|
// but SDRAM memory with 27 clocks for an initial access and 1 clock for a
|
|
// subsequent access is also a good example. Even block RAM might be a good
|
|
// example, if there were any bus delays in getting to the RAM device. Using
|
|
// OPT_DOUBLE_FETCH also increases the pipeline speed, as it allows CIS
|
|
// instructions and therefore partial pipelining. (No work is done to resolve
|
|
// pipeline conflicts past the decode stage, as is the case with full pipeline
|
|
// mode.
|
|
//
|
|
// Do not define OPT_DOUBLE_FETCH if you wish to fully pipeline the CPU. Do
|
|
// not define both OPT_DOUBLE_FETCH and OPT_SINGLE_FETCH (the ifndef below
|
|
// should prevent that).
|
|
//
|
|
//
|
|
// // COST: about 79 LUTs over and above the SINGLE_FETCH cost [2091 LUTs]
|
|
`ifndef OPT_SINGLE_FETCH
|
|
`define OPT_DOUBLE_FETCH
|
|
`endif
|
|
//
|
|
//
|
|
//
|
|
// The ZipCPU ISA defines an optional compressed instruction set (CIS)
|
|
// complement. This compressed instruction format allows two instructions to
|
|
// be packed into the same instruction word. Some instructions can be so
|
|
// compressed, although not all. Compressed instructions take the same time to
|
|
// complete--they are just compressed within memory to spare troubles with the
|
|
// prefetch. Set OPT_CIS to include these compressed instructions as part of
|
|
// the instruction set.
|
|
//
|
|
//
|
|
// // COST: about 87 LUTs
|
|
//
|
|
`define OPT_CIS
|
|
//
|
|
//
|
|
//
|
|
//
|
|
// OPT_EARLY_BRANCHING is an attempt to execute a BRA statement as early
|
|
// as possible, to avoid as many pipeline stalls on a branch as possible.
|
|
// With the OPT_TRADITIONAL_PFCACHE, BRA instructions cost only a single
|
|
// extra stall cycle, while LJMP instructions cost two (assuming the target is
|
|
// in the cache). Indeed, the result is that a BRA instruction can be used as
|
|
// the compiler's branch prediction optimizer: BRA's barely stall, while
|
|
// conditional branches will always suffer about 4 stall cycles or so.
|
|
//
|
|
// I recommend setting this flag, so as to turn early branching on---if you
|
|
// have the LUTs available to afford it.
|
|
//
|
|
// `define OPT_EARLY_BRANCHING
|
|
//
|
//
|
//
|
//
|
//
|
//
|
//
|
// The next several options are pipeline optimization options. They make no
|
// The next several options are pipeline optimization options. They make no
|
// sense in a single instruction fetch mode, hence we #ifndef them so they
|
// sense in a single instruction fetch mode, hence we #ifndef them so they
|
// are only defined if we are in a full pipelined mode (i.e. OPT_SINGLE_FETCH
|
// are only defined if we are in a full pipelined mode (i.e. OPT_SINGLE_FETCH
|
// is not defined).
|
// is not defined).
|
//
|
//
|
`ifndef OPT_SINGLE_FETCH
|
`ifndef OPT_SINGLE_FETCH
|
|
`ifndef OPT_DOUBLE_FETCH
|
//
|
//
|
//
|
//
|
//
|
//
|
// OPT_PIPELINED is the natural result and opposite of using the single
|
// OPT_PIPELINED is the natural result and opposite of using the single
|
// instruction fetch unit. If you are not using that unit, the ZipCPU will
|
// instruction fetch unit. If you are not using that unit, the ZipCPU will
|
Line 208... |
Line 240... |
//
|
//
|
// `define OPT_TRADITIONAL_PFCACHE
|
// `define OPT_TRADITIONAL_PFCACHE
|
//
|
//
|
//
|
//
|
//
|
//
|
// OPT_EARLY_BRANCHING is an attempt to execute a BRA statement as early
|
|
// as possible, to avoid as many pipeline stalls on a branch as possible.
|
|
// It's not tremendously successful yet--BRA's still suffer stalls,
|
|
// but I intend to keep working on this approach until the number of stalls
|
|
// gets down to one or (ideally) zero. (With the OPT_TRADITIONAL_PFCACHE, this
|
|
// gets down to a single stall cycle ...) That way a "BRA" can be used as the
|
|
// compiler's branch prediction optimizer: BRA's barely stall, while branches
|
|
// on conditions will always suffer about 4 stall cycles or so.
|
|
//
|
|
// I recommend setting this flag, so as to turn early branching on.
|
|
//
|
|
`define OPT_EARLY_BRANCHING
|
|
//
|
|
//
|
|
//
|
//
|
// OPT_PIPELINED_BUS_ACCESS controls whether or not LOD/STO instructions
|
// OPT_PIPELINED_BUS_ACCESS controls whether or not LOD/STO instructions
|
// can take advantaged of pipelined bus instructions. To be eligible, the
|
// can take advantaged of pipelined bus instructions. To be eligible, the
|
// operations must be identical (cannot pipeline loads and stores, just loads
|
// operations must be identical (cannot pipeline loads and stores, just loads
|
// only or stores only), and the addresses must either be identical or one up
|
// only or stores only), and the addresses must either be identical or one up
|
Line 241... |
Line 259... |
//
|
//
|
//
|
//
|
//
|
//
|
//
|
//
|
//
|
//
|
//
|
`endif // OPT_DOUBLE_FETCH
|
//
|
|
`endif // OPT_SINGLE_FETCH
|
`endif // OPT_SINGLE_FETCH
|
//
|
//
|
//
|
//
|
//
|
//
|
// Now let's talk about peripherals for a moment. These next two defines
|
// Now let's talk about peripherals for a moment. These next two defines
|