URL
https://opencores.org/ocsvn/neorv32/neorv32/trunk
Subversion Repositories neorv32
[/] [neorv32/] [trunk/] [README.md] - Rev 34
Go to most recent revision | Compare with Previous | Blame | View Log
# [The NEORV32 Processor](https://github.com/stnolting/neorv32) (RISC-V)[](https://travis-ci.com/stnolting/neorv32)[](https://github.com/stnolting/neorv32/blob/master/LICENSE)[](https://github.com/stnolting/neorv32/releases)## Table of Content* [Overview](#Overview)* [Project Status](#Status)* [Features](#Features)* [FPGA Implementation Results](#FPGA-Implementation-Results)* [Performance](#Performance)* [Top Entities](#Top-Entities)* [**Getting Started**](#Getting-Started)* [Contribute](#Contribute)* [Legal](#Legal)## OverviewThe NEORV32 Processor is a customizable microcontroller-like system on chip (SoC) that is basedon the RISC-V-compliant NEORV32 CPU. The project consists of two main parts:### [NEORV32 CPU](#CPU-Features)The CPU implements a `rv32i RISC-V` core with optional `C`, `E`, `M`, `U`, `Zicsr`, `Zifencei` and`PMP` (physical memory protection) extensions. It passes the official [RISC-V compliance tests](https://github.com/stnolting/neorv32_riscv_compliance)and is compliant to the *Unprivileged ISA Specification [Version 2.2](https://github.com/stnolting/neorv32/blob/master/docs/riscv-privileged.pdf)*and a subset of the *Privileged Architecture Specification [Version 1.12-draft](https://github.com/stnolting/neorv32/blob/master/docs/riscv-spec.pdf)*.If you do not want to use the NEORV32 Processor setup, you can also use the CPU instand-alone mode and build your own SoC around it.### [NEORV32 Processor](#Processor-Features)Based on the NEORV32 CPU, the NEORV32 Processor is a full-scale RISC-V microcontroller system (**SoC**)that already provides common peripherals like GPIO, serial interfaces, timers, embeddedmemories and an external bus interface for connectivity and custom extension.All optional features and modules beyond the base CPU can be enabled and configured via[VHDL generics](#Top-Entities).The processor is intended as ready-to-use auxiliary processor within a larger SoCdesigns or as stand-alone custom microcontroller. Its top entity can be directlysynthesized for any target technology without modifications.This project comes with a complete software ecosystem that features corelibraries for high-level usage of the provided functions and peripherals,makefiles, a runtime environment, several example programs to start with - including a free RTOS demo - andeven a builtin bootloader for easy program upload via UART.### [How to get started?](#Getting-Started)The processor is intended to work "out of the box". Just synthesize the[test setup](#Create-a-new-Hardware-Project), upload it to your FPGA board of choice and start playingwith the NEORV32. For more information take a look at the [NEORV32 data sheet](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/NEORV32.pdf) (pdf).The project’s change log is available in the [CHANGELOG.md](https://github.com/stnolting/neorv32/blob/master/CHANGELOG.md) file in the root directory of this repository.To see the changes between releases visit the project's [release page](https://github.com/stnolting/neorv32/releases).### Key Features* RISC-V-compliant `rv32i` CPU with optional `C`, `E`, `M`, `U`, `Zicsr`, `Zifencei` and `PMP` (physical memory protection) extensions* GCC-based toolchain ([pre-compiled rv32i and rv32e toolchains available](https://github.com/stnolting/riscv_gcc_prebuilt))* Application compilation based on [GNU makefiles](https://github.com/stnolting/neorv32/blob/master/sw/example/blink_led/makefile)* [Doxygen-based](https://github.com/stnolting/neorv32/blob/master/docs/doxygen_makefile_sw) documentation of the software framework: available on [GitHub pages](https://stnolting.github.io/neorv32/files.html)* [**Full-blown data sheet**](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/NEORV32.pdf) (pdf)* Completely described in behavioral, platform-independent VHDL - no primitives, macros, etc.* Fully synchronous design, no latches, no gated clocks* Small hardware footprint and high operating frequency* Highly configurable CPU and processor setup* [FreeRTOS port](https://github.com/stnolting/neorv32/blob/master/sw/example/demo_freeRTOS) available### Design Principles* From zero to `main()`: Completely open source and documented.* Plain VHDL without technology-specific parts like attributes, macros or primitives.* Easy to use – working out of the box.* Clean synchronous design, no wacky combinatorial interfaces.* Be as small as possible – but with a reasonable size-performance tradeoff.* The processor has to fit in a Lattice iCE40 UltraPlus 5k FPGA running at 20+ MHz.## StatusThe processor is [synthesizable](#FPGA-Implementation-Results) (tested on *real hardware* using Intel Quartus Prime, Xilinx Vivado and Lattice Radiant/Synplify Pro) and can successfully executeall the [provided example programs](https://github.com/stnolting/neorv32/tree/master/sw/example) including the [CoreMark benchmark](#CoreMark-Benchmark).The processor passes the official `rv32i`, `rv32im`, `rv32imc`, `rv32Zicsr` and `rv32Zifencei` [RISC-V compliance tests](https://github.com/riscv/riscv-compliance).| Project component | CI status | Note ||:--------------------------------------------------------------------------------|:----------|:---------|| [NEORV32 processor](https://github.com/stnolting/neorv32) | [](https://travis-ci.com/stnolting/neorv32) | [](https://stnolting.github.io/neorv32/files.html) || [Pre-built toolchain](https://github.com/stnolting/riscv_gcc_prebuilt) | [](https://travis-ci.com/stnolting/riscv_gcc_prebuilt) | || [RISC-V compliance test](https://github.com/stnolting/neorv32_riscv_compliance) | [](https://travis-ci.com/stnolting/neorv32_riscv_compliance) | |### To-Do / Wish List* Further size and performance optimization* Add AXI(-Lite) bridges* Synthesis results (+ wrappers?) for more platforms* Maybe port additional RTOSs (like [Zephyr](https://github.com/zephyrproject-rtos/zephyr) or [RIOT](https://www.riot-os.org))* Use LaTeX for data sheet* Implement further CPU extensions:* Atomic operations (`A`)* Bitmanipulation operations (`B`), when they are "official"* Floating-point instructions (`F`)* ...## FeaturesThe full-blown data sheet of the NEORV32 Processor and CPU is available as pdf file:[ NEORV32 data sheet](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/NEORV32.pdf).### Processor FeaturesThe NEORV32 Processor provides a full-scale microcontroller-like SoC based on the NEORV32 CPU. The setupis highly customizable via the processor's top generics.* Optional processor-internal data and instruction memories (**DMEM** / **IMEM**)* Optional internal **Bootloader** with UART console and automatic SPI flash boot option* Optional machine system timer (**MTIME**), RISC-V-compliant* Optional universal asynchronous receiver and transmitter (**UART**) with simulation output option via text.io* Optional 8/16/24/32-bit serial peripheral interface controller (**SPI**) with 8 dedicated chip select lines* Optional two wire serial interface controller (**TWI**), compatible to the I²C standard* Optional general purpose parallel IO port (**GPIO**), 32xOut & 32xIn, with pin-change interrupt* Optional 32-bit external bus interface, Wishbone b4 compliant (**WISHBONE**), *standard* or *pipelined* handshake/transactions mode* Optional watchdog timer (**WDT**)* Optional PWM controller with 4 channels and 8-bit duty cycle resolution (**PWM**)* Optional GARO-based true random number generator (**TRNG**)* Optional custom functions units (**CFU0** and **CFU1**) for tightly-coupled custom co-processors* System configuration information memory to check hardware configuration by software (**SYSINFO**)### CPU FeaturesThe CPU is [compliant](https://github.com/stnolting/neorv32_riscv_compliance) to the[official RISC-V specifications (2.2)](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/riscv-spec.pdf) including a subset of the[RISC-V privileged architecture specifications (1.12-draft)](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/riscv-spec.pdf).More information regarding the CPU including a detailed list of the instruction set and the available CSRs can be found inthe [NEORV32 data sheet](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/NEORV32.pdf).**General**:* Modified Harvard architecture (separate CPU interfaces for data and instructions; NEORV32 processor: Single processor-internal bus via I/D mux)* Two stages in-order pipeline (FETCH, EXECUTE); each stage uses a multi-cycle processing scheme* No hardware support of unaligned accesses - they will trigger an exception* Little-endian byte order* All reserved or unimplemented instructions will raise an illegal instruction exception* Privilege levels: `machine` mode, `user` mode (if enabled via `U` extension)* Official [RISC-V open-source architecture ID](https://github.com/riscv/riscv-isa-manual/blob/master/marchid.md)**RV32I base instruction set** (`I` extension):* ALU instructions: `LUI` `AUIPC` `ADDI` `SLTI` `SLTIU` `XORI` `ORI` `ANDI` `SLLI` `SRLI` `SRAI` `ADD` `SUB` `SLL` `SLT` `SLTU` `XOR` `SRL` `SRA` `OR` `AND`* Jump and branch instructions: `JAL` `JALR` `BEQ` `BNE` `BLT` `BGE` `BLTU` `BGEU`* Memory instructions: `LB` `LH` `LW` `LBU` `LHU` `SB` `SH` `SW`* System instructions: `ECALL` `EBREAK` `FENCE`**Compressed instructions** (`C` extension):* ALU instructions: `C.ADDI4SPN` `C.ADDI` `C.ADD` `C.ADDI16SP` `C.LI` `C.LUI` `C.SLLI` `C.SRLI` `C.SRAI` `C.ANDI` `C.SUB` `C.XOR` `C.OR` `C.AND` `C.MV` `C.NOP`* Jump and branch instructions: `C.J` `C.JAL` `C.JR` `C.JALR` `C.BEQZ` `C.BNEZ`* Memory instructions: `C.LW` `C.SW` `C.LWSP` `C.SWSP`* System instructions: `C.EBREAK` (only with `Zicsr` extension)**Embedded CPU version** (`E` extension):* Reduced register file (only the 16 lowest registers)**Integer multiplication and division hardware** (`M` extension):* Multiplication instructions: `MUL` `MULH` `MULHSU` `MULHU`* Division instructions: `DIV` `DIVU` `REM` `REMU`* By default, the multiplier and divider cores use an iterative bit-serial processing scheme* Multiplications can be mapped to DSPs via the `FAST_MUL_EN` generic to increase performance**Privileged architecture / CSR access** (`Zicsr` extension):* Privilege levels: `M-mode` (Machine mode)* CSR access instructions: `CSRRW` `CSRRS` `CSRRC` `CSRRWI` `CSRRSI` `CSRRCI`* System instructions: `MRET` `WFI`* Counter CSRs: `cycle` `cycleh` `instret` `instreth` `time` `timeh` `mcycle` `mcycleh` `minstret` `minstreth`* Machine CSRs: `mstatus` `misa`(read-only!) `mie` `mtvec` `mscratch` `mepc` `mcause` `mtval` `mip` `mvendorid` [`marchid`](https://github.com/riscv/riscv-isa-manual/blob/master/marchid.md) `mimpid` `mhartid` `mzext`(custom)* Supported exceptions and interrupts:* Misaligned instruction address* Instruction access fault* Illegal instruction* Breakpoint (via `ebreak` instruction)* Load address misaligned* Load access fault* Store address misaligned* Store access fault* Environment call from M-mode (via `ecall` instruction)* Machine timer interrupt `mti` (via processor's MTIME unit)* Machine software interrupt `msi` (via external signal)* Machine external interrupt `mei` (via external signal)* Four fast interrupt requests (custom extension)**Privileged architecture / User mode** (`U` extension, requires `Zicsr` extension):* Privilege levels: `M-mode` (Machine mode) + `U-mode` (User mode)**Privileged architecture / FENCE.I** (`Zifencei` extension):* System instructions: `FENCE.I`**Privileged architecture / Physical memory protection** (`PMP`, requires `Zicsr` extension):* Additional machine CSRs: `pmpcfg0` `pmpcfg1` `pmpaddr0` `pmpaddr1` `pmpaddr2` `pmpaddr3` `pmpaddr4` `pmpaddr5` `pmpaddr6` `pmpaddr7`### Non-RISC-V-Compliant Issues* `misa` CSR is read-only - no dynamic enabling/disabling of synthesized CPU extensions during runtime; for compatibility: write accesses (in m-mode) are ignored and do not cause an exception* The physical memory protection (**PMP**) only supports `NAPOT` mode, a minimal granularity of 8 bytes and only up to 8 regions### NEORV32-Specific CPU ExtensionsThe NEORV32-specific extensions are always enabled and are indicated via the `X` bit in the `misa` CSR.* Four *fast interrupt* request channels with according control/status bits in `mie` and `mip` and custom exception codes in `mcause`* `mzext` CSR to check for implemented `Z*` CPU extensions (like `Zifencei`)## FPGA Implementation Results### NEORV32 CPUThis chapter shows exemplary implementation results of the NEORV32 CPU for an **Intel Cyclone IV EP4CE22F17C6N FPGA** ona DE0-nano board. The design was synthesized using **Intel Quartus Prime Lite 19.1** ("balanced implementation"). The timinginformation is derived from the Timing Analyzer / Slow 1200mV 0C Model. If not otherwise specified, the default configurationof the CPU's generics is assumed (for example no PMP). No constraints were used at all.Results generated for hardware version `1.4.4.8`.| CPU Configuration | LEs | FFs | Memory bits | DSPs | f_max ||:---------------------------------------|:----------:|:--------:|:-----------:|:----:|:--------:|| `rv32i` | 983 | 438 | 2048 | 0 | ~120 MHz || `rv32i` + `u` + `Zicsr` + `Zifencei` | 1877 | 802 | 2048 | 0 | ~112 MHz || `rv32im` + `u` + `Zicsr` + `Zifencei` | 2374 | 1048 | 2048 | 0 | ~110 MHz || `rv32imc` + `u` + `Zicsr` + `Zifencei` | 2650 | 1064 | 2048 | 0 | ~110 MHz || `rv32emc` + `u` + `Zicsr` + `Zifencei` | 2680 | 1061 | 1024 | 0 | ~110 MHz |### NEORV32 Processor-Internal Peripherals and MemoriesResults generated for hardware version `1.4.4.8`.| Module | Description | LEs | FFs | Memory bits | DSPs ||:----------|:-----------------------------------------------------|----:|----:|------------:|-----:|| BOOT ROM | Bootloader ROM (default 4kB) | 4 | 1 | 32 768 | 0 || BUSSWITCH | Mux for CPU I & D interfaces | 62 | 8 | 0 | 0 || CFU0 | Custom functions unit 0 | - | - | - | - || CFU1 | Custom functions unit 1 | - | - | - | - || DMEM | Processor-internal data memory (default 8kB) | 13 | 2 | 65 536 | 0 || GPIO | General purpose input/output ports | 66 | 65 | 0 | 0 || IMEM | Processor-internal instruction memory (default 16kb) | 7 | 2 | 131 072 | 0 || MTIME | Machine system timer | 268 | 166 | 0 | 0 || PWM | Pulse-width modulation controller | 72 | 69 | 0 | 0 || SPI | Serial peripheral interface | 184 | 125 | 0 | 0 || SYSINFO | System configuration information memory | 11 | 9 | 0 | 0 || TRNG | True random number generator | 132 | 105 | 0 | 0 || TWI | Two-wire interface | 74 | 44 | 0 | 0 || UART | Universal asynchronous receiver/transmitter | 175 | 132 | 0 | 0 || WDT | Watchdog timer | 58 | 45 | 0 | 0 || WISHBONE | External memory interface (`MEM_EXT_REG_STAGES` = 2) | 106 | 104 | 0 | 0 |### NEORV32 Processor - Exemplary FPGA SetupsExemplary processor implementation results for different FPGA platforms. The processor setup uses *the default peripheral configuration* (like no _CFUs_ and no _TRNG_),no external memory interface and only internal instruction and data memories. IMEM uses 16kB and DMEM uses 8kB memory space. The setup's top entity connects most of theprocessor's [top entity](https://github.com/stnolting/neorv32/blob/master/rtl/core/neorv32_top.vhd) signalsto FPGA pins - except for the Wishbone bus and the interrupt signals.Results generated for hardware version `1.4.4.8`.| Vendor | FPGA | Board | Toolchain | Strategy | CPU Configuration | LUT / LE | FF / REG | DSP | Memory Bits | BRAM / EBR | SPRAM | Frequency ||:--------|:----------------------------------|:-----------------|:---------------------------|:-------- |:-----------------------------------------------|:-----------|:-----------|:-------|:-------------|:-----------|:---------|--------------:|| Intel | Cyclone IV `EP4CE22F17C6N` | Terasic DE0-Nano | Quartus Prime Lite 19.1 | balanced | `rv32imc` + `u` + `Zicsr` + `Zifencei` + `PMP` | 4008 (18%) | 1849 (9%) | 0 (0%) | 231424 (38%) | - | - | 105 MHz || Lattice | iCE40 UltraPlus `iCE40UP5K-SG48I` | Upduino v2.0 | Radiant 2.1 (Synplify Pro) | default | `rv32ic` + `u` + `Zicsr` + `Zifencei` | 4296 (81%) | 1611 (30%) | 0 (0%) | - | 12 (40%) | 4 (100%) | *c* 22.5 MHz || Xilinx | Artix-7 `XC7A35TICSG324-1L` | Arty A7-35T | Vivado 2019.2 | default | `rv32imc` + `u` + `Zicsr` + `Zifencei` + `PMP` | 2390 (11%) | 1888 (5%) | 0 (0%) | - | 8 (16%) | - | *c* 100 MHz |**_Notes_*** The Lattice iCE40 UltraPlus setup uses the FPGA's SPRAM memory primitives for the internal IMEM and DMEM (each 64kb).The FPGA-specific memory components can be found in [`rtl/fpga_specific`](https://github.com/stnolting/neorv32/blob/master/rtl/fpga_specific/lattice_ice40up).* The clock frequencies marked with a "c" are constrained clocks. The remaining ones are _f_max_ results from the place and route timing reports.* The Upduino and the Arty board have on-board SPI flash memories for storing the FPGA configuration. These device can also be used by the default NEORV32bootloader to store and automatically boot an application program after reset (both tested successfully).* The setups with `PMP` implement 2 regions with a minimal granularity of 32kB.## Performance### CoreMark BenchmarkThe [CoreMark CPU benchmark](https://www.eembc.org/coremark) was executed on the NEORV32 and is available in the[sw/example/coremark](https://github.com/stnolting/neorv32/blob/master/sw/example/coremark) project folder. This benchmarktests the capabilities of a CPU itself rather than the functions provided by the whole system / SoC.Results generated for hardware version `1.4.5.4`.~~~**Configuration**Hardware: 32kB IMEM, 16kB DMEM, 100MHz clockCoreMark: 2000 iterations, MEM_METHOD is MEM_STACKCompiler: RISCV32-GCC 10.1.0 (rv32i toolchain)Flags: default, see makefilePeripherals: UART for printing the results~~~| CPU | Executable Size | Optimization | CoreMark Score | CoreMarks/MHz ||:--------------------------------------------|:---------------:|:------------:|:--------------:|:-------------:|| `rv32i` | 26 940 bytes | `-O3` | 33.89 | **0.3389** || `rv32im` | 25 772 bytes | `-O3` | 64.51 | **0.6451** || `rv32imc` | 20 524 bytes | `-O3` | 64.51 | **0.6451** || `rv32imc` + `FAST_MUL_EN` | 20 524 bytes | `-O3` | 80.00 | **0.8000** || `rv32imc` + `FAST_MUL_EN` + `FAST_SHIFT_EN` | 20 524 bytes | `-O3` | 83.33 | **0.8333** |The `FAST_MUL_EN` configuration uses DSPs for the multiplier of the `M` extension (enabled via the `FAST_MUL_EN` generic). The `FAST_SHIFT_EN` configurationuses a barrel shifter for CPU shift operations (enabled via the `FAST_SHIFT_EN` generic).When the `C` extension is enabled, branches to an unaligned uncompressed instruction require additional instruction fetch cycles.### Instruction CyclesThe NEORV32 CPU is based on a two-stages pipelined architecutre. Each stage uses a multi-cycle processing scheme. Hence,each instruction requires several clock cycles to execute (2 cycles for ALU operations, ..., 40 cycles for divisions).The average CPI (cycles per instruction) depends on the instruction mix of a specific applications and also on the availableCPU extensions.Please note that by default the CPU-internal shifter (e.g. for the `SLL` instruction) as well as the multiplier and divider of the`M` extension use a bit-serial approach and require several cycles for completion.The following table shows the performance results for successfully running 2000 CoreMarkiterations, which reflects a pretty good "real-life" work load. The average CPI is computed bydividing the total number of required clock cycles (only the timed core to avoid distortion due to IO wait cycles; sampled via the `cycle[h]` CSRs)by the number of executed instructions (`instret[h]` CSRs). The executables were generated using optimization `-O3`.Results generated for hardware version `1.4.5.4`.| CPU | Required Clock Cycles | Executed Instructions | Average CPI ||:--------------------------------------------|----------------------:|----------------------:|:-----------:|| `rv32i` | 5 945 938 586 | 1 469 587 406 | **4.05** || `rv32im` | 3 110 282 586 | 602 225 760 | **5.16** || `rv32imc` | 3 172 969 968 | 615 388 890 | **5.16** || `rv32imc` + `FAST_MUL_EN` | 2 590 417 968 | 615 388 890 | **4.21** || `rv32imc` + `FAST_MUL_EN` + `FAST_SHIFT_EN` | 2 456 318 408 | 615 388 890 | **3.99** |The `FAST_MUL_EN` configuration uses DSPs for the multiplier of the `M` extension (enabled via the `FAST_MUL_EN` generic). The `FAST_SHIFT_EN` configurationuses a barrel shifter for CPU shift operations (enabled via the `FAST_SHIFT_EN` generic).When the `C` extension is enabled, branches to an unaligned uncompressed instruction require additional instruction fetch cycles.## Top EntitiesThe top entity of the **NEORV32 Processor** (SoC) is [**neorv32_top.vhd**](https://github.com/stnolting/neorv32/blob/master/rtl/core/neorv32_top.vhd)and the top entity of the **NEORV32 CPU** is [**neorv32_cpu.vhd**](https://github.com/stnolting/neorv32/blob/master/rtl/core/neorv32_cpu.vhd). Bothtop entities are located in `rtl/core`.All signals of the top entities are of type *std_ulogic* or *std_ulogic_vector*, respectively(except for the processor's TWI signals, which are of type *std_logic*). Leave all unused output ports unconnected (`open`) and tie all unusedinput ports to zero (`'0'` or `(others => '0')`, respectively).Alternative top entities, like the simplified ["hello world" test setup](#Create-a-new-Hardware-Project) or CPU/Processorwrappers with resolved port signal types (i.e. *std_logic*), can be found in [`rtl/top_templates`](https://github.com/stnolting/neorv32/blob/master/rtl/top_templates).Use the top's generics to configure the processor/CPU according to your needs. Each generic is initilized with the default configuration.Detailed information regarding the interface signals and configuration generics can be found inthe [NEORV32 documentary](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/NEORV32.pdf).### NEORV32 CPU```vhdlentity neorv32_cpu isgeneric (-- General --HW_THREAD_ID : std_ulogic_vector(31 downto 0):= (others => '0'); -- hardware thread idCPU_BOOT_ADDR : std_ulogic_vector(31 downto 0):= (others => '0'); -- cpu boot address-- RISC-V CPU Extensions --CPU_EXTENSION_RISCV_C : boolean := false; -- implement compressed extension?CPU_EXTENSION_RISCV_E : boolean := false; -- implement embedded RF extension?CPU_EXTENSION_RISCV_M : boolean := false; -- implement muld/div extension?CPU_EXTENSION_RISCV_U : boolean := false; -- implement user mode extension?CPU_EXTENSION_RISCV_Zicsr : boolean := true; -- implement CSR system?CPU_EXTENSION_RISCV_Zifencei : boolean := true; -- implement instruction stream sync.?-- Extension Options --FAST_MUL_EN : boolean := false; -- use DSPs for M extension's multiplierFAST_SHIFT_EN : boolean := false; -- use barrel shifter for shift operations-- Physical Memory Protection (PMP) --PMP_USE : boolean := false; -- implement PMP?PMP_NUM_REGIONS : natural := 4; -- number of regions (max 8)PMP_GRANULARITY : natural := 14 -- minimal region granularity (1=8B, 2=16B, 3=32B, ...) default is 64k);port (-- global control --clk_i : in std_ulogic := '0'; -- global clock, rising edgerstn_i : in std_ulogic := '0'; -- global reset, low-active, async-- instruction bus interface --i_bus_addr_o : out std_ulogic_vector(data_width_c-1 downto 0); -- bus access addressi_bus_rdata_i : in std_ulogic_vector(data_width_c-1 downto 0) := (others => '0'); -- bus read datai_bus_wdata_o : out std_ulogic_vector(data_width_c-1 downto 0); -- bus write datai_bus_ben_o : out std_ulogic_vector(03 downto 0); -- byte enablei_bus_we_o : out std_ulogic; -- write enablei_bus_re_o : out std_ulogic; -- read enablei_bus_cancel_o : out std_ulogic; -- cancel current bus transactioni_bus_ack_i : in std_ulogic := '0'; -- bus transfer acknowledgei_bus_err_i : in std_ulogic := '0'; -- bus transfer errori_bus_fence_o : out std_ulogic; -- executed FENCEI operation-- data bus interface --d_bus_addr_o : out std_ulogic_vector(data_width_c-1 downto 0); -- bus access addressd_bus_rdata_i : in std_ulogic_vector(data_width_c-1 downto 0) := (others => '0'); -- bus read datad_bus_wdata_o : out std_ulogic_vector(data_width_c-1 downto 0); -- bus write datad_bus_ben_o : out std_ulogic_vector(03 downto 0); -- byte enabled_bus_we_o : out std_ulogic; -- write enabled_bus_re_o : out std_ulogic; -- read enabled_bus_cancel_o : out std_ulogic; -- cancel current bus transactiond_bus_ack_i : in std_ulogic := '0'; -- bus transfer acknowledged_bus_err_i : in std_ulogic := '0'; -- bus transfer errord_bus_fence_o : out std_ulogic; -- executed FENCE operation-- system time input from MTIME --time_i : in std_ulogic_vector(63 downto 0) := (others => '0'); -- current system time-- interrupts (risc-v compliant) --msw_irq_i : in std_ulogic := '0'; -- machine software interruptmext_irq_i : in std_ulogic := '0'; -- machine external interruptmtime_irq_i : in std_ulogic := '0'; -- machine timer interrupt-- fast interrupts (custom) --firq_i : in std_ulogic_vector(3 downto 0) := (others => '0'));end neorv32_cpu;```### NEORV32 Processor```vhdlentity neorv32_top isgeneric (-- General --CLOCK_FREQUENCY : natural := 0; -- clock frequency of clk_i in HzBOOTLOADER_USE : boolean := true; -- implement processor-internal bootloader?USER_CODE : std_ulogic_vector(31 downto 0) := x"00000000"; -- custom user code-- RISC-V CPU Extensions --CPU_EXTENSION_RISCV_C : boolean := false; -- implement compressed extension?CPU_EXTENSION_RISCV_E : boolean := false; -- implement embedded RF extension?CPU_EXTENSION_RISCV_M : boolean := false; -- implement muld/div extension?CPU_EXTENSION_RISCV_U : boolean := false; -- implement user mode extension?CPU_EXTENSION_RISCV_Zicsr : boolean := true; -- implement CSR system?CPU_EXTENSION_RISCV_Zifencei : boolean := true; -- implement instruction stream sync.?-- Extension Options --FAST_MUL_EN : boolean := false; -- use DSPs for M extension's multiplierFAST_SHIFT_EN : boolean := false; -- use barrel shifter for shift operations-- Physical Memory Protection (PMP) --PMP_USE : boolean := false; -- implement PMP?PMP_NUM_REGIONS : natural := 4; -- number of regions (max 8)PMP_GRANULARITY : natural := 14; -- minimal region granularity (1=8B, 2=16B, 3=32B, ...) default is 64kB-- Internal Instruction memory --MEM_INT_IMEM_USE : boolean := true; -- implement processor-internal instruction memoryMEM_INT_IMEM_SIZE : natural := 16*1024; -- size of processor-internal instruction memory in bytesMEM_INT_IMEM_ROM : boolean := false; -- implement processor-internal instruction memory as ROM-- Internal Data memory --MEM_INT_DMEM_USE : boolean := true; -- implement processor-internal data memoryMEM_INT_DMEM_SIZE : natural := 8*1024; -- size of processor-internal data memory in bytes-- External memory interface --MEM_EXT_USE : boolean := false; -- implement external memory bus interface?MEM_EXT_REG_STAGES : natural := 2; -- number of interface register stages (0,1,2)-- Processor peripherals --IO_GPIO_USE : boolean := true; -- implement general purpose input/output port unit (GPIO)?IO_MTIME_USE : boolean := true; -- implement machine system timer (MTIME)?IO_UART_USE : boolean := true; -- implement universal asynchronous receiver/transmitter (UART)?IO_SPI_USE : boolean := true; -- implement serial peripheral interface (SPI)?IO_TWI_USE : boolean := true; -- implement two-wire interface (TWI)?IO_PWM_USE : boolean := true; -- implement pulse-width modulation unit (PWM)?IO_WDT_USE : boolean := true; -- implement watch dog timer (WDT)?IO_TRNG_USE : boolean := false; -- implement true random number generator (TRNG)?IO_CFU0_USE : boolean := false; -- implement custom functions unit 0 (CFU0)?IO_CFU1_USE : boolean := false -- implement custom functions unit 1 (CFU1)?);port (-- Global control --clk_i : in std_ulogic := '0'; -- global clock, rising edgerstn_i : in std_ulogic := '0'; -- global reset, low-active, async-- Wishbone bus interface (available if MEM_EXT_USE = true) --wb_adr_o : out std_ulogic_vector(31 downto 0); -- addresswb_dat_i : in std_ulogic_vector(31 downto 0) := (others => '0'); -- read datawb_dat_o : out std_ulogic_vector(31 downto 0); -- write datawb_we_o : out std_ulogic; -- read/writewb_sel_o : out std_ulogic_vector(03 downto 0); -- byte enablewb_stb_o : out std_ulogic; -- strobewb_cyc_o : out std_ulogic; -- valid cyclewb_ack_i : in std_ulogic := '0'; -- transfer acknowledgewb_err_i : in std_ulogic := '0'; -- transfer error-- Advanced memory control signals (available if MEM_EXT_USE = true) --fence_o : out std_ulogic; -- indicates an executed FENCE operationfencei_o : out std_ulogic; -- indicates an executed FENCEI operation-- GPIO (available if IO_GPIO_USE = true) --gpio_o : out std_ulogic_vector(31 downto 0); -- parallel outputgpio_i : in std_ulogic_vector(31 downto 0) := (others => '0'); -- parallel input-- UART (available if IO_UART_USE = true) --uart_txd_o : out std_ulogic; -- UART send datauart_rxd_i : in std_ulogic := '0'; -- UART receive data-- SPI (available if IO_SPI_USE = true) --spi_sck_o : out std_ulogic; -- SPI serial clockspi_sdo_o : out std_ulogic; -- controller data out, peripheral data inspi_sdi_i : in std_ulogic := '0'; -- controller data in, peripheral data outspi_csn_o : out std_ulogic_vector(07 downto 0); -- SPI CS-- TWI (available if IO_TWI_USE = true) --twi_sda_io : inout std_logic := 'H'; -- twi serial data linetwi_scl_io : inout std_logic := 'H'; -- twi serial clock line-- PWM (available if IO_PWM_USE = true) --pwm_o : out std_ulogic_vector(03 downto 0); -- pwm channels-- Interrupts --mtime_irq_i : in std_ulogic := '0'; -- machine timer interrupt, available if IO_MTIME_USE = falsemsw_irq_i : in std_ulogic := '0'; -- machine software interruptmext_irq_i : in std_ulogic := '0' -- machine external interrupt);end neorv32_top;```## Getting StartedThis overview is just a short excerpt from the *Let's Get It Started* section of the NEORV32 documentary:[ NEORV32 data sheet](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/NEORV32.pdf)### ToolchainAt first you need the **RISC-V GCC toolchain**. You can either [download the sources](https://github.com/riscv/riscv-gnu-toolchain)and build the toolchain by yourself, or you can download a prebuilt one and install it.:warning: Keep in mind that – for instance – a `rv32imc` toolchain only provides library code compiled with compressed and`mul`/`div` instructions! Hence, this code cannot be executed (without emulation) on an architecture without these extensions!To build the toolchain by yourself, follow the official [build instructions](https://github.com/riscv/riscv-gnu-toolchain).Make sure to use the `ilp32` or `ilp32e` ABI.**Alternatively**, you can download a prebuilt toolchain. I have uploaded the toolchains I am using to GitHub. These toolchainswere compiled on a 64-bit x86 Ubuntu 20.04 LTS (Ubuntu on Windows, actually). Download the toolchain of choice:[https://github.com/stnolting/riscv_gcc_prebuilt](https://github.com/stnolting/riscv_gcc_prebuilt)### Dowload the NEORV32 ProjectGet the sources of the NEORV32 Processor project. The simplest way is using `git clone` (suggested for easy project updates via `git pull`):$ git clone https://github.com/stnolting/neorv32.gitAlternatively, you can either download a specific [release](https://github.com/stnolting/neorv32/releases) or get the most recent versionof this project as [`*.zip` file](https://github.com/stnolting/neorv32/archive/master.zip).### Create a new Hardware ProjectCreate a new project with your FPGA design tool of choice. Add all the `*.vhd` files from the [`rtl/core`](https://github.com/stnolting/neorv32/blob/master/rtl)folder to this project. Make sure to add these files to a **new design library** called `neorv32`.You can either instantiate the [processor's top entity](https://github.com/stnolting/neorv32#top-entity) in your own project or youcan use a simple [test setup](https://github.com/stnolting/neorv32/blob/master/rtl/top_templates/neorv32_test_setup.vhd) (from the project's[`rtl/top_templates`](https://github.com/stnolting/neorv32/blob/master/rtl/top_templates) folder) as top entity.This test setup instantiates the processor and implements most of the peripherals and some ISA extensions. Only the UART lines, clock, reset and some GPIO output signals arepropagated as actual entity signals. Basically, it is a FPGA "hello world" example:```vhdlentity neorv32_test_setup isport (-- Global control --clk_i : in std_ulogic := '0'; -- global clock, rising edgerstn_i : in std_ulogic := '0'; -- global reset, low-active, async-- GPIO --gpio_o : out std_ulogic_vector(7 downto 0); -- parallel output-- UART --uart_txd_o : out std_ulogic; -- UART send datauart_rxd_i : in std_ulogic := '0' -- UART receive data);end neorv32_test_setup;```### Check the ToolchainMake sure `GNU Make` and a native `GCC` compiler are installed. To test the installation of the RISC-V toolchain navigate to an example project like`sw/example/blink_led` and run:neorv32/sw/example/blink_led$ make check### Compiling an Example ProgramThe NEORV32 project includes some [example programs](https://github.com/stnolting/neorv32/tree/master/sw/example) fromwhich you can start your own application. Simply compile one of these projects. This will create a NEORV32*executable* `neorv32_exe.bin` in the same folder:neorv32/sw/example/blink_led$ make clean_all exe### Upload the Executable via the BootloaderYou can upload a generated executable directly from the command line using the makefile's `upload` target. Replace `/dev/ttyUSB0` withthe according serial port.sw/exeample/blink_example$ make COM_PORT=/dev/ttyUSB0` uploadA more "secure" way is to use a dedicated terminal program. This allows to directly interact with the bootloader console.Connect your FPGA board via UART to your computer and open the according port to interface with the NEORV32 bootloader. The bootloaderuses the following default UART configuration:* 19200 Baud* 8 data bits* 1 stop bit* No parity bits* No transmission / flow control protocol (raw bytes only)* Newline on `\r\n` (carriage return & newline) - also for sent dataUse the bootloader console to upload the `neorv32_exe.bin` executable and run your application image.```<< NEORV32 Bootloader >>BLDV: Jul 6 2020HWV: 1.0.1.0CLK: 0x0134FD90 HzUSER: 0x0001CE40MISA: 0x42801104PROC: 0x03FF0035IMEM: 0x00010000 bytes @ 0x00000000DMEM: 0x00010000 bytes @ 0x80000000Autoboot in 8s. Press key to abort.Aborted.Available CMDs:h: Helpr: Restartu: Uploads: Store to flashl: Load from flashe: ExecuteCMD:> uAwaiting neorv32_exe.bin... OKCMD:> eBooting...Blinking LED demo program```Going further: Take a look at the _Let's Get It Started!_ chapter of the [ NEORV32 data sheet](https://raw.githubusercontent.com/stnolting/neorv32/master/docs/NEORV32.pdf).## ContributeI'm always thankful for help! So if you have any questions, bug reports, ideas or if you want to give some kind of feedback, feel freeto [open a new issue](https://github.com/stnolting/neorv32/issues) or directly [drop me a line](mailto:stnolting@gmail.com).If you'd like to contribute:1. [Fork](https://github.com/stnolting/neorv32/fork) this repository and clone the fork2. Create a feature branch in your fork: `git checkout -b awesome_new_feature_branch`3. Create a new remote for the upstream repo: `git remote add https://github.com/stnolting/neorv32`3. Commit your modifications: `git commit -m "Awesome new feature!"`4. Push to the branch: `git push origin awesome_new_feature_branch`5. Create a new [pull request](https://github.com/stnolting/neorv32/pulls)Please also check out the project's [code of conduct](https://github.com/stnolting/neorv32/tree/master/CODE_OF_CONDUCT.md).## LegalThis project is released under the BSD 3-Clause license. No copyright infringement intended.Other implied or used projects might have different licensing - see their documentation to get more information.#### CitationIf you are using the NEORV32 or some parts of the project in some kind of publication, please cite it as follows:> S. Nolting, "The NEORV32 Processor", github.com/stnolting/neorv32#### BSD 3-Clause LicenseCopyright (c) 2020, Stephan Nolting. All rights reserved.Redistribution and use in source and binary forms, with or without modification, arepermitted provided that the following conditions are met:1. Redistributions of source code must retain the above copyright notice, this list ofconditions and the following disclaimer.2. Redistributions in binary form must reproduce the above copyright notice, this list ofconditions and the following disclaimer in the documentation and/or other materialsprovided with the distribution.3. Neither the name of the copyright holder nor the names of its contributors may be used toendorse or promote products derived from this software without specific prior writtenpermission.THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESSOR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OFMERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THECOPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTEGOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSEDAND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDINGNEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISEDOF THE POSSIBILITY OF SUCH DAMAGE.#### Limitation of Liability for External LinksOur website contains links to the websites of third parties („external links“). As thecontent of these websites is not under our control, we cannot assume any liability forsuch external content. In all cases, the provider of information of the linked websitesis liable for the content and accuracy of the information provided. At the point in timewhen the links were placed, no infringements of the law were recognisable to us. As soonas an infringement of the law becomes known to us, we will immediately remove thelink in question.#### Proprietary Notice"Artix" and "Vivado" are trademarks of Xilinx Inc."Cyclone", "Quartus Prime", "Quartus Prime Lite" and "Avalon Bus" are trademarks of Intel Corporation."Artix" and "Vivado" are trademarks of Xilinx, Inc."iCE40", "UltraPlus" and "Lattice Radiant" are trademarks of Lattice Semiconductor Corporation."AXI" and "AXI-Lite" are trademarks of Arm Holdings plc.## Acknowledgements[](https://riscv.org/)[RISC-V](https://riscv.org/) - Instruction Sets Want To Be Free](https://travis-ci.com/stnolting/neorv32)Continous integration provided by [Travis CI](https://travis-ci.com/stnolting/neorv32) and powered by [GHDL](https://github.com/ghdl/ghdl).This project is not affiliated with or endorsed by the Open Source Initiative (https://www.oshwa.org / https://opensource.org).--------This repository was created on June 23th, 2020.Made with :coffee: in Hannover, Germany.
Go to most recent revision | Compare with Previous | Blame | View Log
