IEEE-NITK/2-level-cache-controller-on-a-RISC-V-Core

2-Level Cache Controller on a RISC-V Core

Aim

The aim of this project is to develop a single-cycle RISC-V processor integrated with a hierarchical cache system to reduce memory access latency. The design includes:

  • A functional RV32I processor core with basic arithmetic, logic, control, and memory instructions.
  • L1 and L2 caches with distinct mapping policies (L1: direct-mapped, L2: 4-way set associative).
  • Implementation of write-back and no-write-allocate policies, with Least Recently Used (LRU) replacement in the set-associative L2.
  • Testing and verification of a balanced set of Load/Store and ALU instructions on the integrated system.

Introduction

The RISC-V architecture is an open-source instruction set architecture (ISA) known for its simplicity and flexibility. Originally developed at the University of California, Berkeley, it is part of the fifth generation of RISC processors.

A Cache Controller serves as an interface between the processor and memory, executing read and write requests (Load/Store instructions), and managing data flow across cache levels and main memory.

This project focuses on implementing a two-level cache system around a single-cycle RISC-V processor, offering hands-on experience in digital design and microprocessor architecture.

Technologies Used

  • Xilinx Vivado IDE
  • Ripes RISC-V Simulator
  • GTKWave (debugging)
  • Languages: Verilog HDL, RISC-V Assembly

Tools Description:

  • Xilinx Vivado: FPGA design suite for synthesis, implementation, and verification
  • Ripes: Visual simulator for RISC-V, generates binary .dat files for instruction memory
  • GTKWave: Waveform viewer for efficient debugging

Literature Survey

  1. Implementation and comparison of different cache mappings
  2. Accel: Cache simulator
  3. Cache architecture studies

Methodology

Research Phase

  • Memory Hierarchy Understanding:
    Studied spatial and temporal locality to inform cache sizing and organization.
  • AMAT (Average Memory Access Time):
    AMAT = Hit time + Miss rate × Miss penalty
  • Write Policy Analysis:
    Compared Write-through vs Write-back
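As a sanity check on the AMAT relation, the two-level form can be evaluated directly: the L1 miss penalty is itself the AMAT of L2. The hit times below match the delays specified later in the cache design (L1: 1 cycle, L2: 4 cycles, main memory: 10 cycles); the miss rates are purely illustrative assumptions, not measured values.

```python
def amat(hit_time, miss_rate, miss_penalty):
    """Average Memory Access Time = hit time + miss rate * miss penalty."""
    return hit_time + miss_rate * miss_penalty

# Two-level hierarchy: L1's miss penalty is the AMAT of L2.
# Delays from the design below; miss rates are assumed for illustration.
l2_amat = amat(hit_time=4, miss_rate=0.20, miss_penalty=10)      # 6.0 cycles
l1_amat = amat(hit_time=1, miss_rate=0.10, miss_penalty=l2_amat)
print(round(l1_amat, 2))  # → 1.6 cycles
```

Even with a 10-cycle memory, the assumed hit rates keep the average access close to the 1-cycle L1 hit time, which is the motivation for the hierarchy.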

Design Procedure

  • Developed the RV32I processor core using Verilog HDL, with the datapath organized around the classic five stages:

    • Instruction Fetch (IF)
    • Instruction Decode (ID)
    • Execute (EX)
    • Memory Access (MEM)
    • Write Back (WB)
  • Used structural modeling to define modules and integrate datapath and control path.

Cache Design

  • Clock Rate: Cache operates ~5× faster than the processor for optimal AMAT.

L1 Cache (Direct-Mapped)

  • Size: 64 bytes
  • Delay: 1 cycle

L2 Cache (4-Way Set Associative)

  • Size: 512 bytes
  • Delay: 4 cycles
  • Replacement Policy: LRU

Main Memory

  • Size: 4KB
  • Delay: 10 cycles

Policies Implemented:

  • Write-Back
  • No Write-Allocate
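Given these geometries, each byte address splits into tag, index, and offset fields. The sketch below assumes a 4-byte (one-word) block size, which the README does not state; with that assumption, the 64 B direct-mapped L1 has 16 lines (4 index bits) and the 512 B 4-way L2 has 32 sets (5 index bits).

```python
def split_address(addr, cache_bytes, block_bytes, ways):
    """Decompose a byte address into (tag, index, offset) for the given
    cache geometry. Sizes must be powers of two."""
    sets = cache_bytes // (block_bytes * ways)
    offset_bits = block_bytes.bit_length() - 1
    index_bits = sets.bit_length() - 1
    offset = addr & (block_bytes - 1)
    index = (addr >> offset_bits) & (sets - 1)
    tag = addr >> (offset_bits + index_bits)
    return tag, index, offset

# Geometries from the design above; the 4-byte block size is an assumption.
print(split_address(0x48, cache_bytes=64,  block_bytes=4, ways=1))  # L1 → (1, 2, 0)
print(split_address(0x48, cache_bytes=512, block_bytes=4, ways=4))  # L2 → (0, 18, 0)
```

The same address maps to different index/tag splits at each level, which is why a block promoted from L2 to L1 must have its tag recomputed.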

Implementation

  1. Check Mode: Ensure the controller is not busy by sampling the wait signal

  2. Read Operation:

    • Check L1 Cache
    • L1 Hit: Return data to processor
    • L1 Miss: Check L2
    • L2 Hit: Delay 2 cycles, promote block to L1
    • L2 Miss: Fetch from main memory (10-cycle delay)
    • Promotions: L2 → L1 with evictions and write-backs if needed
  3. Write Operation:

    • L1 Hit: Modify in L1
    • L1 Miss: Check and modify in L2 if found
    • L2 Miss: Modify directly in main memory
    • Policy: No promotion on write, no eviction on write (No Write-Allocate)
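The read and write flows above can be sketched as a small behavioural model. This is an illustrative simplification, not the project's Verilog: each level is modelled as a single LRU-ordered pool (set indexing and cycle delays are omitted), with write-back dirty bits and no-write-allocate handling as described.

```python
from collections import OrderedDict

class Level:
    """One cache level: block address -> dirty flag, kept in LRU order."""
    def __init__(self, lines):
        self.lines = lines
        self.store = OrderedDict()

    def lookup(self, block):
        if block in self.store:
            self.store.move_to_end(block)  # refresh LRU position
            return True
        return False

    def insert(self, block, dirty=False):
        """Insert a block; return the evicted (block, dirty) pair, if any."""
        victim = None
        if len(self.store) >= self.lines and block not in self.store:
            victim = self.store.popitem(last=False)  # evict LRU block
        self.store[block] = dirty
        self.store.move_to_end(block)
        return victim

class TwoLevel:
    def __init__(self):
        self.l1, self.l2 = Level(16), Level(128)  # line counts from the specs above
        self.memory_writes = 0

    def read(self, block):
        if self.l1.lookup(block):
            return "L1 hit"
        if self.l2.lookup(block):
            self._promote(block)           # L2 hit: promote block to L1
            return "L2 hit"
        self._promote(block)               # miss: fetch from main memory
        return "miss"

    def write(self, block):
        # No-write-allocate: modify wherever the block already resides.
        if self.l1.lookup(block):
            self.l1.store[block] = True    # mark dirty (write-back)
            return "L1 hit"
        if self.l2.lookup(block):
            self.l2.store[block] = True
            return "L2 hit"
        self.memory_writes += 1            # miss: write straight to memory
        return "miss"

    def _promote(self, block):
        victim = self.l1.insert(block)
        if victim:                         # L1 eviction falls back to L2
            v_block, v_dirty = victim
            v2 = self.l2.insert(v_block, v_dirty)
            if v2 and v2[1]:               # dirty L2 victim: write back
                self.memory_writes += 1
```

For example, a first `read(0)` misses and fills L1, a second returns "L1 hit", and a `write(5)` to an uncached block goes straight to main memory without allocating a line.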

Results

Test Program:

addi x5, x0, 0  
addi x6, x0, 0  
addi x7, x0, 4  
addi x6, x5, 0  
sw x7, 0(x6)  
lw x7, 0(x6)  
addi x6, x5, 4  
lw x7, 0(x6)  
addi x6, x5, 8  
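As a reference for the architectural state the waveforms should show, a minimal interpreter for the addi/sw/lw subset used above (assuming registers and data memory start zeroed) traces the program's effect:

```python
def run(program):
    """Execute the addi/sw/lw subset used by the test program above.
    Registers and data memory start zeroed; x0 is hard-wired to 0."""
    regs = [0] * 32
    mem = {}
    for line in program.strip().splitlines():
        op, rest = line.split(None, 1)
        args = [a.strip() for a in rest.split(",")]
        if op == "addi":
            rd, rs1, imm = args
            regs[int(rd[1:])] = regs[int(rs1[1:])] + int(imm)
        elif op in ("sw", "lw"):
            rt = int(args[0][1:])
            off, base = args[1].rstrip(")").split("(")
            addr = regs[int(base[1:])] + int(off)
            if op == "sw":
                mem[addr] = regs[rt]
            else:
                regs[rt] = mem.get(addr, 0)
        regs[0] = 0  # x0 always reads as zero
    return regs, mem

program = """
addi x5, x0, 0
addi x6, x0, 0
addi x7, x0, 4
addi x6, x5, 0
sw x7, 0(x6)
lw x7, 0(x6)
addi x6, x5, 4
lw x7, 0(x6)
addi x6, x5, 8
"""
regs, mem = run(program)
print(regs[5], regs[6], regs[7], mem)  # → 0 8 0 {0: 4}
```

The second `lw` reads uninitialized address 4 (returning 0 under the zeroed-memory assumption), so the final state is x5 = 0, x6 = 8, x7 = 0, with only word 0 written in memory.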
  • Processor Speed: 11.9 MHz (84 ns period)
  • Cache Speed: 500 MHz (2 ns period)
  • Speedup (after L1 full): 3.75
  • Observation point: PC = 0x4A; check hit1, hit2, and wait signals

Conclusion and Future Scope

The two-level cache controller significantly reduced average memory access latency and improved overall performance of the RISC-V system. Integrated with the RV32I core, it achieved substantial throughput gains compared to a baseline design.

Future Scope

  1. Branch Prediction: Reduce instruction fetch penalties
  2. Advanced Cache Policies: Write-through, Write-allocate, and even L3 Cache
  3. Multicore Coherence: Implement MESI/MOESI for shared caches
  4. Adaptive Replacement: Use DRRIP or ARC for better miss handling
  5. Prefetching Mechanisms: To reduce compulsory misses
  6. FPGA Implementation: Synthesize the full design to obtain power, area, and timing reports on hardware
