Skip to content

Documentation #134

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Dec 16, 2020
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
89 changes: 87 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,77 @@ Written for the [second edition](https://github.com/phil-opp/blog_os/issues/360)

## Design

TODO
When you press the power button the computer loads the BIOS from some flash memory stored on the motherboard.
The BIOS initializes and self tests the hardware then loads the first 512 bytes into memory from the media device
(i.e. the cdrom or floppy disk). If the last two bytes equal 0xAA55 then the BIOS will jump to location 0x7C00 effectively
transferring control to the bootloader. At this point the CPU is running in 16 bit mode,
meaning only the 16 bit registers are available. Also since the BIOS only loads the first 512 bytes this means our bootloader
code has to stay below that limit, otherwise we’ll hit uninitialised memory!
Using [Bios interrupt calls](https://en.wikipedia.org/wiki/BIOS_interrupt_call) the bootloader prints debug information to the screen.
For more information on how to write a bootloader click [here](http://3zanders.co.uk/2017/10/13/writing-a-bootloader/).
The assembler files get imported through the [global_asm feature](https://doc.rust-lang.org/unstable-book/library-features/global-asm.html).
The assembler syntax definition used is the one llvm uses: [GNU Assembly](http://microelectronics.esa.int/erc32/doc/as.pdf).

* stage_1.s
This stage initializes the stack, enables the A20 line, loads the rest of
the bootloader from disk, and jumps to stage_2.

* stage_2.s
This stage sets the target operating mode, loads the kernel from disk,
creates an e820 memory map, enters protected mode, and jumps to the
third stage.

* stage_3.s
This stage performs some checks on the CPU (cpuid, long mode), sets up an
initial page table mapping (identity map the bootloader, map the P4
recursively, map the kernel blob to 4MB), enables paging, switches to long
mode, and jumps to stage_4.


## Build chain
The file `.cargo/config` defines an llvm target file called `x86_64-bootloader.json`.
This file defines the architecture sets freestanding flags and tells llvm to use the linker script `linker.ld`.

The linker script tells the linker at which offsets the sections should be mapped to. In our case it tells the linker
that the bootloader asm files stage_0-3.s should be mapped to the very beginning of the executable. Read more about linker scripts
[here](https://www.sourceware.org/binutils/docs/ld/Scripts.html)

Another important role plays the file `build.rs`.
Placing a file named `build.rs` in the root of a package will cause
Cargo to compile that script and execute it just before building the package.
You can read more about it [here](https://doc.rust-lang.org/cargo/reference/build-scripts.html).
The `build.rs` file execute the llvm tools you installed with `rustup component add llvm-tools-preview`
in this order:

* Check size of .text section of the kernel if it's too small throw an error
```bash
llvm-size "../../target/x86_64-os/debug/svm_kernel"
```

* Strip debug symbols from kernel to make loading faster
```bash
llvm-objcopy "--strip-debug" "../../target/x86_64-os/debug/svm_kernel" "target/x86_64-bootloader/debug/build/bootloader-c8df27c930d8f65a/out/kernel_stripped-svm_kernel"
```
* Rename the .data section to .kernel in the stripped kernel.
Objcopy when using `--binary-architecture` flag creates three synthetic symbols
`_binary_objfile_start`, `_binary_objfile_end`, `_binary_objfile_size.`.
These symbols use the project / binary name which is why we have to rename them to something more generic
to be able to reference them.
```bash
llvm-objcopy "-I" "binary" "-O" "elf64-x86-64" "--binary-architecture=i386:x86-64" "--rename-section" ".data=.kernel" "--redefine-sym" "_binary_kernel_stripped_svm_kernel_start=_kernel_start_addr" "--redefine-sym" "_binary_kernel_stripped_svm_kernel_end=_kernel_end_addr" "--redefine-sym" "_binary_kernel_stripped_svm_kernel_size=_kernel_size" "target/x86_64-bootloader/debug/build/bootloader-c8df27c930d8f65a/out/kernel_stripped-svm_kernel" "target/x86_64-bootloader/debug/build/bootloader-c8df27c930d8f65a/out/kernel_bin-svm_kernel.o"
```
* Now create a static library out of it
```bash
llvm-ar "crs" "bootloader/target/x86_64-bootloader/debug/build/bootloader-c8df27c930d8f65a/out/libkernel_bin-svm_kernel.a" "target/x86_64-bootloader/debug/build/bootloader-c8df27c930d8f65a/out/kernel_bin-svm_kernel.o"
```
Afterwards `build.rs` tells cargo to use the newly created static library to link against your kernel, with the help of the linker script everything gets placed correctly in the
resulting elf file.
The last step is to strip away the elf header from the resulting elf binary so that the bios can jump directly to the bootloader `stage_1.s`. This is done with:
```bash
cargo objcopy -- -I elf64-x86-64 -O binary --binary-architecture=i386:x86-64 \
target/x86_64-bootloader/release/bootloader target/x86_64-bootloader/release/bootloader.bin
```


## Configuration

Expand Down Expand Up @@ -87,13 +157,28 @@ dd if=target/x86_64-bootloader/release/bootloader.bin of=/dev/sdX && sync

Where sdX is the device name of your USB stick. **Be careful** to choose the correct device name, because everything on that device is overwritten.

## Debugging
Set a breakpoint at address `0x7c00`. Disassemble instructions with gdb:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This address will differ between versions of bootloader and possibly between assembler versions.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's the address BIOS jumps to to start code execution, we'll always have something there

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, ok.

```bash
qemu-system-x86_64 -drive format=raw,file=target/x86_64-bootloader/release/bootloader.bin -s -S
```
```
(gdb) target remote: 1234
(gdb) b *0x7c00
(gdb) x/i $rip
```

If you use the `-enable-kvm` flag you need to use hardware breakpoints `hb`.

## Features
The bootloader crate can be configured through some cargo features:

- `vga_320x200`: This feature switches the VGA hardware to mode 0x13, a graphics mode with resolution 320x200 and 256 colors per pixel. The framebuffer is linear and lives at address `0xa0000`.
- `recursive_page_table`: Maps the level 4 page table recursively and adds the [`recursive_page_table_address`](https://docs.rs/bootloader/0.4.0/bootloader/bootinfo/struct.BootInfo.html#structfield.recursive_page_table_addr) field to the passed `BootInfo`.
- `map_physical_memory`: Maps the complete physical memory in the virtual address space and passes a [`physical_memory_offset`](https://docs.rs/bootloader/0.4.0/bootloader/bootinfo/struct.BootInfo.html#structfield.physical_memory_offset) field in the `BootInfo`.
- The virtual address where the physical memory should be mapped is configurable by setting the `physical-memory-offset` field in the kernel's `Cargo.toml`, as explained in [Configuration](#Configuration).
- `sse` enables sse instruction support
- The virtual address where the physical memory should be mapped is configurable by setting the `physical-memory-offset` field in the kernel's `Cargo.toml`, as explained in [Configuration](#Configuration).


## Advanced Documentation
See these guides for advanced usage of this crate:
Expand Down