-
Notifications
You must be signed in to change notification settings - Fork 13k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use small code model for UEFI targets #87124
Use small code model for UEFI targets #87124
Conversation
Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @oli-obk (or someone else) soon. Please see the contribution instructions for more information. |
|
This sounds reasonable, but I know nothing about code models or UEFI cc @rust-lang/wg-llvm any takers? |
This seems reasonable enough to me, but I also wonder if it shouldn't be up to the user to apply this optimization (since it can indeed make builds fail for people building large UEFI applications). I've a hard time believing that the improvement here is significant enough for most people writing UEFI applications to care. |
This pull can be classified as a follow up pull request to #85357, where we actually have to use the small code model (as as you can see in the comments) rust performs some relocations (when compiling core) that are not supported on COFF targets. This is not an issue on x86_64 but to keep the target code models equal I guess we should use the small code model here too. And
|
As I mentioned in #85357, there is only very limited information on the code model knob and so far no-one could explain to us what this really controls, why the compiler cannot guess it, or what a safe default would be. Apart from this, I wonder whether it is the right fix. Should'nt we rather try to fix the wrong relocations of rustc? |
@dvermd
The fact that the produced executable/dll is going to be huge can be discovered only during linking when multiple object files are combined together, but all the compiler's work is already done. |
For reference small is the default code model on x86 in LLVM. |
This comment has been minimized.
This comment has been minimized.
Now rather then explicitly setting the code model to small, we pick let LLVM to pick the code model (which is the default code model for the architecture).
This should be an issue in rustc, I suspect as the code model does not matter if your code is relocatable. This pull request just makes sure that we use the default LLVM code model to improve codegen since as code model does not matter if your code is relocatable:
|
I'm ready to r+ this because it looks generally reasonable, but I'd want to make sure that this was actually tested on x86 UEFI. |
Yep, I did run the uefi-rs tests and they all succeed: andy@DESKTOP-IUH6DM6:~/uefi-rs/uefi-test-runner$ ./build.py --target x86_64 --headless --ci run
Fresh core v0.0.0 (/home/andy/rust/build/x86_64-unknown-linux-gnu/stage1/lib/rustlib/src/rust/library/core)
Fresh unicode-xid v0.2.2
Fresh rustc-std-workspace-core v1.99.0 (/home/andy/rust/build/x86_64-unknown-linux-gnu/stage1/lib/rustlib/src/rust/library/rustc-std-workspace-core)
Fresh compiler_builtins v0.1.46
Fresh alloc v0.0.0 (/home/andy/rust/build/x86_64-unknown-linux-gnu/stage1/lib/rustlib/src/rust/library/alloc)
Fresh proc-macro2 v1.0.27
Fresh quote v1.0.9
Fresh cfg-if v1.0.0
Fresh bit_field v0.10.1
Fresh bitflags v1.2.1
Fresh qemu-exit v2.0.1
Fresh rlibc v1.0.0
Fresh syn v1.0.73
Fresh ucs2 v0.3.2
Fresh log v0.4.14
Fresh uefi-macros v0.3.3 (/home/andy/uefi-rs/uefi-macros)
Fresh uefi v0.11.0 (/home/andy/uefi-rs)
Fresh uefi-services v0.8.0 (/home/andy/uefi-rs/uefi-services)
Fresh uefi-test-runner v0.2.0 (/home/andy/uefi-rs/uefi-test-runner)
Finished dev [unoptimized + debuginfo] target(s) in 0.12s
BdsDxe: loading Boot0001 "UEFI QEMU HARDDISK QM00001 " from PciRoot(0x0)/Pci(0x1F,0x2)/Sata(0x0,0xFFFF,0x0)
BdsDxe: starting Boot0001 "UEFI QEMU HARDDISK QM00001 " from PciRoot(0x0)/Pci(0x1F,0x2)/Sata(0x0,0xFFFF,0x0)
INFO: UEFI 2.7
INFO: Testing boot services
INFO: Testing memory functions
INFO: Allocating some pages of memory
INFO: Allocating a vector through the `alloc` crate
INFO: Allocating a structure with alignment to 0x100
INFO: Testing the `memmove` / `set_mem` functions
INFO: Testing memory map functions
INFO: Testing timer...
INFO: Testing watchdog...
INFO: Testing various protocols
INFO: Testing console protocols
INFO: Running text output protocol test
INFO: UEFI standard output current mode: Some(OutputMode { index: 0, dims: (80, 25) })
INFO: # uefi-rs test runner
INFO: Cursor visibility control unavailable
INFO: - Text mode #0: 25 rows by 80 columns
INFO: - Text mode #1: 31 rows by 100 columns
INFO: Running serial protocol test
INFO: Running graphics output protocol test
SCREENSHOT: gop_test
INFO: Running pointer protocol test
INFO: Pointer state has not changed since the last query
INFO: Running UEFI debug connection protocol test
INFO: - Architecture: EBC
INFO: Testing Media Access protocols
INFO: Root directory entry: NamedFileProtocolInfo { header: FileInfoHeader { size: 88, file_size: 8192, physical_size: 8192, create_time: 2021-7-16 11:34:32.0 2047 (empty), last_access_time: 2021-7-16 0:0:0.0 2047 (empty), modification_time: 2021-7-16 11:34:32.0 2047 (empty), attribute: DIRECTORY }, name: ['E', 'F', 'I', '\u{0}'] }
INFO: MBR partition: MbrPartitionRecord { boot_indicator: 128, starting_chs: [1, 1, 0], os_type: MbrOsType(6), ending_chs: [15, 255, 255], starting_lba: 63, size_in_lba: 1032129 }
INFO: Testing Platform Initialization protocols
INFO: Running shim lock protocol test
INFO: Shim lock protocol is not supported
INFO: Testing runtime services
INFO: Testing set_variable
INFO: Testing get_variable_size
INFO: Testing get_variable
INFO: Testing complete, shutting down...
andy@DESKTOP-IUH6DM6:~/uefi-rs/uefi-test-runner$ The only change I made to the build script was adding python3 ./x.py build -i library/std && rustup toolchain link aarch-dev build/x86_64-unknown-linux-gnu/stage1 Since I am on windows, I ran the tests on WSL without KVM which the tests script (the test script by default uses KVM acceleration). I added the I have also tested my bootloader and it works perfectly fine. I also flashed my bootloader into USB and booted it and works fine too. |
Ok, r=me with commits squashed. |
* Since the code model only applies to the code and not the data and the code model only applies to functions you call through using `call`, `jmp` and data with `lea`, etc… If you are calling functions using the function pointers from the UEFI structures the code model does not apply in that case. It’s just related to the address space size of your own binary. Since UEFI (uefi is all relocatable) uses relocatable PEs (relocatable code does not care about the code model) so, we use the small code model here. * Since applications don't usually take gigabytes of memory, setting the target to use the small code model should result in better codegen (comparable with majority of other targets). Large code models are also known for generating horrible code, for example 16 bytes of code to load a single 8-byte value. * Use the LLVM default code model for the architecture for the x86_64-unknown-uefi targets. For reference small is the default code model on x86 in LLVM: <https://github.com/llvm/llvm-project/blob/7de2173c2a4c45711831cfee3ccf53690c76ff07/llvm/lib/Target/X86/X86TargetMachine.cpp#L204> * Remove the comments too as they are not UEFI-specific and applies to pretty much any target. I added them before as I was explicitily setting the code model to small. Signed-off-by: Andy-Python-Programmer <andypythonappdeveloper@gmail.com>
ebe9062
to
db1e492
Compare
Done, squashed commits... |
@bors r+ |
📌 Commit db1e492 has been approved by |
☀️ Test successful - checks-actions |
There is no need to use the large code model. See: rust-lang/rust#87124
There is no need to use the large code model. See: rust-lang/rust#87124
Since the code model only applies to the code and not the data and the code model
only applies to functions you call through using
call
,jmp
and data withlea
, etc…If you are calling functions using the function pointers from the UEFI structures the code
model does not apply in that case. It’s just related to the address space size of your own binary.
Since UEFI (uefi is all relocatable) uses relocatable PEs (relocatable code does not care about the
code model) so, we use the small code model here.
Since applications don't usually take gigabytes of memory, setting the
target to use the small code model should result in better codegen (comparable
with majority of other targets).
Large code models are also known for generating horrible code, for
example 16 bytes of code to load a single 8-byte value.
Signed-off-by: Andy-Python-Programmer andypythonappdeveloper@gmail.com