- Feature Name:
safer_transmute
- Start Date: (fill me in with today's date, YYYY-MM-DD)
- RFC PR: rust-lang/rfcs#0000
- Rust Issue: rust-lang/rust#0000
We propose traits, namely TransmuteInto
and TransmuteFrom
, that are implemented automatically for combinations of types that may be safely transmuted. In other words, this RFC makes safe transmutation as easy as 1..., 2..., repr(C)
!
use core::transmute::{
TransmuteInto,
stability::{PromiseTransmutableInto, PromiseTransmutableFrom},
};
#[derive(PromiseTransmutableInto, PromiseTransmutableFrom)] // declare `Foo` to be *stably* transmutable
#[repr(C)]
pub struct Foo(pub u8, pub u16);
// ^ there's a padding byte here, between these fields
// Transmute fearlessly!
let _ : Foo = 64u32.transmute_into(); // Alchemy Achieved!
// ^^^^^^^^^^^^^^ provided by the `TransmuteInto` trait
let _ : u32 = Foo(16, 12).transmute_into(); // Compile Error!
// error[E0277]: the trait bound `u32: TransmuteFrom<foo::Foo, _>` is not satisfied
// --> src/demo.rs:15:27
// |
// 15 | let _ : u32 = Foo(16, 12).transmute_into(); // Compile Error!
// | ^^^^^^^^^^^^^^ the trait `TransmuteFrom<foo::Foo, _>` is not implemented for `u32`
// |
// = note: required because of the requirements on the impl of `TransmuteInto<u32, _>` for `foo::Foo`
// = note: byte 8 of the source type may be uninitialized; byte 8 of the destination type cannot be uninitialized.
Byte-reinterpretation conversions (such as those performed by mem::transmute
, mem::transmute_copy
, pointer casts, and union
s) are invaluable in high performance contexts, are unsafe
, and easy to get wrong. This RFC provides mechanisms that make many currently-unsafe transmutations entirely safe. For transmutations that are not entirely safe, this RFC's mechanisms make mistakes harder to make.
This RFC's comprehensive approach provides additional benefits beyond the mere act of transmutation; namely:
- authoritatively codifies language layout guarantees
- allows crate authors to codify their types' layout stability guarantees
- allows crate authors to codify their abstractions' layout requirements
Given the expressive foundation provided by this RFC, we also envision a range of future possibilities that will not require additional compiler support, including:
- safe slice and
Vec
casting - a unified, generic
Atomic<T>
type - a safe, generic alternative to
include_bytes!
- traits for asserting the size and alignment relationships of types
- zerocopy-style traits for safe initialization
- bytemuck-style mechanisms for fallible reference casting
Documentation of Rust's layout guarantees for a type are often spread across countless issues, pull requests, RFCs and various official resources. It can be very difficult to get a straight answer. When transmutation is involved, users must reason about the combined layout properties of the source and destination types.
This RFC proposes mechanisms that programmers will use to confidently answer such questions—by checking whether the TransmuteFrom
and TransmuteInto
traits are implemented, or (equivalently) by checking whether the can_transmute
predicate (a const fn
) is satisfied.
There is no canonical way for crate authors to declare the SemVer layout guarantees of their types. Crate authors currently must state their layout guarantees using prose in their documentation. In contrast to structural stability (e.g., the declared visibility of fields), layout stability is expressed extra-linguistically.
This isn't satisfactory: guarantees expressed in prose outside of the Rust programming language are guarantees that cannot be reasoned about inside the language. Whereas rustc
can dutifully deny programmers access to private fields, it is unable to prevent programmers from making unfounded expectations of types' in-memory layouts.
This RFC proposes simple-but-powerful mechanisms for declaring layout stability guarantees.
Similarly, there is no canonical way for crate authors to declare the layout requirements of generic abstractions over types that have certain layout properties.
For instance, a common bit-packing technique involves using the relationship between allocations and alignment. If a type is aligned to 2n, then the n least significant bits of pointers to that type will equal 0
. These known-zero bits can be packed with data. Since alignment cannot be currently reasoned about at the type-level, it's currently impossible to bound instantiations of a generic parameter based on minimum alignment.
The mechanisms proposed by the RFC enable this, see here.
Transmutation is the act of reinterpreting the bytes corresponding to a value of one type as if they corresponded to a different type. Concretely, we mean the behavior of this function:
#[inline(always)]
unsafe fn transmute<Src, Dst>(src: Src) -> Dst
{
#[repr(C)]
union Transmute<Src, Dst> {
src: ManuallyDrop<Src>,
dst: ManuallyDrop<Dst>,
}
ManuallyDrop::into_inner(Transmute { src: ManuallyDrop::new(src) }.dst)
}
By safer transmutation we mean: what where
bound could we add to transmute
restricts its type parameters Src
and Dst
in ways that statically limit the function's misuse? Our answer to this question will ensure that transmutations are, by default, sound, safe and stable.
A transmutation is sound if the mere act of transmuting a value from one type to another is not unspecified or undefined behavior.
A sound transmutation is safe if using the transmuted value cannot violate memory safety.
A safe transmutation is stable if the authors of the source type and destination types have indicated that the layouts of those types is part of their libraries' stability guarantees.
Disclaimer: While the high-level definitions of transmutation soundness, safety and stability are a core component of this RFC, the detailed rules and examples in this section are not. We expect that the initial implementation of TransmuteFrom
may initially be considerably less sophisticated than the examples in this section (and thus forbid valid transmutations). Nonetheless, this section explores nuanced cases of transmutation soundness and safety to demonstrate that the APIs we propose can grow to handle that nuance.
A transmutation is sound if the mere act of transmuting a value from one type to another is not unspecified or undefined behavior.
Transmutation is always unsound if it occurs between types with unspecified representations.
Most of Rust's primitive types have specified representations. That is, the precise layout characteristics of u8
, f32
is a documented and guaranteed aspect of those types.
In contrast, most struct
and enum
types defined without an explicit #[repr(C)]
attribute do not have well-specified layout characteristics.
To ensure that types you've define are soundly transmutable, you almost always (with very few exceptions) must mark them with the #[repr(C)]
attribute.
Transmutations involving owned values must adhere to two rules to be sound. They must:
The bits of any valid instance of the source type must be a bit-valid instance of the destination type.
For example, we are permitted to transmute a Bool
into a u8
:
#[derive(Default, PromiseTransmutableFrom, PromiseTransmutableInto)]
#[repr(u8)]
enum Bool {
True = 1,
False = 0,
}
let _ : u8 = Bool::True.transmute_into();
let _ : u8 = Bool::False.transmute_into();
#[derive(PromiseTransmutableFrom, PromiseTransmutableInto)]
annotation connotes that all aspects of Bool
's layout are part of its library stability guarantee.)
...because all possible instances of Bool
are also valid instances of u8
. However, transmuting a u8
into a Bool
is forbidden:
/* ⚠️ This example intentionally does not compile. */
let _ : Bool = u8::default().transmute_into(); // Compile Error!
...because not all instances of u8
are valid instances of Bool
.
Another example: While laying out certain types, Rust may insert padding bytes between the layouts of fields. In the below example Padded
has two padding bytes, while Packed
has none:
#[repr(C)]
#[derive(Default, PromiseTransmutableFrom, PromiseTransmutableInto)]
struct Padded(pub u8, pub u16, pub u8);
#[repr(C)]
#[derive(Default, PromiseTransmutableFrom, PromiseTransmutableInto)]
struct Packed(pub u16, pub u16, pub u16);
assert_eq!(mem::size_of::<Packed>(), mem::size_of::<Padded>());
We may safely transmute from Packed
to Padded
:
let _ : Padded = Packed::default().transmute_into();
...but not from Padded
to Packed
:
/* ⚠️ This example intentionally does not compile. */
let _ : Packed = Padded::default().transmute_into(); // Compile Error!
...because doing so would expose two uninitialized padding bytes in Padded
as if they were initialized bytes in Packed
.
It's completely sound to transmute into a type with fewer bytes than the source type; e.g.:
let _ : [u8; 16] = [u8; 32]::default().transmute_into();
This transmute truncates away the final sixteen bytes of the [u8; 32]
value.
A value may not be transmuted into a type of greater size, if doing so would expose uninitialized bytes as initialized:
/* ⚠️ This example intentionally does not compile. */
let _ : [u8; 32] = [u8; 16]::default().transmute_into(); // Compile Error!
The restrictions above that apply to transmuting owned values also apply to transmuting references. However, references carry a few additional restrictions.
A sound transmutation must:
- preserve or shrink size,
- preserve or relax alignment,
- preserve or shrink lifetimes,
- preserve or shrink uniqueness, and
- and if the destination type is a mutate-able reference, preserve validity.
You may preserve or decrease the size of the referent type via transmutation:
let _: &[u8; 3] = (&[0u8; 9]).transmute_into();
However, you may not, under any circumstances, increase the size of the referent type:
/* ⚠️ This example intentionally does not compile. */
let _: &[u8; 9] = (&[0u8; 3]).transmute_into(); // Compile Error!
Unaligned loads are undefined behavior. You may transmute a reference into reference of more relaxed alignment:
let _: &[u8; 0] = (&[0u16; 0]).transmute_into();
However, you may not transmute a reference into a reference of more-restrictive alignment:
/* ⚠️ This example intentionally does not compile. */
let _: &[u16; 0] = (&[0u8; 0]).transmute_into(); // Compile Error!
You may transmute a reference into a reference of lesser lifetime:
fn shrink<'a>() -> &'a u8 {
static long : &'static u8 = &16;
long.transmute_into()
}
However, you may not transmute a reference into a reference of greater lifetime:
/* ⚠️ This example intentionally does not compile. */
fn extend<'a>(short: &'a u8) -> &'static u8 {
short.transmute_into() // Compile Error!
}
You may preserve or decrease the uniqueness of a reference through transmutation:
let _: &u8 = (&42u8).transmute_into();
let _: &u8 = (&mut 42u8).transmute_into();
However, you may not transmute a shared reference into a unique reference:
/* ⚠️ This example intentionally does not compile. */
let _: &mut u8 = (&42u8).transmute_into(); // Compile Error!
A mutate-able reference is:
- all unique (i.e.,
&mut T
) references - all shared (i.e.,
&T
) references whose referent type contain any bytes produced by the contents ofUnsafeCell
.
Unlike transmutations of owned values, the transmutation of a mutate-able reference may also not expand the bit-validity of the referenced type. For instance:
/* ⚠️ This example intentionally does not compile. */
let mut x = NonZeroU8::new(42).unwrap();
{
let y : &mut u8 = (&mut x).transmute_into(); // Compile Error!
*y = 0;
}
let z : NonZeroU8 = x;
If this example did not produce a compile error, the value of z
would not be a bit-valid instance of its type, NonZeroU8
.
A sound transmutation is safe if using the transmuted value safely cannot violate memory safety. Whereas soundness solely concerns the act of transmutation, safety is concerned with what might happen with a value after transmutation occurs.
A struct or enum variant is fully implicitly constructable at a given location only if, at that location, that type can be instantiated via its implicit constructor, and its fields are also implicitly constructable.
The implicit constructor of a struct or enum variant is the constructor Rust creates implicitly from its definition; e.g.:
struct Point<T> {
x: T,
y: T,
}
let p = Point { x: 4, y: 2 };
// ^^^^^^^^^^^^^^^^^^^^ An instance of `Point` is created here, via its implicit constructor.
Limiting implicit constructability is the fundamental mechanism with which type authors build safe abstractions for unsafe
code, whose soundness is dependent on preserving invariants on fields. Usually, this takes the form of restricting the visibility of fields. For instance, consider the type NonEmptySlice
, which enforces a validity constraint on its fields via its constructor:
pub mod crate_a {
#[repr(C)]
pub struct NonEmptySlice<'a, T> {
data: *const T,
len: usize,
lifetime: core::marker::PhantomData<&'a ()>,
}
impl<'a, T> NonEmptySlice<'a, T> {
pub fn from_array<const N: usize>(arr: &'a [T; N], len: usize) -> Self {
assert!(len <= N);
assert!(len > 0);
Self {
data: arr as *const T,
len,
lifetime: core::marker::PhantomData,
}
}
pub fn first(&self) -> &'a T {
unsafe { &*self.data }
}
}
}
It is sound for first
to be a safe method is because the from_array
constructor ensures that data
is safe to dereference, and because from_array
is the only way to safely initialize NonEmptySlice
outside of crate_a
(note that NonEmptySlice
's fields are not pub
). As a rule: any field that is not marked pub
should be assumed to be private because it is subject to safety invariants.
Unfortunately, field visibility modifiers are not a surefire indicator of whether a type is fully implicitly constructable. A type author may restrict the implicit constructability of a type even in situations where all fields of that type (and all fields of those fields) are pub
; consider:
pub mod crate_a {
#[repr(C)]
pub struct NonEmptySlice<'a, T>(pub private::NonEmptySliceInner<'a, T>);
impl<'a, T> NonEmptySlice<'a, T> {
pub fn from_array<const N: usize>(arr: &'a [T; N], len: usize) -> Self {
assert!(len <= N && len > 0);
Self(
private::NonEmptySliceInner {
data: arr as *const T,
len,
lifetime: core::marker::PhantomData,
}
)
}
pub fn first(&self) -> &'a T {
unsafe { &*self.0.data }
}
}
// introduce a private module to avoid `private_in_public` error (E0446):
pub(crate) mod private {
#[repr(C)]
pub struct NonEmptySliceInner<'a, T> {
pub data: *const T,
pub len: usize,
pub lifetime: core::marker::PhantomData<&'a ()>,
}
}
}
In the above example, the definitions of both NonEmptySlice
and its field NonEmptySliceInner
are marked pub
, and all fields of these types are marked pub
. However, NonEmptySlice
is not fully implicitly constructible outside of crate_a
, because the module containing NonEmptySliceInner
is not visibile outside of crate_a
.
Transmutation supplies a mechanism for constructing instances of a type without invoking its implicit constructor, nor any constructors defined by the type's author.
In the previous examples, it would be unsafe to transmute 0u128
into NonEmptySlice
outside crate_a
, because subsequent safe use of that value (namely, calling first
) would violate memory safety. (However, it's completely safe to transmute NonEmptySlice
into a u128
.)
For transmutations where the destination type involves mutate-able references, the constructability of the source type is also relevant. Consider:
/* ⚠️ This example intentionally does not compile. */
let arr = [0u8, 1u8, 2u8];
let mut x = NonEmptySlice::from_array(&arr, 2);
{
let y : &mut u128 = (&mut x).transmute_into(); // Compile Error!
*y = 0u128;
}
let z : NonEmptySlice<u8> = x;
If this example did not produce a compile error, the value of z
would not be a safe instance of its type, NonEmptySlice
, because z.first()
would dereference a null pointer.
Since the soundness and safety of a transmutation is affected by the layouts of the source and destination types, changes to those types' layouts may cause code which previously compiled to produce errors. In other words, transmutation causes a type's layout to become part of that type's API for the purposes of SemVer stability.
The question is, then: how can the author of a type reason about transmutations they did not write, from-or-to types they did not write? We address this problem by introducing two traits which both allow an author to opt-in to stability guarantees for their types, and allow third-parties to reason at compile-time about what guarantees are provided for such types.
You may declare the stability guarantees of your type by implementing one or both of two traits:
pub trait PromiseTransmutableFrom
where
Self::Archetype: PromiseTransmutableFrom
{
type Archetype: TransmuteInto<Self, NeglectStability>
}
pub trait PromiseTransmutableInto
where
Self::Archetype: PromiseTransmutableInto,
{
type Archetype: TransmuteFrom<Self, NeglectStability>
}
To implement each of these traits, you must specify an Archetype
. An Archetype
is a type whose layout exemplifies the extremities of your stability promise (i.e., the least/most constrained type for which it is valid to transmute your type into/from).
By implementing PromiseTransmutableFrom
, you promise that your type is guaranteed to be safely transmutable from PromiseTransmutableFrom::Archetype
. Conversely, by implementing PromiseTransmutableInto
, you promise that your type is guaranteed to be safely transmutable into PromiseTransmutableInto::Archetype
.
You are free to change the layout of your type however you like between minor crate versions so long as that change does not violates these promises. These two traits are capable of expressing simple and complex stability guarantees.
Together with the PromiseTransmutableFrom
and PromiseTransmutableInto
traits, this impl of TransmuteFrom
constitutes the formal definition of transmutation stability:
unsafe impl<Src, Dst> TransmuteFrom<Src> for Dst
where
Src: PromiseTransmutableInto,
Dst: PromiseTransmutableFrom,
Dst::Archetype: TransmuteFrom<Src::Archetype, NeglectStability>
{}
Why is this safe? Can we really safely judge whether Dst
is transmutable from Src
by assessing the transmutability of two different types? Yes! Transmutability is transitive. Concretely, if we can safely transmute:
Src
toSrc::Archetype
(enforced bySrc: PromiseTransmutableInto
), andDst::Archetype
toDst
(enforced byDst: PromiseTransmutableFrom
), andSrc::Archetype
toDst::Archetype
(enforced byDst::Archetype: TransmuteFrom<Src::Archetype, NeglectStability>
),
...then it follows that we can safely transmute Src
to Dst
in three steps:
- we transmute
Src
toSrc::Archetype
, - we transmute
Src::Archetype
toDst::Archetype
, - we transmute
Dst::Archetype
toDst
.
To promise that all transmutations which are currently safe for your type will remain so in the future, simply annotate your type with:
#[derive(PromiseTransmutableFrom, PromiseTransmutableInto)]
#[repr(C)]
pub struct Foo(pub Bar, pub Baz);
This expands to:
#[repr(C)]
pub struct Foo(pub Bar, pub Baz);
/// Generated `PromiseTransmutableFrom` for `Foo`
const _: () = {
use core::transmute::stability::PromiseTransmutableFrom;
#[repr(C)]
pub struct TransmutableFromArchetype(
pub <Bar as PromiseTransmutableFrom>::Archetype,
pub <Baz as PromiseTransmutableFrom>::Archetype,
);
impl PromiseTransmutableFrom for TransmutableFromArchetype {
type Archetype = Self;
}
impl PromiseTransmutableFrom for Foo {
type Archetype = TransmutableFromArchetype;
}
};
/// Generated `PromiseTransmutableInto` for `Foo`
const _: () = {
use core::transmute::stability::PromiseTransmutableInto;
#[repr(C)]
pub struct TransmutableIntoArchetype(
pub <Bar as PromiseTransmutableInto>::Archetype,
pub <Baz as PromiseTransmutableInto>::Archetype,
);
impl PromiseTransmutableFrom for TransmutableIntoArchetype {
type Archetype = Self;
}
impl PromiseTransmutableInto for Foo {
type Archetype = TransmutableIntoArchetype;
}
};
Since deriving both of these traits together is, by far, the most common use-case, we propose #[derive(PromiseTransmutable)]
as an ergonomic shortcut.
We also can specify custom Archetype
s to finely constrain the set of transmutations we are willing to make stability promises for. Consider, for instance, if we want to leave ourselves the future leeway to change the alignment of a type Foo
without making a SemVer major change:
#[repr(C)]
pub struct Foo(pub Bar, pub Baz);
The alignment of Foo
affects transmutability of &Foo
. A &Foo
cannot be safely transmuted from a &Bar
if the alignment requirements of Foo
exceed those of Bar
. If we don't want to promise that &Foo
is stably transmutable from virtually any Bar
, we simply make Foo
's PromiseTransmutableFrom::Archetype
a type with maximally strict alignment requirements:
const _: () = {
use core::transmute::stability::PromiseTransmutableFrom;
#[repr(C, align(536870912))]
pub struct TransmutableFromArchetype(
pub <Bar as PromiseTransmutableFrom>::Archetype,
pub <Baz as PromiseTransmutableFrom>::Archetype,
);
impl PromiseTransmutableFrom for TransmutableFromArchetype {
type Archetype = Self;
}
impl PromiseTransmutableFrom for Foo {
type Archetype = TransmutableFromArchetype;
}
};
Conversely, a &Foo
cannot be safely transmuted into a &Bar
if the alignment requirements of Bar
exceed those of Foo
. We reduce this set of stable transmutations by making PromiseTransmutableFrom::Archetype
a type with minimal alignment requirements:
const _: () = {
use core::transmute::stability::PromiseTransmutableInto;
#[repr(C, packed(1))]
pub struct TransmutableIntoArchetype(
pub <Bar as TransmutableIntoArchetype>::Archetype,
pub <Baz as TransmutableIntoArchetype>::Archetype,
);
impl PromiseTransmutableInto for TransmutableIntoArchetype {
type Archetype = Self;
}
impl PromiseTransmutableInto for Foo {
type Archetype = TransmutableIntoArchetype;
}
};
Given these two stability promises, we are free to modify the alignment of Foo
in SemVer-minor changes without running any risk of breaking dependent crates.
Two traits provide mechanisms for transmutation between types:
// this trait is implemented automagically by the compiler
#[lang = "transmute_from"]
pub unsafe trait TransmuteFrom<Src: ?Sized, Neglect = ()>
where
Neglect: TransmuteOptions,
{
#[inline(always)]
fn transmute_from(src: Src) -> Self
where
Src: Sized,
Self: Sized,
Neglect: SafeTransmuteOptions,
{
unsafe { Self::unsafe_transmute_from(src) }
}
#[inline(always)]
unsafe fn unsafe_transmute_from(src: Src) -> Self
where
Src: Sized,
Self: Sized,
Neglect: TransmuteOptions,
{
use core::mem::ManuallyDrop;
#[repr(C)]
union Transmute<Src, Dst> {
src: ManuallyDrop<Src>,
dst: ManuallyDrop<Dst>,
}
unsafe {
ManuallyDrop::into_inner(Transmute { src: ManuallyDrop::new(src) }.dst)
}
}
}
// implemented in terms of `TransmuteFrom`
pub unsafe trait TransmuteInto<Dst: ?Sized, Neglect = ()>
where
Neglect: TransmuteOptions,
{
fn transmute_into(self) -> Dst
where
Self: Sized,
Dst: Sized,
Neglect: SafeTransmuteOptions;
unsafe fn unsafe_transmute_into(self) -> Dst
where
Self: Sized,
Dst: Sized,
Neglect: TransmuteOptions;
}
unsafe impl<Src, Dst, Neglect> TransmuteInto<Dst, Neglect> for Src
where
Src: ?Sized,
Dst: ?Sized + TransmuteFrom<Src, Neglect>,
Neglect: TransmuteOptions,
{
...
}
In the above definitions, Src
represents the source type of the transmutation, Dst
represents the destination type of the transmutation, and Neglect
is a parameter that encodes which static checks the compiler ought to neglect when considering if a transmutation is valid. The default value of Neglect
is ()
, which reflects that, by default, the compiler does not neglect any static checks.
The default value of the Neglect
parameter, ()
, statically forbids transmutes that are unsafe, unsound, or unstable. However, you may explicitly opt-out of some static checks:
Transmute Option | Compromises | Usable With |
---|---|---|
NeglectStabilty |
Stability | transmute_{from,into} , unsafe_transmute_{from,into} |
NeglectAlignment |
Safety | unsafe_transmute_{from,into} |
NeglectValidity |
Soundness | unsafe_transmute_{from,into} |
NeglectStabilty
implements the SafeTransmuteOptions
and TransmuteOptions
marker traits, as it can be used in both safe and unsafe code. The selection of multiple options is encoded by grouping them as a tuple; e.g., (NeglectAlignment, NeglectValidity)
is a selection of both the NeglectAlignment
and NeglectValidity
options.
We introduce two marker traits which serve to group together the options that may be used with safe transmutes, and those which may be used with unsafe
transmutes:
pub trait SafeTransmuteOptions: private::Sealed
{}
pub trait TransmuteOptions: SafeTransmuteOptions
{}
impl SafeTransmuteOptions for () {}
impl TransmuteOptions for () {}
By default, TransmuteFrom
and TransmuteInto
's methods require that the layouts of the source and destination types are SemVer-stable. The NeglectStability
option disables this requirement.
pub struct NeglectStability;
impl SafeTransmuteOptions for NeglectStability {}
impl TransmuteOptions for NeglectStability {}
Prior to the adoption of the stability declaration traits, crate authors documented the layout guarantees of their types with doc comments. The TransmuteFrom
and TransmuteInto
traits and methods may be used with these types by requesting that the stability check is neglected; for instance:
fn serialize<W: Write>(val : LibraryType, dst: W) -> std::io::Result<()>
where
LibraryType: TransmuteInto<[u8; size_of::<LibraryType>()], NeglectStability>
{
...
}
Neglecting stability over-eagerly cannot cause unsoundness or unsafety. For this reason, it is the only transmutation option available on the safe methods transmute_from
and transmute_into
. However, neglecting stability over-eagerly may cause your code to cease compiling if the authors of the source and destination types make changes that affect their layout.
By using the NeglectStability
option to transmute types you do not own, you are committing to ensure that your reliance on these types' layouts is consistent with their documented stability guarantees.
By default, TransmuteFrom
and TransmuteInto
's methods require that, when transmuting references, the minimum alignment of the destination's referent type is no greater than the minimum alignment of the source's referent type. The NeglectAlignment
option disables this requirement.
pub struct NeglectAlignment;
impl TransmuteOptions for NeglectAlignment {}
By using the NeglectAlignment
option, you are committing to ensure that the transmuted reference satisfies the alignment requirements of the destination's referent type. For instance:
/// Try to convert a `&T` into `&U`.
///
/// This produces `None` if the referent isn't appropriately
/// aligned, as required by the destination type.
pub fn try_cast_ref<'t, 'u, T, U>(src: &'t T) -> Option<&'u U>
where
&'t T: TransmuteInto<&'u U, NeglectAlignment>,
{
if (src as *const T as usize) % align_of::<U>() != 0 {
None
} else {
// Safe because we dynamically enforce the alignment
// requirement, whose static check we chose to neglect.
Some(unsafe { src.unsafe_transmute_into() })
}
}
By default, TransmuteFrom
and TransmuteInto
's methods require that all instantiations of the source type are guaranteed to be valid instantiations of the destination type. This precludes transmutations which might be valid depending on the source value:
#[derive(PromiseTransmutableFrom, PromiseTransmutableInto)]
#[repr(u8)]
enum Bool {
True = 1,
False = 0,
}
/* ⚠️ This example intentionally does not compile. */
let _ : Bool = some_u8_value.transmute_into(); // Compile Error!
The NeglectValidity
option disables this check.
pub struct NeglectValidity;
impl TransmuteOptions for NeglectValidity {}
By using the NeglectValidity
option, you are committing to ensure dynamically source value is a valid instance of the destination type. For instance:
#[derive(PromiseTransmutableFrom, PromiseTransmutableInto)]
#[repr(u8)]
enum Bool {
True = 1,
False = 0,
}
pub trait TryIntoBool
{
fn try_into_bool(self) -> Option<Bool>;
}
impl<T> TryIntoBool for T
where
T: TransmuteInto<u8>,
u8: TransmuteInto<Bool, NeglectValidity>
{
fn try_into_bool(self) -> Option<Bool> {
let val: u8 = self.transmute_into();
if val > 1 {
None
} else {
// Safe, because we've first verified that
// `val` is a bit-valid instance of a boolean.
Some(unsafe {val.unsafe_transmute_into()})
}
}
}
Even with NeglectValidity
, the compiler will statically reject transmutations that cannot possibly be valid:
#[derive(PromiseTransmutableInto)]
#[repr(C)] enum Foo { A = 24 }
#[derive(PromiseTransmutableFrom)]
#[repr(C)] enum Bar { Z = 42 }
let _ = <Bar as TransmuteFrom<Foo, NeglectValidity>::unsafe_transmute_from(Foo::N) // Compile error!
The only item defined by this RFC requiring special compiler support is TransmuteFrom
. To realize this RFC's proposal of safe transmutation between different types, this item will require compiler support. However, the most minimal acceptable implementation of TransmuteFrom
can be achieved entirely in-language:
/// A type is transmutable into itself.
unsafe impl<T, Neglect> TransmuteFrom<T, Neglect> for T
where
Neglect: TransmuteOptions
{}
/// A transmutation is *stable* if...
unsafe impl<Src, Dst> TransmuteFrom<Src> for Dst
where
Src: PromiseTransmutableInto,
Dst: PromiseTransmutableFrom,
<Dst as PromiseTransmutableFrom>::Archetype:
TransmuteFrom<
<Src as PromiseTransmutableInto>::Archetype,
NeglectStability
>
{}
This minimal implementation is sufficient for convincing the compiler to accept basic stability declarations, such as those of Rust's primitive types. It is insufficient for making the compiler accept transmutations between different types (and, consequently, complex stability declarations). Implementers should use this as a starting point.
This listing is both a minimal implementation of this RFC (excepting the automatic derives) and the canonical specification of this RFC's API surface (playground):
#![feature(untagged_unions,const_fn,const_fn_union)] // for the impl of transmute free functions
#![feature(const_generics)] // for stability declarations on `[T; N]`
#![feature(decl_macro)] // for stub implementations of derives
#![feature(never_type)] // for stability declarations on `!`
#![allow(warnings)]
/// Transmutation conversions.
// suggested location: `core::convert`
pub mod transmute {
use {options::*, stability::*};
/// Reinterprets the bits of a value of one type as another type, safely.
///
/// Use `()` for `Neglect` if you do not wish to neglect any static checks.
#[inline(always)]
pub const fn safe_transmute<Src, Dst, Neglect>(src: Src) -> Dst
where
Dst: TransmuteFrom<Src, Neglect>,
Neglect: SafeTransmuteOptions
{
unsafe { unsafe_transmute::<Src, Dst, Neglect>(src) }
}
/// Reinterprets the bits of a value of one type as another type, potentially unsafely.
///
/// The onus is on you to ensure that calling this method is safe.
#[inline(always)]
pub const unsafe fn unsafe_transmute<Src, Dst, Neglect>(src: Src) -> Dst
where
Dst: TransmuteFrom<Src, Neglect>,
Neglect: TransmuteOptions
{
use core::mem::ManuallyDrop;
#[repr(C)]
union Transmute<Src, Dst> {
src: ManuallyDrop<Src>,
dst: ManuallyDrop<Dst>,
}
unsafe {
ManuallyDrop::into_inner(Transmute { src: ManuallyDrop::new(src) }.dst)
}
}
/// `Self: TransmuteInto<Dst, Neglect`, if the compiler accepts the stability,
/// safety, and soundness of transmuting `Self` into `Dst`, notwithstanding
/// a given set of static checks to `Neglect`.
pub unsafe trait TransmuteInto<Dst: ?Sized, Neglect = ()>
where
Neglect: TransmuteOptions,
{
/// Reinterpret the bits of a value of one type as another type, safely.
fn transmute_into(self) -> Dst
where
Self: Sized,
Dst: Sized,
Neglect: SafeTransmuteOptions;
/// Reinterpret the bits of a value of one type as another type, potentially unsafely.
///
/// The onus is on you to ensure that calling this method is safe.
unsafe fn unsafe_transmute_into(self) -> Dst
where
Self: Sized,
Dst: Sized,
Neglect: TransmuteOptions;
}
unsafe impl<Src, Dst, Neglect> TransmuteInto<Dst, Neglect> for Src
where
Src: ?Sized,
Dst: ?Sized + TransmuteFrom<Src, Neglect>,
Neglect: TransmuteOptions,
{
#[inline(always)]
fn transmute_into(self) -> Dst
where
Self: Sized,
Dst: Sized,
Neglect: SafeTransmuteOptions,
{
Dst::transmute_from(self)
}
#[inline(always)]
unsafe fn unsafe_transmute_into(self) -> Dst
where
Self: Sized,
Dst: Sized,
Neglect: TransmuteOptions,
{
unsafe { Dst::unsafe_transmute_from(self) }
}
}
/// `Self: TransmuteInto<Src, Neglect`, if the compiler accepts the stability,
/// safety, and soundness of transmuting `Src` into `Self`, notwithstanding
/// a given set of static checks to `Neglect`.
pub unsafe trait TransmuteFrom<Src: ?Sized, Neglect = ()>
where
Neglect: TransmuteOptions,
{
/// Reinterpret the bits of a value of one type as another type, safely.
#[inline(always)]
fn transmute_from(src: Src) -> Self
where
Src: Sized,
Self: Sized,
Neglect: SafeTransmuteOptions,
{
unsafe { Self::unsafe_transmute_from(src) }
}
/// Reinterpret the bits of a value of one type as another type, potentially unsafely.
///
/// The onus is on you to ensure that calling this function is safe.
#[inline(always)]
unsafe fn unsafe_transmute_from(src: Src) -> Self
where
Src: Sized,
Self: Sized,
Neglect: TransmuteOptions,
{
unsafe_transmute::<Src, Self, Neglect>(src)
}
}
/// A type is always transmutable from itself.
// This impl will be replaced with a compiler-supported for arbitrary source and destination types.
unsafe impl<T> TransmuteFrom<T, NeglectStability> for T {}
/// A type is *stably* transmutable if...
unsafe impl<Src, Dst> TransmuteFrom<Src> for Dst
where
Src: PromiseTransmutableInto,
Dst: PromiseTransmutableFrom,
<Dst as PromiseTransmutableFrom>::Archetype:
TransmuteFrom<
<Src as PromiseTransmutableInto>::Archetype,
NeglectStability
>
{}
/// Traits for declaring the SemVer stability of types.
pub mod stability {
use super::{TransmuteFrom, TransmuteInto, options::NeglectStability};
/// Declare that transmuting `Self` into `Archetype` is SemVer-stable.
pub trait PromiseTransmutableInto
where
Self::Archetype: PromiseTransmutableInto
{
/// The `Archetype` must be safely transmutable from `Self`.
type Archetype: TransmuteFrom<Self, NeglectStability>;
}
/// Declare that transmuting `Self` from `Archetype` is SemVer-stable.
pub trait PromiseTransmutableFrom
where
Self::Archetype: PromiseTransmutableFrom
{
/// The `Archetype` must be safely transmutable into `Self`.
type Archetype: TransmuteInto<Self, NeglectStability>;
}
/// Derive macro generating an impl of the trait `PromiseTransmutableInto`.
//#[rustc_builtin_macro]
pub macro PromiseTransmutableInto($item:item) {
/* compiler built-in */
}
/// Derive macro generating an impl of the trait `PromiseTransmutableFrom`.
//#[rustc_builtin_macro]
pub macro PromiseTransmutableFrom($item:item) {
/* compiler built-in */
}
impl PromiseTransmutableInto for ! {type Archetype = Self;}
impl PromiseTransmutableFrom for ! {type Archetype = Self;}
impl PromiseTransmutableInto for () {type Archetype = Self;}
impl PromiseTransmutableFrom for () {type Archetype = Self;}
impl PromiseTransmutableInto for f32 {type Archetype = Self;}
impl PromiseTransmutableFrom for f32 {type Archetype = Self;}
impl PromiseTransmutableInto for f64 {type Archetype = Self;}
impl PromiseTransmutableFrom for f64 {type Archetype = Self;}
impl PromiseTransmutableInto for i8 {type Archetype = Self;}
impl PromiseTransmutableFrom for i8 {type Archetype = Self;}
impl PromiseTransmutableInto for i16 {type Archetype = Self;}
impl PromiseTransmutableFrom for i16 {type Archetype = Self;}
impl PromiseTransmutableInto for i32 {type Archetype = Self;}
impl PromiseTransmutableFrom for i32 {type Archetype = Self;}
impl PromiseTransmutableInto for i64 {type Archetype = Self;}
impl PromiseTransmutableFrom for i64 {type Archetype = Self;}
impl PromiseTransmutableInto for i128 {type Archetype = Self;}
impl PromiseTransmutableFrom for i128 {type Archetype = Self;}
impl PromiseTransmutableInto for isize {type Archetype = Self;}
impl PromiseTransmutableFrom for isize {type Archetype = Self;}
impl PromiseTransmutableInto for u8 {type Archetype = Self;}
impl PromiseTransmutableFrom for u8 {type Archetype = Self;}
impl PromiseTransmutableInto for u16 {type Archetype = Self;}
impl PromiseTransmutableFrom for u16 {type Archetype = Self;}
impl PromiseTransmutableInto for u32 {type Archetype = Self;}
impl PromiseTransmutableFrom for u32 {type Archetype = Self;}
impl PromiseTransmutableInto for u64 {type Archetype = Self;}
impl PromiseTransmutableFrom for u64 {type Archetype = Self;}
impl PromiseTransmutableInto for u128 {type Archetype = Self;}
impl PromiseTransmutableFrom for u128 {type Archetype = Self;}
impl PromiseTransmutableInto for usize {type Archetype = Self;}
impl PromiseTransmutableFrom for usize {type Archetype = Self;}
use core::marker::PhantomData;
impl<T: ?Sized> PromiseTransmutableInto for PhantomData<T> { type Archetype = Self; }
impl<T: ?Sized> PromiseTransmutableFrom for PhantomData<T> { type Archetype = Self; }
impl<T, const N: usize> PromiseTransmutableInto for [T; N]
where
T: PromiseTransmutableInto,
[T::Archetype; N]
: TransmuteFrom<Self, NeglectStability>
+ PromiseTransmutableInto,
{
type Archetype = [T::Archetype; N];
}
impl<T, const N: usize> PromiseTransmutableFrom for [T; N]
where
T: PromiseTransmutableFrom,
[T::Archetype; N]
: TransmuteInto<Self, NeglectStability>
+ PromiseTransmutableFrom,
{
type Archetype = [T::Archetype; N];
}
impl<T: ?Sized> PromiseTransmutableInto for *const T
where
T: PromiseTransmutableInto,
*const T::Archetype
: TransmuteFrom<Self, NeglectStability>
+ PromiseTransmutableInto,
{
type Archetype = *const T::Archetype;
}
impl<T: ?Sized> PromiseTransmutableFrom for *const T
where
T: PromiseTransmutableFrom,
*const T::Archetype
: TransmuteInto<Self, NeglectStability>
+ PromiseTransmutableFrom,
{
type Archetype = *const T::Archetype;
}
impl<T: ?Sized> PromiseTransmutableInto for *mut T
where
T: PromiseTransmutableInto,
*mut T::Archetype
: TransmuteFrom<Self, NeglectStability>
+ PromiseTransmutableInto,
{
type Archetype = *mut T::Archetype;
}
impl<T: ?Sized> PromiseTransmutableFrom for *mut T
where
T: PromiseTransmutableFrom,
*mut T::Archetype
: TransmuteInto<Self, NeglectStability>
+ PromiseTransmutableFrom,
{
type Archetype = *mut T::Archetype;
}
impl<'a, T: ?Sized> PromiseTransmutableInto for &'a T
where
T: PromiseTransmutableInto,
&'a T::Archetype
: TransmuteFrom<&'a T, NeglectStability>
+ PromiseTransmutableInto,
{
type Archetype = &'a T::Archetype;
}
impl<'a, T: ?Sized> PromiseTransmutableFrom for &'a T
where
T: PromiseTransmutableFrom,
&'a T::Archetype
: TransmuteInto<&'a T, NeglectStability>
+ PromiseTransmutableFrom,
{
type Archetype = &'a T::Archetype;
}
impl<'a, T: ?Sized> PromiseTransmutableInto for &'a mut T
where
T: PromiseTransmutableInto,
&'a mut T::Archetype
: TransmuteFrom<&'a mut T, NeglectStability>
+ PromiseTransmutableInto,
{
type Archetype = &'a mut T::Archetype;
}
impl<'a, T: ?Sized> PromiseTransmutableFrom for &'a mut T
where
T: PromiseTransmutableFrom,
&'a mut T::Archetype
: TransmuteInto<&'a mut T, NeglectStability>
+ PromiseTransmutableFrom,
{
type Archetype = &'a mut T::Archetype;
}
}
/// Static checks that may be neglected when determining if a type is `TransmuteFrom` some other type.
pub mod options {
/// Options that may be used with safe transmutations.
pub trait SafeTransmuteOptions: TransmuteOptions {}
/// Options that may be used with unsafe transmutations.
pub trait TransmuteOptions: private::Sealed {}
impl SafeTransmuteOptions for () {}
impl TransmuteOptions for () {}
/// Neglect the stability check of `TransmuteFrom`.
pub struct NeglectStability;
// Uncomment this if/when constructibility is fully implemented:
// impl SafeTransmuteOptions for NeglectStability {}
impl TransmuteOptions for NeglectStability {}
// prevent third-party implementations of `TransmuteOptions`
mod private {
use super::*;
pub trait Sealed {}
impl Sealed for () {}
impl Sealed for NeglectStability {}
}
}
}
To support transmutations between different types, implementers of this RFC should begin by defining a transmute_from
lang item to annotate libcore's definition of TransmuteFrom
. Whether TransmuteFrom
is implemented for a given type and parameters shall be determined within the implementation of the type system (à la Sized
and Freeze
).
This initial smart implementation may be made simpler by:
- not supporting enums
- not supporting unions
- not supporting
Neglect
parameters besides()
andNeglectStability
- simplifying constructability
The safety property of constructability defined in the guidance-level explanation of this RFC describes a platonic ideal of the property.
However, we recognize that this definition poses implementation challenges: In our definition of constructability, answering the question of whether a struct or enum variant is constructible depends on where that question is being asked. Consequently, answering whether a given type Src
is TransmutableInto
a given type Dst
will depend on where that question is posed.
We recommend adopting a simplified definition of constructability: a type is constructible if its fields are marked pub
, and those fields are constructible. With this definition, answering the question of whether a type is constructible does not depend on where the question is asked: we do not examine the visibility of the involved types.
Unfortunately, with no other actions taken, this simplified definition comes...
This definition is usually sufficient for ensuring safety: it is generally an error to expose a private type in a public type signature. However, these errors may be circumvented using the public-type-in-private-module trick:
pub mod crate_a {
#[repr(C)]
pub struct NonEmptySlice<'a, T>(pub private::NonEmptySliceInner<'a, T>);
impl<'a, T> NonEmptySlice<'a, T> {
pub fn from_array<const N: usize>(arr: &'a [T; N], len: usize) -> Self {
assert!(len <= N && len > 0);
Self(
private::NonEmptySliceInner {
data: arr as *const T,
len,
lifetime: core::marker::PhantomData,
}
)
}
pub fn first(&self) -> &'a T {
unsafe { &*self.0.data }
}
}
// introduce a private module to avoid `private_in_public` error (E0446):
pub(crate) mod private {
#[repr(C)]
pub struct NonEmptySliceInner<'a, T> {
pub data: *const T,
pub len: usize,
pub lifetime: core::marker::PhantomData<&'a ()>,
}
}
}
With this simplified definition of constructability, it is possible for a third-party to define a safe constructor of NonEmptySlice
that produces a value which is unsafe to use:
pub evil_constructor<T>(src: T) -> NonEmptySlice<'static, u8>
where
T: TransmuteInto<NonEmptySlice<'static, u8>, NeglectStability>,
{
src.transmute_into()
}
evil_constructor(0u128).first() // muahaha!
The above code is "safe" because our simplified definition of constructability fails to recognize this pattern of encapsulation, and because NeglectStability
is a SafeTransmutationOption
.
The intent of NeglectStability
is to permit the safe transmutation of types that predate the stabilization of the stability declaration traits. It also provides a convenient escape-hatch for type authors to neglect the stability of transmutations of their own types, without sacrificing safety. NeglectStability
is a SafeTransmutationOption
because, in principle, neglecting stability does not diminish safety. Our simplified definition of constructability violates this principle.
By temporarily sacrificing these goals, we may preserve safety solely...
We may preserve safety by demoting NeglectStability
to UnsafeTransmutationOption
-status.
In doing so, a third-party is forced to resort to an unsafe
transmutation to construct NonEmptySlice
; e.g.:
pub evil_constructor<T>(src: T) -> NonEmptySlice<'static, u8>
where
T: TransmuteInto<NonEmptySlice<'static, u8>, NeglectStability>,
{
// unsafe because we `NeglectStability`
unsafe { src.unsafe_transmute_into() }
}
Demoting NeglectStability
to unsafe-status does not stop type authors from opting-in to stable (and thus safe) transmutations; e.g., with derive(PromiseTransmutableFrom)
:
pub mod crate_a {
#[derive(PromiseTransmutableFrom)]
#[repr(C)]
pub struct NonEmptySlice<'a, T>(pub private::NonEmptySliceInner<'a, T>);
impl<'a, T> NonEmptySlice<'a, T> {
pub fn from_array<const N: usize>(arr: &'a [T; N], len: usize) -> Self {
assert!(len <= N && len > 0);
Self(
private::NonEmptySliceInner {
data: arr as *const T,
len,
lifetime: core::marker::PhantomData,
}
)
}
pub fn first(&self) -> &'a T {
unsafe { &*self.0.data }
}
}
// introduce a private module to avoid `private_in_public` error (E0446):
pub(crate) mod private {
#[derive(PromiseTransmutableFrom)]
#[repr(C)]
pub struct NonEmptySliceInner<'a, T> {
pub data: *const T,
pub len: usize,
pub lifetime: core::marker::PhantomData<&'a ()>,
}
}
}
In the above example, the type author declares NonEmptySlice
and NonEmptySliceInner
to be stably instantiatable via transmutation. Given this, a third-party no longer needs to resort to unsafe
code to violate the the invariants on inner
:
pub evil_constructor<T>(src: T) -> NonEmptySlice<'static, u8>
where
T: TransmuteInto<NonEmptySlice<'static, u8>>,
{
src.transmute_into()
}
evil_constructor(0u128).first() // muahaha!
This safety hazard is not materially different from the one that would be induced if the type author implemented DerefMut<Target=NonEmptySliceInner>
for NonEmptySlice
, or made the private
module pub
, or otherwise explicitly provided outsiders with unrestricted mutable access to data
.
We recommend that that implementers of this RFC initially simplify constructability by:
- adopting our simplified definition of constructability
- demoting
NeglectStability
to unsafe status (i.e., not implementingSafeTransmuteOptions
forNeglectStability
; onlyTransmuteOptions
) - advise users that implementing the stability declaration traits on types that are not fully-implicitly constructable will be a compiler-error will be a compiler error (i.e., these traits must not be implemented on types exploiting the pub-in-priv trick)
If and when the implementation of TransmuteFrom
encodes our complete definition of constructability, NeglectStability
shall become a safe transmute option.
The stability declaration traits communicate library layout stability, but not platform layout stability. A transmutation is platform-stable if it compiling one one platform implies it will compile on all other platforms. Unfortunately, platform-unstable types are common; e.g.:
- All primitive number types have platform-dependent endianness.
- All pointer-related primitive types (
usize
,isize
,*const T
,*mut T
,&T
,&mut T
) possess platform-dependent layouts; their sizes and alignments are well-defined, but vary between platforms. Concretely, whetherusize
isTransmuteInto<[u8; 4]>
orTransmuteInto<[u8; 8]>
will depend on the platform. - The very existence of some types depends on platform, too; e.g., the contents of
core::arch
,std::os
, andcore::sync::atomic
all depend on platform.
Our proposed stability system is oblivious to the inter-platform variations of these types. Expanding our stability system to be aware of inter-platform variations would introduce considerable additional complexity:
-
Cognitive Complexity: For types whose layout varies between platforms, the stability declaration traits could, perhaps, be adapted to encode platform-related guarantees. We anticipate this would contribute substantial cognitive complexity. Type authors, even those with no interest in cross-platform stability, would nonetheless need to reason about the layout properties of their types on platforms that might not yet exist.
-
Ergonomic Complexity: Platform instabilities are contagious: a type that contains a platform-unstable type is, itself, platform-unstable. Due to the sheer virulence of types with platform-dependent layouts, an explicit '
NeglectPlatformStability
' option would need to be used for many simple transmutations. The ergonomic cost of this would also be substantial. -
Implementation Complexity: The mechanisms proposed by this RFC are, fundamentally, applications of and additions to Rust's type system (i.e., they're traits). Mechanisms that impact platform stability, namely
#[cfg(...)]
annotations, long precede type-resolution and layout computation in the compilation process. For instance, it's possible to define types with impossible layouts:#[cfg(any())] struct Recursive(Recursive);
This program compiles successfully on all platforms because, from the perspective of later compilation stages,
Recursive
may as well not exist.
The issues of platform layout stability exposed by this RFC are not fundamentally different from the challenges of platform API stability. These challenges are already competently addressed by the mechanisms proposed in RFC1868. For this reason, and for the aforementioned concerns of additional complexity, we argue that communicating and enforcing platform layout stability must remain outside the scope of this RFC.
The model of stability proposed by this RFC frames stability as a quality of safe transmutations. A type author cannot specify stability archetypes for unsafe transmutations, and it is reasonable to want to do so.
To accommodate this, we may modify the definitions of PromiseTransmutableFrom
and PromiseTransmutableInto
to consume an optional Neglect
parameter, to allow for stability declarations for unsafe transmutations:
pub trait PromiseTransmutableFrom<Neglect = ()>
where
Neglect: TransmuteOptions
{
type Archetype
: TransmuteInto<Self, Sum<Neglect, NeglectStability>>
+ PromiseTransmutableInto<Sum<Neglect, NeglectStability>>;
}
pub trait PromiseTransmutableInto<Neglect = ()>
where
Neglect: TransmuteOptions
{
type Archetype
: TransmuteFrom<Self, Sum<Neglect, NeglectStability>>
+ PromiseTransmutableInto<Sum<Neglect, NeglectStability>>;
}
Implementations of these traits for a given Neglect
declares that a transmutation which is accepted while neglecting a particular set of checks (namely the set encoded by Neglect
) will continue to be possible.
We omit these definition from this RFC's recommendations because they are not completely satisfying. For instance, Neglect
is a logically unordered set of options, but is encoded as a tuple (which is ordered). To declare a transmutation that requires neglecting validity and alignment checks as stable, only one of these impls ought to be necessary:
impl PromiseTransmutableFrom<(NeglectAlignment, NeglectValidity)> for Foo
{
...
}
impl PromiseTransmutableFrom<(NeglectValidity, NeglectAlignment)> for Foo
{
...
}
Writing both impls (as we do above) is logically nonsense, but is nonetheless supported by Rust's coherence rules.
Some prior art, especially in the crate ecosystem, provides an API that only supports transmutations involving particular types (e.g., from/into bytes). As we discuss in the prior art section, we believe that the inflexibility of such approaches make them a poor basis of a language proposal. In particular, these restrictive approaches don't leave room to grow: supporting additional transmutations requires additional traits.
The API advocated by this proposal is unopinionated about what transmutations users might wish to do, and what transmutations the compiler is able to reason about. The implementation of this RFC may be initially very simple (and perhaps support no more than the restrictive approaches allow for), but then subsequently grow in sophistication—without necessitating public API changes.
If TransmuteInto
is implemented in terms of TransmuteFrom
, why provide it at all? We do so for consistency with libcore's From
/Into
traits, and because directionality conveys intent: TransmuteFrom
connotes conversion, whereas TransmuteInto
connotes initialization. We believe that the supporting code examples of this RFC demonstrate the explanatory benefits of providing both traits.
Although the focus of our API is statically-correct, infalible transmutations, the ability to opt-out of particular static checks is essential for building safer fallible mechanisms, such as alignment-fallible reference casting, or validity-fallible transmutations (e.g., bool
to u8
).
Although transmutations options exist at a type-level, they're represented as type-level tuples, whose familiar syntax is identical to value-level tuples. An empty tuple seems like the natural choice for encoding don't neglect anything.
We could not identify any advantages to representing options with const-generics. There is no clear syntactic advantage: tuples remain the most natural way to encode ad-hoc products of items. The comparative lack of default values for const-generic parameters poses an ergonomic disadvantage.
At least two requirements necessitate the presence of a stability system:
The usual rules of SemVer stability dictate that if a trait is is implemented in a version m.a.b
, it will continue to be implemented for all versions m.x.y
, where x ≥ a
and y ≥ b
. TransmuteFrom<Src, NeglectStability>
is the exception to this rule. It would be irresponsible to do nothing to mitigate this stability hazard.
The compromise made by this RFC is that TransmuteFrom
should be stable-by-default:
Dst: TransmuteFrom<Src>
follows the usual SemVer rules,Dst: TransmuteFrom<Src, NeglectStability>
does not.
The simplified formulation of constructability provides an initially-simpler implementation path at the cost of a soundness hole. There are three possible mitigations:
- Pretend it does not exist. Intentional soundness holes would not bode well for this RFC's acceptance.
- Only provide unsafe transmutation; not safe transmutation. This option fails to remove any
unsafe
blocks from end-users' code. - Allow safe transmutations only when the type authors have promised they will not create a situation that would compromise safety. We recommend this option.
This promise is inherently one of stability—the type author is vowing that they will not change the implementation of their type in a way that violates the no-pub-in-priv safety invariant of safe transmutation.
The proposed stability system is both simple, flexible, and extensible. Whereas ensuring the soundness and safety of TransmuteFrom<Src, NeglectStability>
requires non-trivial compiler support, stability does not—it is realized as merely two normal traits and an impl
.
This formulation is flexible: by writing custom Archetype
s, the stability declaration traits make possible granular and incomplete promises of layout stability (e.g., guaranteeing the size and validity qualities of a type, but not its alignment. Members of the safe-transmute working group have expressed an interest in granular stability declarations.
Finally, this formulation is extensible. The range of advance use-cases permitted by these traits is constrained only by the set of possible Archetype
s, which, in turn, is constrained by the completeness of TransmuteFrom
. As the implementation of TransmuteFrom
becomes more complete, so too will the range of advance use-cases accommodated by these traits.
In our proposal, #[repr(C)]
does not connote any promises of transmutation stability for SemVer purposes. It has been suggested that the presence of #[repr(C)]
already connotes total transmutation stability; i.e., that the type's author promises that the type's size and alignment and bit-validity will remain static. If this is true, then an additional stability mechanism is perhaps superfluous. However, we are unaware of any authoritative documentation indicating that #[repr(C)]
carries this implication. Treating #[repr(C)]
as an indicator of transmutation stability would thus pose a stability hazard.
This RFC builds on ample prior art in the crate ecosystem, but these efforts strain against the fundamental limitations of crates. Fundamentally, safe transmutation efforts use traits to expose layout information to the type system. The burden of ensuring safety is usually either placed entirely on the end-user, or assumed by complex, incomplete proc-macro derives
.
An exception to this rule is the typic crate, which utilizes complex, type-level programming to emulate a compiler-supported, "smart" TransmuteFrom
trait (like the one proposed in this RFC). Nonetheless, typic is fundamentally limited: since Rust does not provide a type-level mechanism for reflecting over the structure of arbitrary types, even typic cannot judge the safety of a transmutation without special user-added annotations on type definitions. Although typic succeeds as a proof-of-concept, its maintainability is questionable, and the error messages it produces are lovecraftian.
The development approaches like typic's could, perhaps, be eased by stabilizing frunk-like structural reflection, or (better yet) by stabilizing a compiler plugin API for registering "smart" traits like TransmuteFrom
. However, we suspect that such features would be drastically harder to design and stabilize.
Regardless of approach, almost all prior art attempts to reproduce knowledge already possessed by rustc
during the compilation process (i.e., the layout qualities of a concrete type). Emulating the process of layout computation to any degree is an error-prone duplication of effort between rustc
and the crate, in a domain where correctness is crucial.
Finally, community-led, crate-based approaches are, inescapably, unauthoritative. These approaches are incapable of fulfilling our motivating goal of providing a standard mechanism for programmers to statically ensure that a transmutation is safe, sound, or stable.
A handful of dimensions of variation characterize the distinctions between prior art in Rust:
- conversion complexity
- conversion fallibility
- source and destination types supported
- implementation mechanism
- stability hazards
We review each of these dimensions in turn, along with this proposal's location along these dimensions:
Prior work differs in whether it supports complex conversions, or only simple transmutation. Pre-RFC FromBits/IntoBits's proposed traits include conversion methods that are implemented by type authors. Because end-users provide their own definitions of these methods, they can be defined to do more than just transmutation (e.g., slice casting). (This approach is similar to the uncon crate's FromUnchecked
and IntoUnchecked
traits, which provide unsafe conversions between types. These traits are safe to implement, but their conversion methods are not.)
In contrast, our RFC focuses only on transmutation. Our TransmutableFrom
and TransmutableInto
traits serve as both a marker and a mechanism: if Dst: TransmuteFrom<Src>
, it is sound to transmute from Dst
into Src
using mem::transmute
. However, these traits also provide transmutation methods that are guaranteed to compile into nothing more complex than a memcpy
. These methods cannot be overridden by end-users to implement more complex behavior.
The signal and transmutability and mechanism are, in principle, separable. The convute crate's Transmute<T>
trait is an unsafe marker trait representing types that can be transmuted into T
. This is just a marker trait; the actual conversion mechanisms are provided by a separate suite of traits and functions. Our RFC combines marker with mechanism because we feel that separating these aspects introduces additional complexity with little added value.
Prior work differs in whether it supports only infallible conversions, or fallible conversions, too. The convute crate's TryTransmute<T>
trait provides a method, can_transmute
, that returns true a transmutation from Self
to T
is valid for a particular value of &self
. An early version of typic abstracted a similar mechanism into an Invariants
trait, with additional facilities for error reporting. Draft-RFC: Compatible
/TryCompatible
employs a similar mechanism to typic.
Typic removed support for fallible transmutation after reckoning with several challenges:
- The causes of uncertain failure could be language-imposed (e.g., alignment or validity requirements), or library imposed (i.e., invariants placed on a structure's private fields).
- The points of uncertain failures could be arbitrarily 'deep' into the fields of a type.
- Error reporting incurs a runtime cost commensurate with the detail of the reporting, but the detail of reporting required by end-user depends on use-case, not just type. For instance: for some use-cases it may be necessary to know where and why a byte was not a valid
bool
; in others it may be sufficient to know simply whether an error occurred.
Finally, we observed that the mechanisms of fallible transmutation were basically separable from the mechanisms of infallible transmutation, and thus these challenges could be addressed at a later date. For these reasons, our RFC only addresses infallible transmutation.
While this RFC does not provide a grand, all-encompassing mechanism for fallible transmutation, the fundamental mechanisms of our RFC are useful for constructing safer, purpose-built fallible conversion mechanisms; e.g.:
/// Try to convert a `&T` into `&U`.
///
/// This produces `None` if the referent isn't appropriately
/// aligned, as required by the destination type.
pub fn try_cast_ref<'t, 'u, T, U>(src: &'t T) -> Option<&'u U>
where
&'t T: TransmuteInto<&'u U, NeglectAlignment>,
{
if (src as *const T as usize) % align_of::<U>() != 0 {
None
} else {
// Safe because we dynamically enforce the alignment
// requirement, whose static check we chose to neglect.
Some(unsafe { src.unsafe_transmute_into() })
}
}
In this approach, our RFC is joined by crates such as plain, bytemuck, dataview, safe-transmute, zerocopy, and byterepr, and several pre-RFCs (such as this and this). The ubiquity of these mechanisms makes a strong case for their inclusion in libcore.
Prior work differs in whether its API surface is flexible enough to support transmutation between arbitrary types, or something less.
Approaches supporting transmutations between arbitrary types invariably define traits akin to:
/// Indicates that `Self` may be transmuted into `Dst`.
pub unsafe trait TransmuteInto<Dst>
{ ... }
/// Indicates that `Self` may be transmuted from `Dst`.
pub unsafe trait TransmuteFrom<Src>
{ ... }
This approach, taken by our RFC, is used by at least two crates:
- The convute crate's
Transmute<T>
trait is akin to the above definition ofTransmuteInto
. - The typic crate's
TransmuteInto
andTransmuteFrom
traits almost exactly mirror the above definitions.
...and several proposals:
- Pre-RFC: Safe coercions proposes a
Coercible<A, B>
trait that is implemented ifA
is safely transmutable intoB
. - Pre-RFC:
FromBits
/IntoBits
proposes the traitsIntoBits<U>
andFromBits<T>.
- Draft-RFC:
FromBytes
proposes the traitsIntoBytes<U>
andFromBytes<T>.
- Draft-RFC:
Compatible
/TryCompatible
proposes the traitCompatible<U>
, akin to the above definition ofTransmuteInto
.
Other approaches adopt an API that only supports transmutation of a type into initialized bytes, and from initialized bytes. These approaches invariably define traits akin to:
/// Indicates that a type may be transmuted into an appropriately-sized array of bytes.
pub unsafe trait IntoBytes
{}
/// Indicates that a type may be transmuted from an appropriately-sized array of bytes.
pub unsafe trait FromBytes
{}
This is the approach taken by the zerocopy crate, and the Pre-RFC: Safe Transmute and Pre-RFC: Safe Transmute v2 proposals.
This approach is strictly less flexible than an API supporting transmutation between arbitrary types. It is incapable of representing transmutations of bytes into types with validity constraints, and incapable of representing transmutations of types with padding bytes into bytes.
Supporting additional transmutation source and destination types requires a commensurate addition of conversion traits. For instance, some of zerocopy's users require a trait that reflects types which can be transmuted from a buffer of zeroed bytes. This would require introducing an additional trait, FromZeros
.
An advantage of this API is that it gives descriptive names to perhaps the two most common transmutations. However, an API providing transmutation between arbitrary types can encode FromBytes
and IntoBytes
:
// `Dst` is `FromBytes` if it can be safely transmuted *from* an
// equivalently sized array of `u8`.
unsafe impl<Dst> FromBytes for Dst
where
Dst: TransmuteFrom<[u8; size_of::<Dst>()]>,
{}
// `Src` is `IntoBytes` if it can be safely transmuted *into* an
// equivalently sized array of `u8`.
unsafe impl<Src> IntoBytes for Src
where
Src: TransmuteInto<[u8; size_of::<Src>()]>,
{}
For these reasons, we argue that a FromBytes
/ToBytes
style API is a poor foundation for in-language safe transmutation.
Finally, many approaches (especially crates) supply a marker trait that represents "plain old data"; e.g.:
/// Implemented by types that are "plain old data":
pub unsafe trait PlainOldData
{}
This sort of trait is present in crates such as plain, bytemuck, dataview, safe-transmute, and pod, and at least two language proposals (here and here).
The exact definition of what constitutes "plain old data" varies between crates. One simple definition is that a type T
is "plain old data" if it can be transmuted both from and into initialized bytes; i.e.:
unsafe impl<T> PlainOldData for T
where
T: FromBytes + IntoBytes,
{}
This definition precludes useful transmutations. For instance, MaybeUninit<u8>
is transmutable from a u8
, but not into a u8
.
Given this inflexibility, we argue that this approach is a poor foundation for in-language safe transmutation.
Not only does prior work differ in which traits are used to encode valid transmutations, they differ in the level of user intervention required to take advantage of the traits.
Fully manual approaches require type authors to implement the transmutation traits manually. The involved traits are unsafe
, so it is up to type authors to verify for themselves that their hand-written implementations are sound. This is the approach taken by crates such as plain, bytemuck, safe-transmute, and pod, and at least one language proposal: Pre-RFC: Safe Transmute (which advocates for a "plain old data" API).
In semi-manual approaches, type authors simply derive
the applicable traits, using derive
macros that produce a compile-error if the implementation is not sound. This approach is realized by crates such as (zerocopy, zeroable and dataview) and advocated by at least two language proposals: Pre-RFC: Safe Transmute v2 (which advocates for a FromBytes
/IntoBytes
-style API), and Pre-RFC FromBits/IntoBits (which advocates for a general-transmutation API).
We believe that the implementation burden these approaches place on end-users, and their inflexibility, make them a poor foundation for in-language safe transmutation:
- These approaches require authors to implement and, potentially, verify a large number of
unsafe
traits, ranging from O(n) implementations for plain-old-data trait approaches, to potentially many more. - These approaches are generally impractical for APIs that permit truly general transmutation, as type authors can only construct implementations of the transmutation traits for types they have at their disposal.
- These approaches conflate transmutation stability with transmutation safety. An end-user wishing to transmute a type for which its author has not manually implemented the applicable traits must resort to the wildly unsafe
mem::transmute
.
Automatic approaches implement the transmutation traits without user intervention, whenever it is sound to do so. This is the approach taken by our RFC. Automatic mechanisms appear in at least four prior language proposals:
- Pre-RFC: Safe coercions
- Draft-RFC:
from_bytes
- Pre-RFC: Trait for deserializing untrusted input
- Draft-RFC:
compatible_trait
The typic crate mocks a fully-automatic approach: its TransmuteFrom
trait is usable with any types that are repr(C)
, or otherwise have a well-defined memory layout. (In practice, since Rust lacks reflection over type definitions, repr(C)
annotations much be changed to typic::repr(C)
.)
Fully automatic approaches introduce, at the very least, a stability hazard: they supply a safe constructor for types, without the consent of those types' authors. If a type author hid the internals of their type because they do not wish for its implementation details to become a part of the type's API for SemVer purposes, an automatic transmutation mechanism subverts that intent.
No attempt to avoid this hazard is made by most of the proposals featuring automatic mechanisms; e.g.:
The automatic mechanism proposed by Pre-RFC: Safe coercions exploits field visibility, requiring that all fields that have different types in Src
and Dst
are visible at the location where the coercion is made. This approach falls short in three respects:
- Confining the visibility requirement only to fields of different types is insufficient; two different types with identical field types may subject those fields to different invariants.
- The 'location' where the coercion is made is ill-defined; the presence of the proposed
Coercible
trait may be far-removed from the location of the actual conversion (if any conversion occurs at all). - Field visibility stabilizes the structure of a type, but not its layout (e.e., its size).
Our RFC exploits the related concept of constructability, which is a property of a struct, or enum variant (rather than solely a property of fields). However, we recognize that it may be difficult to test for constructability within the trait resolution process.
The simplified definition of constructability we propose is the same employed by typic (which uses the term "visibility"). Typic regards the pub-in-priv soundness hole of the simplified definition to be sufficiently niche that NeglectStability
remains "safe". However, unlike typic, we believe that this simplified definition imposes a safety hazard substantial enough to warrant making NeglectStability
initially usable with only unsafe transmutes.
Our RFC separates constructability, which concerns what aspects of a type's structure are part of its public API, and stability, which concerns the aspects of a type's layout that are part of its public API for SemVer purposes. This distinction does not appear in prior work.
Haskell's Coercible
typeclass is implemented for all types A
and B
when the compiler can infer that they have the same representation. As with our proposal's TransmuteFrom
trait, instances of this typeclass are created "on-the-fly" by the compiler. Coercible
primarily provides a safe means to convert to-and-from newtypes, and does not seek to answer, for instance, if two u8
s are interchangeable with a u16
.
Haskell takes an algebraic approach to this problem, reasoning at the level of type definitions, not type layouts. However, not all type parameters have an impact on1 a type's layout; for instance:
#[repr(C)]
struct Bar<U>(PhantomData<U>);
#[repr(transparent)]
struct Foo<T, U>(T, Bar<U>);
Foo
's layout is impacted solely by T
, not U
, but this isn't necessarily clear by looking at the definition of Foo
. To reason about these scenarios, Haskell introduces the concept of type parameter roles—labels that denote the relationship of a type parameter to coercibility.
Our RFC does not need the concept of roles, because it does not attempt to abstractly reason about type definitions. Rather, it reasons about type layouts. This example, for instance, does not pose a challenge to our proposal:
trait SomeTrait { type AssociatedType; }
#[repr(C)]
struct MyStruct<T: SomeTrait>(pub T, pub T::AssociatedType);
For a particular T
, MyStruct<T>
will have a particular layout. Our proposed TransmuteFrom
trait reasons about the
layouts of types (which are fully concrete), not the definitions (which may be somewhat abstract).
The following unresolved questions should be resolved during the RFC process:
We endeavored to design an API surface with (nearly) zero compromises. However, if you have a use-case that you believe is neither satisfied outright by our proposal, nor aided by our proposal, we would urgently like to hear of it.
In Future Possibilities, we propose a number of additional abstractions that are aided by this RFC. Some of these abstractions are commonplace in prior art and should perhaps be included with this RFC. Some of our proposed extensions could begin their crates that work on stable Rust; others, such as generic atomics, require nightly-only intrinsics.
The following unresolved questions should be resolved before feature stabilization:
We observe that our proposed model for stability declaration, although very expressive, does not permit type authors to declare the stability of unsafe transmutations. Alongside that observation, we suggest a SemVer-compatible upgrade of the stability declaration traits that may resolve this shortcoming.
While it is unclear if there is any demand for this degree of flexibility, this upgrade-path should be carefully considered before stabilizing (and thus committing) to this RFC's layout stability declaration traits.
We consider the following unresolved questions to be out-of-scope of this RFC process:
TransmuteFrom
and TransmuteInto
require that the destination type has a matching constructor in which all fields are marked pub
. Conspicuously missing from this RFC is a NeglectConstructability
unsafe option to disable this check.
The omission is intentional. The consequences of such an option are suprising in both their subtlety and their unsafety. Some of unsafe Rust's hairiest interactions lie at the intersections of !Send
, !Sync
, UnsafeCell
and restricted field visibility. These building blocks are used to build safe, public abstractions that encapsulate unsafe, hidden internals.
See here.
See here.
See here.
See here.
See here.
See here.