Capabilities using Plain Traits

In a recent post, I touched upon the "contexts and capabilities" feature idea; after more thought and exciting discussions with Boxy, Jana and others, there's a bullet we're tempted to bite: what if we just used plain old trait bounds?

This has surely been proposed before but I didn't find much on a quick search, if anyone knows a good presentation of similar ideas DM me so I can give credit.

Note

Terminology note: I've been using "capabilities" to describe "something in the signature of a function that allows some operations within it". This is distinct from "contexts", which is the word I use for "implicit argument-passing". A capability is just a ZST context. Finally, both of these are special cases of "effects", which propagate similarly between functions but can also allow altering the control-flow.

Conditional Compilation using Trivial Bounds

A "trivial bound" is a trait bound that mentions no generic parameter; these are allowed under a nightly feature. For my first trick, I'll use them to replace #[cfg(..)] annotations.

Imagine the standard library contained this:

#![feature(trivial_bounds)]
pub struct Env; // A normal struct

pub trait Unix {}
pub trait Windows {}
pub trait Linux: Unix {}

#[cfg(unix)]
impl Unix for Env {}
#[cfg(linux)]
impl Linux for Env {}
// etc

// In std::os::unix::fs
pub fn chown(...)
where
    Env: Unix // a normal trait bound
{ ... }

Now say I call std::os::unix::fs::chown in my crate. If I only compile on unix systems, things will just work. If I compile for other targets too, I can write:

use core::capabilities::{Env, Unix};

fn unix_helper()
where Env: Unix
{
    // ...
    std::os::unix::fs::chown(...)?;
}

which just works: the trait solver is fine with assuming a trait bound that's blatantly not yet implemented. The benefit: all the code gets typechecked, even the code that cannot run on the current target.

Boolean Operations

With cfg, we can do arbitrary boolean operations. Things are a bit different with capabilities.

For &&, just use two bounds: where Env: Unix, Env: PointerWidth<u32>;

For ||, declare a marker trait:

#[marker] // this allows impls to overlap
trait LinuxOrWindows {}
impl<T: Linux> LinuxOrWindows for T {}
impl<T: Windows> LinuxOrWindows for T {}

For negation, we have to think differently. Trait bounds are a positive notion, and while negative trait bounds have been discussed since the dawn of traits, I won't hold my breath for them. That's generally not a problem though: being "not on unix" doesn't allow new behaviors, a function with #[cfg(not(something))] would generally also compile without the annotation¹.

Branching on Available Capabilities

So we come to branching: how do we replace something like the following?

#[cfg(windows)]
fn my_function() { ... }
#[cfg(not(windows))]
fn my_function() { ... }

The answer uses my favorite WIP trait feature: maybe bounds². In short: a T: maybe Trait bound is satisfied whether or not T implements Trait in the calling environment, and the callee can branch depending on whether it does³.

The function above becomes:

fn my_function()
where Env: maybe Windows
{
    if is_implemented!(Env: Windows) { // magic syntax :3
        // In this scope, the compiler knows that `Env: Windows`!
        windows_api()
    } else {
        fallback_api()
    }
}

Of course you don't want to have to write this bound everywhere. Here's how the standard library can declare this bound to be always available:

pub trait Environment: maybe Windows + maybe Unix + ... {}
impl Environment for Env {}

What this impl does is that knowledge of Env: Environment (which is justified by the impl) is enough to be allowed to ask whether Env: Windows. So now any function can ask whether is_implemented!(Env: Windows): our function doesn't even need the Env: maybe Windows bound.

Target Features

Many CPU architectures include optional instructions that may not be implemented by all CPUs, e.g. SIMD stuff. To make use of these special instructions, Rust has a mechanism called "target features" (see also target_features 1.1⁴) to track at runtime and compile time whether a given set of instructions can be used. This fits quite well in our traits model.

The main part of this feature is that a function can opt-in to being compiled with a specific extended instruction set:

#[target_feature(enable = "avx")]
fn times_two_avx(v: &mut [f64]) {
    for v in v {
        *v *= 2.0;
    }
}

This function can be called safely from a context where the compiler knows the feature is enabled, or unsafely from a context where it doesn't.

I propose, as you might expect, to write that instead as follows:

fn times_two_avx(v: &mut [f64])
where
    Env: Avx
{
    for v in v {
        *v *= 2.0;
    }
}

How can that change how the function is compiled? impl Mul for f64 and similar impls would be changed to look like:

impl Mul for f64
where
    Env: maybe Avx + maybe Avx512f + maybe Aes + ...
{
    type Output = f64;
    fn mul(self, rhs: f64) -> Self::Output {
        if is_implemented!(Env: Avx) {
            ...
        } else {
            ...
        }
    }
}

By the very way maybe bounds work, making use of this method in a function with a where Env: Avx bound will cause the impl to know Avx is available, which allows it to call a different instruction/intrinsic. The codegen backend can then vectorize that using the architecture-specific instructions.

In other words, the trait bound is not magic, only the selected instruction/intrinsic carries the knowledge of the available target feature(s)⁵. This replaces the built-in feature tracking done by target_feature 1.1 with plain trait bounds.

Runtime-Dependent Trait Bounds?

Now, the main point of this feature is that we don't know at compile-time which CPU the binary will be running on. So unlike for cfg above, there won't ever be an impl Avx for Env⁶.

So then, starting from a fn main() { ... } that doesn't have any maybe bounds, how do we even get to call a function with Env: Avx? Well, with a magic macro again:

if is_x86_feature_detected!("avx") {
    // In this scope, `Env: Avx` holds
}

This would use the same kind of magic that maybe bounds use for control-flow dependent trait bounds. Maybe this expands to something like if builtin_is_x86_feature_detected!("avx") && unsafe { assert_implemented_unchecked!(Env: Avx) }. Unlike is_implemented!(..) which is purely compile-time information, which branch is taken here is unknown until runtime.

Target Feature Multiversioning

A notable limitation with the feature as it exists is that one must write one function for each desired target feature, in order for each to be compiled with the desired instruction set.

A common trick relies on inlining to be able to define the core computation once and make feature-specific wrappers around it:

#[inline(always)]
fn times_two_generic(v: &mut [f64]) {
    for v in v {
        *v *= 2.0;
    }
}

#[target_feature(enable = "avx")]
fn times_two_avx(v: &mut [f64]) {
    times_two_generic(v);
}

#[target_feature(enable = "avx512f")]
fn times_two_avx512f(v: &mut [f64]) {
    times_two_generic(v);
}

pub fn times_two(v: &mut[f64]) {
    if is_x86_feature_detected!("avx512f") {
        unsafe { times_two_avx512f(v); }
    } else if is_x86_feature_detected!("avx") {
        unsafe { times_two_avx(v); }
    } else {
        times_two_generic(v);
    }
}

This works because inlining a function into a scope with more features allows it to make use of the extra features, and the inline(always) forces the code of times_two_generic to be codegenned twice, once inside each wrapper.

This isn't great ergonomics, so there's a proposal called struct_target_features that makes clever use of generics to get a single function to compile under many targets. It turns out our approach offers the same benefits.

This is how we'd write that example⁷:

// Defined once in `core`.
pub trait TargetFeatures = maybe Avx + maybe Avx512f + maybe Aes + ...;

fn times_two_generic(v: &mut [f64])
where
    Env: TargetFeatures
{
    for v in v {
        *v *= 2.0;
    }
}

pub fn times_two(v: &mut[f64]) {
    if is_x86_feature_detected!("avx512f") {
        times_two_generic(v);
    } else if is_x86_feature_detected!("avx") {
        times_two_generic(v);
    } else {
        times_two_generic(v);
    }
}

Here's how it works: the TargetFeatures trait bound is an alias for a whole bunch of maybe bounds. Each of them acts like a const FOO_IS_IMPLEMENTED: bool generic argument, which is true if the corresponding bound was available when the function got called.

This means that the function will get compiled as many times as there are feature combinations under which it is called. In our case, that's three different versions.

Compared to struct_target_features, we're actually doing almost the same thing: reusing generics for feature tracking and multiple compilation of a same function⁸. We however have the benefit of using a feature already well-integrated into the language. Here's one thing we can do that I believe struct_target_features cannot: safely coerce to function pointers⁹.

pub fn time_two_fn_ptr() -> fn(&mut [f64]) {
    if is_x86_feature_detected!("avx512f") {
        times_two_generic
    } else if is_x86_feature_detected!("avx") {
        times_two_generic
    } else {
        times_two_generic
    }
}

Tracking Whether a Function Can Unwind

Let us now imagine that the start_panic intrinsic takes where Env: Unwind (when built with panic=unwind). To avoid breaking the world, we'll also say that every standalone function as well as every trait and trait impl gets an implicit where Env: Unwind bound.

We can now write:

fn doesnt_panic() -> u32
where
    Env: ?Unwind
{
    42
}

Because we cannot prove Env: Unwind inside this function, calling this function cannot unwind. A caller of this function would be able to skip codegenning an unwind path. An unsafe caller of this function would be able to rely on not-unwinding to make their unsafe code easier to write.

We could imagine having special syntax like nounwind fn for that, if we wanted. However note that the story gets subtler in the presence of generics, so nounwind could be a misleading name, see next section.

Functions Are Capability-Generic By Default

The natural question with any sort of "effect"-like system such as this is: how does this compose? Our answer is: capabilities flow through trait bounds.

Take this standard library function:

impl<T> Option<T> {
    pub fn map<F, U>(self, f: F) -> Option<U>
    where
        Env: ?Unwind,
        F: FnOnce(T) -> U
    { ... }
}

The Env: ?Unwind bound prevents the body of map from proving Env: Unwind, so it cannot call panic or unwrap or any function that requires that bound. But watch what happens when we pass it a closure:

let x = None;
let y = x.map(|_| panic!());

Here the closure type implements FnOnce, and the impl would look like

impl FnOnce for closure
where Env: Unwind
{ ... }

Nothing in the signature of map prevents this. What happens instead is that, in order to know whether a given function call can unwind, we need to check all of its trait clauses. If proving any of the clauses used a Env: Unwind bound somewhere, then the call may unwind¹⁰.

That's why I find the nounwind fn syntax misleading: this seems to say that the function can never unwind, whereas Env: ?Unwind only means "this function cannot unwind by itself"¹¹.

We can however know that opt.map(|x| (x, 0)) won't unwind.

Note

Or, well, I hope we can, but I still haven't figured out how the compiler would decide what Env bounds to add to the automatic impl FnOnce for a closure. If it emits a naive impl FnOnce, the implicit Env: Unwind bound means all closures are assumed to panic. Maybe these traits get a special annotation so that they're propagated onto impl FnOnce impls? At worst, we'll need a special syntax like opt.map(with(Env: ?Unwind) |x| (x, 0)) (see the last section for a slightly less ugly syntax option).

Negative Capability

With the above, we can sometimes tell that a given function call cannot unwind. What would be also very useful would be to say "give me a closure that cannot unwind". For this we need a new idea:

fn foo<F>(f: F)
where
    F: (FnOnce() -> u32) without (Env: Unwind)
{ ... }

I'm stretching the imagination of trait-related features here, but this is rather simple to explain: this function accepts an F if proving its FnOnce bound did not make use of any Env: Unwind bound.

In the world where we construct explicit proofs for all trait facts, this says: "give me a proof of F: FnOnce() -> u32 that doesn't use any proof of Env: Unwind". Such a closure necessarily cannot panic.

Note

I've been using closures as examples but this works with any trait, e.g. T: Clone without (Env: Unwind).

The Signature of `catch_unwind`

There remains one mystery to complete this picture: how to write catch_unwind. For that I'll pull a final trait solver feature out of my hat: implication bounds¹².

pub fn catch_unwind<F, R>(f: F) -> Result<R, Box<dyn Any>>
where
    Env: ?Unwind,
    (Env: Unwind) => (F: FnOnce() -> R)
{ ... }

When calling this function with a particular F, the trait solver tries as usual to prove that F: FnOnce() -> R. The difference is that on top of the things it is normally allowed to assume in the caller context, it may also assume Env: Unwind.

When determining whether a given call to catch_unwind may itself unwind, the trait solver will then always find that none of the trait proofs use a Env: Unwind coming from "outside", which is our criterion for knowing that the call cannot unwind.

As for the body of catch_unwind, it might look something like:

unsafe {
    assert_implemented_unchecked!(Env: Unwind);
    // Here `F: FnOnce() -> R` holds for real.
    actually_catch_unwind(f) // does the real work
}

`const` Traits In This Model

const, as in const fn, is much like nounwind: it's a "negative capability", or rather the removal of a capability that's there by default. We can reuse all the ideas we've seen to express this in our framework: we'll call the capability "Runtime", and add it by default on toplevel functions, traits, and trait impls.

/// `Env: Runtime` gives access to interaction with the os, filesystem, etc.
/// Everything that `const fn`s cannot do.
pub trait Runtime {}

// Has implicit `Env: Runtime`
fn foo() { ... }

// Has no implicit `Env: Runtime`
const fn foo() { ... }
// is that same as:
fn foo() where Env: ?Runtime { ... }

// Has an implicit `Env: Runtime`
trait Clone: Sized { ... }

// Has no implicit `Env: Runtime`
const trait Clone: Sized { ... }
// is the same as:
trait Clone: Sized
where
    Env: ?Runtime
{ ... }

// Has an implicit `Env: Runtime`
impl Clone for Foo { ... }

// Has no implicit `Env: Runtime`
const impl Clone for Foo { ... }
// is the same as:
impl Clone for Foo
where
    Env: ?Runtime
{ ... }

This is a rather straightforward translation; the two views are rather concordant.

I'm a bit out of touch with the latest state of the const trait syntax discussions, but if I recall one of the syntax options was:

const fn foo<T>()
where
    T: Clone,
{ ... }
// is the same as:
fn foo<T>()
where
    Env: ?Runtime, // the function doesn't do any runtime ops...
    Env: Runtime => T: Clone, // ...even if the `T: Clone` does


const fn foo<T>()
where
    T: [const] Clone,
{ ... }
// is the same as:
fn foo<T>()
where
    Env: ?Runtime, // the function doesn't do runtime ops itself...
    T: Clone, // ...but could propagate those of the `Clone` impl


const fn foo<T>()
where
    T: const Clone,
{ ... }
// is the same as:
fn foo<T>()
where
    Env: ?Runtime, // the function doesn't do runtime ops itself...
    T: Clone without (Env: Runtime), // ...and the `Clone` impl cannot either

This can express everything that the const trait proposal includes, and even more. This may be a drawback, e.g. it's possible to express "a method that can do runtime stuff" inside a const trait, and even maybe Runtime bounds i.e. functions that behave differently depending on whether they're in a runtime context or not¹³.

Bonus Fun Ideas

Selfless traits

This dummy Env type is rather useless. We could imagine traits that don't have a Self type:

#[selfless]
pub trait Unix {}

#[cfg(unix)]
impl Unix {}

Here's the various syntaxes we introduced, if the traits become selfless:

pub fn chown(...)
where Unix
{ ... }

fn my_function()
where maybe Windows
{
    if is_implemented!(Windows) {
        ...
    }
}

fn doesnt_panic() -> u32
where ?Unwind
{
    42
}

fn foo<F>(f: F)
where
    F: (FnOnce() -> u32) without Unwind
{ ... }

pub fn catch_unwind<F, R>(f: F) -> Result<R, Box<dyn Any>>
where
    ?Unwind,
    Unwind => (F: FnOnce() -> R)
{ ... }

opt.map(with(?Unwind) |x| (x, 0))

I think that's cute.

Precise Capabilities in the Standard Library

These capabilities are just traits, we can define however many we want! Here's what we could have in the standard library:

/// Allows unwinding.
trait Unwind {}

/// Allows loops that many not terminate, i.e. over iterators that don't implement
/// some `unsafe trait FiniteIterator`
trait NonTermination {}

/// Allows access to non-deterministic APIs.
trait NonDet {}

/// Allows reading/writing to file/network/etc.
trait Io: NonDet {}

/// Allows using the global allocator.
trait Alloc {}

/// Allows all interactions with the runtime (e.g. cpu info, filesystem access).
trait Runtime: NonTermination + Io + Alloc {}

/// Allows interacting with the compile-time (e.g. reflection, type info).
trait CompileTime {}

/// All of the platform `cfg`s we talked about.
trait Unix: Io {}

Uh-oh we got a bit of an effect system, haven't we :3

Moar Effects

All of these cool trait features work equally well if traits can carry values. That gives us what I've been calling "contexts", and Jana has a cool idea for how this can be made a lot more non-breaking than I wrote in my blog post.

The final frontier then, is control-flow-affecting effects. I and others smell that the same trait-bound-based story could get us there. Stay tuned for more blog posts.

Conclusion

Here's what we've just been through:

I introduced three lightly crazy trait features:
- maybe bounds T: maybe Unwind;
- implication bounds (Env: Unwind) => (T: Clone);
- "bound filters" (T: Clone) without (Env: Unwind).
Using that and cleverly-chosen trait bounds, we subsumed a whole bunch of real or wanted language features: type-checked #[cfg], target-feature-generic functions, nounwind fn, and const traits.

I hope this little exploration has gotten you excited for this new way of using trait bounds! There's more to come, in future blog posts by me or others. By and large the enabling feature was maybe bounds; I hope we get them in the language!

This was a dense blog post, and I didn't spend a lot of time explaining the advanced features I use. Sorry :D Ask below if you'd like some specific clarification! You can also DM me on the rust-lang Zulip.

This isn't fully true: some std APIs use negation, e.g. std::os::unix::fs::chroot has #[cfg(all(unix, not(target_os = "fuschia")))]. The solution for these it to rephrase it as a positive capability: trait HasChroot: Unix or at worst trait UnixNotFuschia: Unix. ↩
Recently described here by @lcnr, but proposed before, e.g. here. ↩
This is a lightweight form of specialization. ↩
Thanks Luca Versari for telling me about it! I initially wrote this blog post without knowing the 1.1 version existed. ↩
Well, this is a cute model but it probably breaks down in a bunch of ways. Worst case we can make the trait bounds magic and have the same meaning as the attributes do today. Also I haven't thought about ABI-altering target features; could that be a maybe bound on the type decl, that would prevent equating f64 with f64 where Env: SomeFeature? I don't know. ↩
Actually #[cfg(target_feature = ...)] does exist, for when we force compilation for a specific instruction set, so we would materialize an impl then. But in the general case there may not be one. ↩
One notable drawback of this setup is that it's possible to accidentally call times_two_generic instead of times_two, thus losing the benefits of hardware acceleration. Beyond just making it non-pub, there are also ways we could also change TargetFeatures to prevent that. ↩
In some sense all this does is pass the struct_target_features magic struct value implicitly. See also this blog post for more on the idea of trait bounds carrying implicit values. ↩
To be precise about what's happening here: in order to coerce a generic function item to a function pointer, its generics must be instantiated and trait clauses proven. So things work like before: this will create three versions of the function, two of them hardware accelerated. ↩
This works very nicely if we construct explicit proofs for all trait facts: then you only have to walk the proof, looking for a proof of Env: Unwind being used. ↩
Even that is imprecise: if we define trait MyUnwind: Unwind {}, a function with where Env: MyUnwind + ?Unwind can perfectly well call panic!() directly. ↩
The full feature is probably a huge thing, but what I need for this only has marker traits with no generics to the left of the arrow, which I hope is simple enough to actually get it. ↩
To be very precise, maybe bounds cannot learn more than what the caller knows. It thus would be possible to call the const-version of that function at runtime, by calling it from an intermediate const fn. In that way maybe Runtime is not equivalent to const_eval_select. ↩

Conditional Compilation using Trivial Bounds

Boolean Operations

Branching on Available Capabilities

Target Features

Runtime-Dependent Trait Bounds?

Target Feature Multiversioning

Tracking Whether a Function Can Unwind

Functions Are Capability-Generic By Default

Negative Capability

The Signature of catch_unwind

const Traits In This Model

Bonus Fun Ideas

Selfless traits

Precise Capabilities in the Standard Library

Moar Effects

Conclusion

The Signature of `catch_unwind`

`const` Traits In This Model