Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RFC: Pointer metadata & VTable #2580

Merged
merged 31 commits into from
Jan 29, 2021
Merged
Changes from 3 commits
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
0bb757e
RFC: DynTrait and VTable
SimonSapin Oct 25, 2018
d4acf9b
Rename to vtable RFC to ptr-meta
SimonSapin Oct 26, 2018
8de354c
RFC: Pointer metadata
SimonSapin Oct 26, 2018
24a2ff8
Pointer metadata: add bounds on the associated type
kennytm Oct 28, 2018
dd3de2f
Pointer metadata: typo fix
oli-obk Oct 28, 2018
637647f
Pointer metadata: typo fix
oli-obk Oct 28, 2018
99fbed9
Pointer metadata: typo fix
oli-obk Oct 28, 2018
74e185c
Pointer metadata: typo fix
oli-obk Oct 28, 2018
65418d0
Pointer metadata: grammar
oli-obk Oct 28, 2018
ef9ee32
Pointer metadata: grammar
oli-obk Oct 28, 2018
f6d95d7
Pointer metadata: grammar
oli-obk Oct 28, 2018
e1aa0df
Pointer metadata: fix unfinished sentences
SimonSapin Oct 28, 2018
18f1fef
Pointer metadata: remove paragraph made redundant
SimonSapin Oct 28, 2018
3dd1484
Pointer metadata: remove `VTable::drop_in_place`?
SimonSapin Oct 31, 2018
8ede231
Pointer metadata: more NonNull API
SimonSapin Oct 31, 2018
fb6ebad
Pointer metadata: generic code can assume `T: Pointee`.
SimonSapin Oct 31, 2018
bf80163
Pointer metadata: rewrite Guide-level explanation with more useful ex…
SimonSapin Oct 31, 2018
2edbe38
Pointer metadata: more unresolved questions
SimonSapin Oct 31, 2018
bb5e09a
Pointer metadata: typo
SimonSapin Nov 14, 2018
bc2a2e7
Pointer metadata: add an `Metadata: Unpin` bound
SimonSapin Nov 14, 2018
7be2349
Pointer metadata RFC: remove `VTable::drop_in_place`
SimonSapin Jun 21, 2019
538b9aa
Pointer metadata RFC: replace `&'static VTable` with `DynMetadata`
SimonSapin Jun 21, 2019
e47aa6e
Pointer metadata RFC: mention extern types in doc-comment.
SimonSapin Jun 22, 2019
29c7547
Pointer metadata RFC: use `NonNull` instead of `&'static` for vtable …
SimonSapin Sep 13, 2020
1e5622f
Pointer metadata RFC: add unresolved question for parameterizing `Dyn…
SimonSapin Sep 13, 2020
04ba25f
Pointer metadata RFC: the Drop impl example written two years ago app…
SimonSapin Sep 14, 2020
f94f627
Pointer metadata: remove 'static bound on the Metadata associated type
SimonSapin Dec 29, 2020
236e657
Pointer metadata: add unresolved question for `into_raw_parts`
SimonSapin Dec 29, 2020
7b0175e
Pointer metadata: parameterize `DynMetadata` over its `dyn Trait` type
SimonSapin Dec 29, 2020
3ff39f0
Typo fix
SimonSapin Jan 23, 2021
50b567b
RFC 2580: Pointer metadata
KodrAus Jan 29, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
320 changes: 320 additions & 0 deletions text/0000-ptr-meta.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,320 @@
- Feature Name: `ptr-meta`
- Start Date: 2018-10-26
- RFC PR:
- Rust Issue:

# Summary
[summary]: #summary

Add generic APIs that allow manipulating the metadata of fat pointers:

* Naming the metadata’s type (as an associated type)
* Extracting metadata from a pointer
* Reconstructing a pointer from a data pointer and metadata
* Representing vtables, the metadata for trait objects, as a type with some limited API

This RFC does *not* propose a mechanism for defining custom dynamically-sized types,
but tries to stay compatible with future proposals that do.


# Background
[background]: #background

Typical high-level code doesn’t need to worry about fat pointers,
a reference `&Foo` “just works” wether or not `Foo` is a DST.
But unsafe code such as a custom collection library may want to access a fat pointer’s
components separately.

In Rust 1.11 we *removed* a [`std::raw::Repr`] trait and a [`std::raw::Slice`] type
from the standard library.
`Slice` could be `transmute`d to a `&[U]` or `&mut [U]` reference to a slice
as it was guaranteed to have the same memory layout.
This was replaced with more specific and less wildly unsafe
`std::slice::from_raw_parts` and `std::slice::from_raw_parts_mut` functions,
together with `as_ptr` and `len` methods that extract each fat pointer component separatly.

For trait objects, where we still have an unstable `std::raw::TraitObjet` type
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
that can only be used with `transmute`:

```rust
#[repr(C)]
pub struct TraitObject {
pub data: *mut (),
pub vtable: *mut (),
}
```

[`std::raw::Repr`]: https://doc.rust-lang.org/1.10.0/std/raw/trait.Repr.html
[`std::raw::Slice`]: https://doc.rust-lang.org/1.10.0/std/raw/struct.Slice.html
[`std::raw::TraitObjet`]: https://doc.rust-lang.org/1.30.0/std/raw/struct.TraitObject.html


# Motivation
[motivation]: #motivation

We now have APIs in Stable Rust to let unsafe code freely and reliably manipulate slices,
accessing the separate components of a fat pointers and then re-assembling them.
However `std::raw::TraitObject` is still unstable,
but it’s probably not the style of API that we’ll want to stabilize
as at encourages dangerous `transmute` calls.
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
This is a “hole” in available APIs to manipulate existing Rust types.

For example [this library][lib] stores multiple trait objects of varying size
in contiguous memory together with their vtable pointers,
and during iteration recreates fat pointers from separate data and vtable pointers.

The new `Thin` trait alias also expanding to [extern types] some APIs
that were unnecessarily restricted to `Sized` types
because there was previously no way to express pointen-thinness in generic code.
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved

[lib]: https://play.rust-lang.org/?version=nightly&mode=debug&edition=2015&gist=bbeecccc025f5a7a0ad06086678e13f3


# Guide-level explanation
[guide-level-explanation]: #guide-level-explanation


For low-level manipulation of trait objects in unsafe code,
new APIs allow accessing the components of a fat pointer:

```rust
use std::ptr::{metadata, VTable};

fn callback_into_raw_parts(f: Box<Fn()>) -> (*const (), &'static VTable) {
let raw = Box::into_raw(f);
(raw as _, metadata(raw))
}
```

… and assembling it back again:

```rust
fn callback_from_raw_parts(data: *const (), vtable: &'static VTable) -> Box<Fn()> {
Box::from_raw(<*mut Fn()>::from_raw_parts(new_data, vtable))
}
```

`VTable` also provides enough information to manage memory allocation:

```rust
// Equivalent to letting `Box<Fn()>` go out of scope to have its destructor run,
// this only demonstrates some APIs.
fn drop_callback(f: Box<Fn()>) {
let raw = Box::into_raw(f);
let vtable = metadata(raw);
unsafe {
vtable.drop_in_place(raw as *mut ());
std::alloc::dealloc(raw as *mut u8, vtable.layout());
}
}
```

# Reference-level explanation
[reference-level-explanation]: #reference-level-explanation

(Parts of this assumes that [trait aliases] are implemented.
If they are not implemented by the time this RFC is,
the `Thin` alias can be replaced by its definition.)

The APIs whose full definition is found below
are added to `core::ptr` and re-exported in `std::ptr`:

* A `Pointee` trait,
implemented automatically for all types
(similar to how `Sized` and `Unsize` are implemented automatically).
* A `Thin` [trait alias].
If this RFC is implemented before type aliases are,
uses of `Thin` should be replaced with its definition.
* A `metadata` free function
* A `from_raw_parts` constructor for each of `*const T` and `*mut T`

The bounds on `null()` and `null_mut()` function in that same module
as well as the `NonNull::dangling` constructor
are changed from (implicit) `T: Sized` to `T: ?Sized + Thin`.
This enables using those functions with [extern types].

For the purpose of pointer casts being allowed by the `as` operator,
a pointer to `T` is considered to be thin if `T: Thin` instead of `T: Sized`.
This similarly includes extern types.

`std::raw::TraitObject` and `std::raw` are deprecated and eventually removed.

[trait alias]: https://github.com/rust-lang/rust/issues/41517
[extern types]: https://github.com/rust-lang/rust/issues/43467

```rust
/// This trait is automatically implement for every type.
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
///
/// Raw pointer types and referenece types in Rust can be thought of as made of two parts:
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
/// a data pointer that contains the memory address of the value, and some metadata.
///
/// For statically-sized types that implement the `Sized` traits,
/// pointers are said to be “thin”, metadata is zero-size, and metadata’s type is `()`.
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
///
/// Pointers to [dynamically-sized types][dst] are said to be “fat”
/// and have non-zero-size metadata:
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
///
/// * For structs whose last field is a DST, metadata is the metadata for the last field
/// * For the `str` type, metadata is the length in bytes as `usize`
/// * For slice types like `[T]`, metadata is the length in items as `usize`
/// * For trait objects like `dyn SomeTrait`, metadata is [`&'static VTable`][VTable]
///
/// In the future, the Rust language may gain new kinds of types
/// that have different pointer metadata.
///
/// Pointer metadata can be extracted from a pointer or reference with the [`metadata`] function.
/// The data pointer can be extracted by casting a (fat) pointer
/// to a (thin) pointer to a `Sized` type the `as` operator,
/// for example `(x: &dyn SomeTrait) as *const SomeTrait as *const ()`.
///
/// [dst]: https://doc.rust-lang.org/nomicon/exotic-sizes.html#dynamically-sized-types-dsts
#[lang = "pointee"]
pub trait Pointee {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

so... I'm assuming the compiler implements

default impl<T: ?Sized> Pointee for T {
    type Metadata = &'static Vtable;
}
impl<T: Sized> Pointee for T {
    type Metadata = ();
}
impl Pointee for str {
    type Metadata = usize;
}
impl<T: Sized> Pointee for [T] {
    type Metadata = usize;
}

Which means theoretically we could make Vtable generic over T allowing the drop_in_place method to take a raw pointer with the correct pointee type?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These impls would be accurate in current Rust, but what I had in mind instead was that the compiler would automatically generate impls, similar to what it does for the std::marker::Unsize trait. As far as the standard library is concerned these impls would be "magic", not based on specialization.

Regardless, yes, making VTable generic with a type parameter for the trait object type is possible.

/// The type for metadata in pointers and references to `Self`.
type Metadata;
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
}

/// Pointers to types implementing this trait alias are
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missing the end of a sentence?

pub trait Thin = Pointee<Metadata=()>;
Copy link

@rustonaut rustonaut Sep 10, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe it's better to actually introduce thin pointers instead of using this trait alias.

The problem is that this trait alias doesn't necessary play well with structs containing DSTs (which might again be "composed" DSTs). It should be possible to have a thin pointer to such a struct and get access to all of it's fields without needing any unsafe code. Except the DST field which would if referenced just return a thin reference to the DST. For many thinks it would make writing unsafe code around all kinds of DST types much easier if we actually had thin pointers.

For example we could have something like (scatch):

#[repr(transparent)]
#[lang = "thin_ptr"] //see below
#[fundamental] //maybe?? I forgot what that did
pub struct Thin<T: ?Sized>(T) //EDIT: Didn't work out see below, it's now `ThinRef<'a,T>` (as well as `ThinMut`,`ThinConstPtr`, `ThinMutPtr`

//override auto-impl
unsafe impl<T> Pointee for Thin<T> {
    type Metadata = ()
}

The reason why it's a lang item is because it has a bit special deref handling, mainly:

  • given struct Foo { f1: usize, f2: usize, dst: [u8] }
  • given let foo: &Thin<Foo> = ...,
  • then &foo.f1 is of type &usize
  • then &foo.f2 is of type &usize
  • then &foo.dst is of type &Thin<[u8]>

This (or something similar) has some benefits:

  • We have a direct representation for thin slices (with unknown length): &Thin<[T]>,&mut Thin<[T]>, *const Thin<[T]>, *mut Thin<[T]>
    • No need to use unsafe "hacks" to get thin pointer
    • This works with all potential future pointer types (e.g. *raw or similar) which we maybe might add
  • We can make from from_raw_parts more type save by accepting *const Thin<T> instead of an *const ()

By keeping the type instead of using *const () we should need less complex unsafe code to e.g. implement a ThinBox for trait objects shown above we would "just" get the thin pointer to the DST field and then pass it with the metadata to from_raw_parts.

Sure this doesn't really eliminate the data alignment fixing done in data_ptr it just moves it from the user written library into the implementation of from_raw_parts (a Thin<dyn Trait> would not be guaranteed to be correctly aligned as we simply can't know the alignment without knowing the type so the from_raw_parts impl. would need to do what data_ptr does).

Still moving this tricky bits from user libraries to the core/std library seems to be a good idea I think and the improved type safety around from_raw_parts would be good too as (I think) in many case we don't handle completely erased types but just the erasure of the exact metadata used.

Through we probably still would need to have a way to create a *const Thin<dyn Trait> from a *const () for some ffi aspects.

(Note that a &Thin<dyn Trait> is kinda pointless but a &Thin<DSTContainingDynTrait> isn't as you still could access other fields safely).

Copy link

@rustonaut rustonaut Sep 11, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

An additional problem I found is that you can't have unaligned references in rust. It's UB.

So &Thin<dyn Trait> would need to "encapsulate" the unalignedness while not doing so for everything for which we know the alignment i.e. another reason why Thin would be a lang item. Keeping with how rust tends to do thinks it might be more appropriate to make Thin a "special" syntax. But then we would need to have thinks like &thin X, &mut thin X, or similar. But having a Thin<T> does just work very egonomical with the rest of rust.

(Solved below)

Another problem I found is that Thin<dyn Trait> or Thin<[u8]> as a unknown layout in the sense that:

  • We don't know the size for either.
  • We don't know the "internal" alignment/padding of the inner value in Thin<dyn Trait> (Thin<dyn Trait> is on the outside well but "unknown" aligned), as we can't move it in memory anyway due to now knowing the size we can just report an alignment of 1, but between the start of Thin<dyn Trait> and the data of dyn Trait there is some unknown padding to "fix" the alignment of the internal field.

But Layout::for_value(&Thin<[u8]>) does still need to return a value.

So either we return a "not so useful" value like alignment 1 and size of usize::MAX, which I don't like or we need to directly have Thin pointers instead of a Thin wrapper around the T: Pointee type.

Like ThinRef<'a,T>, ThinMut<'a,T>, ThinConstPtr, ThinMutPtr. (as e.g. #[...] struct ThinMutPtr<T>(ptr: *mut (), PhantomData<T>))

Which is slightly less elegant but would still work otherwise like described for Thin above.

This also would fix the inner alignment problem much easier as e.g. ThinRef<'a, T> could be implemented as #[...] struct ThinRef<'a, T> { ptr: *const (), marker: PhantomData<&'a T> } which works just fine for a not-necessary-correctly aligned ThinRef<'a, dyn Trait>.

The only real problem would be if we point to not-allocated memory. But we are guaranteed to point to allocated memory.
The only potential wrong thing is the alignment if the pointer is to a DST field. In which case we might point into the padding in front of the field instead of the field directly. Which is fine and expected (see the data_ptr example and what I wrote previously about it).

Copy link
Contributor Author

@SimonSapin SimonSapin Sep 13, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe it's better to actually introduce thin pointers instead of using this trait alias.

The Thin trait alias proposed here describes an existing property of some existing types and the existing pointer/reference types to those. *const u32 is a thin pointer, *const str is not. Therefore u32: Thin holds, str: Thin does not.

The problem is that this trait alias doesn't necessary play well with structs containing DSTs (which might again be "composed" DSTs). It should be possible to have a thin pointer to such a struct

In today’s Rust a struct that contains a DST field is itself also a DST and therefore pointers to it are wide, not thin.

Introducing thin pointers to DSTs (structs or not) is an entirely new language feature that is not part of this RFC. If you feel that feature should be pursued at the language level, consider writing a separate RFC for it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This RFCs would enable a third-party library to implement a type like ThinBox that stores the metadata in the heap allocation and keeps the Box-like pointer thin.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We have in the end 2 kinds of DSTs one which I refereed to as composed DST's which have any number of non DST fields and a tailing DST fields and what I will refer to as fundamental DSTs which are currently [T] and dyn Trait.

The problem is that without thin pointers to DSTs which allow access to non DST fields any library ThinBox implementation would need to either create a fat-pointer or use a lot of unsafe code to access non DST fields.

This makes it harder to implement all kind of custom DST types like e.g. slice with the length prefixing the slice or similar.
Mainly a lot of unsafe code is needed which with thin pointers to DST can be moved into the from_raw_parts implementation in core/std.

Also semantically the from_raw_parts functions(s) does take a thin pointer to an DST and meta data and creates a potential non-thin pointer. But instead we represent it as taking a untyped pointer and metadata making the functions more unsafe then they need to be IMHO (as you could put in a "wrong" thinned pointer.

While we seem to clearly disagree on this, thin pointers to DST are IMHO a fundamental part of the API around handling pointer metadata as they are the direct representation of fat-pointers without metadata.

But then a new RFC could also try to amend any non yet stabilized RFC so I guess putting this into a separate RFC is worth a try.

My goal is to long term make it possible to handle/write DST types without needing to know about all kinds of unsafe easy to get wrong without noticing it details like alignment fixing making custom DSTs available to any rust programmer.

I will see if I can find time to write a RFC for this in the next week or so.


/// Extract the metadata component of a pointer.
///
/// Values of type `*mut T`, `&T`, or `&mut T` can be passed directly to this function
/// as they implicitly coerce to `*const T`.
/// For example:
///
/// ```
/// assert_eq(std::ptr::metadata("foo"), 3_usize);
/// ```
///
/// Note that the data component of a (fat) pointer can be extracted by casting
/// to a (thin) pointer to any `Sized` type:
///
/// ```
/// # trait SomeTrait {}
/// # fn example(something: &SomeTrait) {
/// let object: &SomeTrait = something;
/// let data_ptr = object as *const SomeTrait as *const ();
/// # }
/// ```
pub fn metadata<T: ?Sized>(ptr: *const T) -> <T as Pointee>::Metadata {…}

impl<T: ?Sized> *const T {
pub fn from_raw_parts(data: *const (), meta: <T as Pointee>::Metadata) -> Self {…}
}

impl<T: ?Sized> *mut T {
pub fn from_raw_parts(data: *mut (), meta: <T as Pointee>::Metadata) -> Self {…}
}

/// The vtable for a trait object.
///
/// A vtable (virtual call table) represents all the necessary information
/// to manipulate the concrete type stored inside a trait object.
/// It notably it contains:
///
/// * type size
/// * type alignment
/// * a pointer to the type’s `drop_in_place` impl (may be a no-op for plain-old-data)
/// * pointers to all the methods for the type’s implementation of the trait
///
/// Note that the first three are special because they’re necessary to allocate, drop,
/// and deallocate any trait object.
///
/// The layout of vtables is still unspecified, so this type is one a more-type-safe
/// convenience for accessing those 3 special values. Note however that `VTable` does
/// not actually know the trait it’s associated with, indicating that, at very least,
/// the location of `size`, `align`, and `drop_in_place` is identical for all
/// trait object vtables in a single program.
pub struct VTable {
_priv: (),
}

impl VTable {
/// Returns the size of the type associated with this vtable.
pub fn size(&self) -> usize { ... }

/// Returns the alignment of the type associated with this vtable.
pub fn align(&self) -> usize { ... }

/// Returns the size and alignment together as a `Layout`
pub fn layout(&self) -> alloc::Layout {
unsafe {
alloc::Layout::from_size_align_unchecked(self.size(), self.align())
}
}

/// Drops the value pointed at by `data` assuming it has the type
/// associated with this vtable.
///
/// Behaviour is Undefined if the type isn’t correct or the pointer
/// is invalid to reinterpret as `&mut TheType`.
pub unsafe fn drop_in_place(&self, data: *mut ()) { ... }
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It may be unnecessary to have this method, since it’s the same as:

std::ptr::drop_in_place(<*mut _>::from_raw_parts(data, self))

(The size and align methods are different because they do not require a data pointer.)

}
```

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No drawbacks section...?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I came up short trying to think of a reason not to do this at all (as opposed to doing it differently). Suggestions welcome.


# Rationale and alternatives
[rationale-and-alternatives]: #rationale-and-alternatives

The status quo is that code (such as linked in [Motivation]) that requires this functionality
needs to transmute to and from `std::raw::TraitObject`
or a copy of it (to be compatible with Stable Rust).
Additionally, in cases where constructing the data pointer
requires knowing the alignment of the concrete type,
a dangling pointer such as `0x8000_0000_usize as *mut ()` needs to be created.
It is not clear whether `std::mem::align_of(&*ptr)` with `ptr: *const dyn SomeTrait`
is Undefined Behavior with a dangling data pointer.

A [previous iteration][2579] of this RFC proposed a `DynTrait`
that would only be implemented for trait objects like `dyn SomeTrait`.
There would be no `Metadata` associated type, `&'static VTable` was hard-coded in the trait.
In addition to being more general
and (hopefully) more compatible with future custom DSTs proposals,
this RFC resolves the question of what happens
if trait objects with super-fat pointers with multiple vtable pointers are ever added.
(Answer: they can use a different metadata type like `[&'static VTable; N]`.)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should then [&'static VTable; 1] for dyn SomeTrait be used to make that transition smoother and to fit better with const generics?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This would make some sense if we were definitely gonna have super-fat pointers with multiple separate vtable pointers as fat pointer metadata. But if we don’t and end up with a different solution to upcasting, we’ll end up with a always-single-item arrays for no reason. This isn’t really the thread to get into that discussion, but my opinion is that super-fat pointer have a significant enough size cost that I’d much prefer a different solution.

Perhaps an alternative for this RFC, more neutral with respect super-fat pointers v.s. not, would be to have type Metadata = VTable<Self>; for trait objects. (See other comments about VTable’s possible type paramater.) With the pointer/reference indirection hidden away in private fields of the VTable type, this design would be compatible with having VTable<dyn A + B> contain two pointers in the future.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, is there a use case for generic code that accepts any trait object with any number of vtable pointer but not other kinds of DSTs?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are we doing the proposals in the right order? Shouldn't we focus on dealing with dyn A + B + C, upcasting, and such things first? Also, cc #2035.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe the design proposed here is compatible enough with various options for multi-traits trait objects and upcasting such that there isn’t a strong dependency, and we don’t need to block this RFC on everything else being settled.


`VTable` could be made generic with a type parameter for the trait object type that it describes.
This would avoid forcing that the size, alignment, and destruction pointers
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How would that avoid forcing this? Can you elaborate?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Without a type parameter, x.size() with x: &'static VTable necessarily executes the same code for any vtable. With a type parameter, x: &'static VTable<dyn Foo> and x: &'static VTable<dyn Bar> are different types and could execute different code. (For example, do table lookup with different offsets.) However, keeping the offset of size the same within all vtables might be desirable regardless of this API.

be in the same location (offset) for every vtables.
SimonSapin marked this conversation as resolved.
Show resolved Hide resolved
But keeping them in the same location is probaly desirable anyway to keep code size
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missing the end of a sentence?


[2579]: https://github.com/rust-lang/rfcs/pull/2579


# Prior art
[prior-art]: #prior-art

A previous [Custom Dynamically-Sized Types][cdst] RFC was postponed.
[Internals thread #6663][6663] took the same ideas
and was even more ambitious in being very general.
Except for `VTable`’s methods, this RFC proposes a subset of what that thread did.

[cdst]: https://github.com/rust-lang/rfcs/pull/1524
[6663]: https://internals.rust-lang.org/t/pre-erfc-lets-fix-dsts/6663


# Unresolved questions
[unresolved-questions]: #unresolved-questions

* The name of `Pointee`. [Internals thread #6663][6663] used `Referent`.

* The location of `VTable`. Is another module more appropriate than `std::ptr`?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

and should it be called Dictionary instead? ("type class dictionary")

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Big -1 to calling it Dictionary since this typically means a key-value map (example: C#, Swift, Python).

Furthermore, here in Rust the VTable is implemented as an array of function pointers, not a HashMap (unlike e.g. Python where it is really implemented as a dict), so calling it Dictionary obscures the alleged complexity.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Even if the implementation happened to use HashMap, I’d prefer VTable since it’s more descriptive of the role of this type. (As opposed to: dictionary of what?) I believe that vtable is a well-enough established term of art.


* Should `VTable` be an extern type?
Rather than a zero-size struct? Actual vtables vary in size,
but `VTable` presumably is never used exept behind `&'static`.

* The name of `Thin`.
This name is short and sweet but `T: Thin` suggests that `T` itself is thin,
rather than pointers and references to `T`.

* The location of `Thin`. Better in `std::marker`?

* Should `Thin` be added as a supertrait of `Sized`?
Or could it ever make sense to have fat pointers to statically-sized types?

* Are there other generic standard library APIs like `ptr::null()`
that have an (implicit) `T: Sized` bound that unneccesarily excludes extern types?