Add pointer specs needed to verify slices #1534

Lysxia · 2025-05-16T10:58:34Z

Stuff needed to verify swap on slices.

creusot-contracts/src/std/ptr.rs

Lysxia · 2025-05-16T11:05:29Z

creusot-contracts/src/std/ptr.rs

+    #[trusted]
+    #[requires(self.offset_logic(offset@) == own_offset.ptr())]
+    #[ensures(self.offset_logic(offset@) == result)]
+    unsafe fn add_own(self, offset: usize, own_offset: Ghost<&PtrOwn<T>>) -> Self {


The idea here, which may be wrong, is that the only way to prove self.offset_logic(offset) == own_offset.ptr() is if self actually comes from an allocation of an array of T, so all the pointers between self and self.add(offset) are in fact valid.

Well, we can imagine that we could compute offsets between fields of a struct, so arrays are not the only option here.

But I basically agree with you that this requirement should be enough. Basically, this correspond to assuming that the pointer type ghostly contains the identifier of the allocation: offset_logic would keep this provenance information, so that self and own_offsef would have the same provenance. So own_offset would guarantee that both pointers have the same block, and that this block has not been deallocated.

This matches the provenance semantics that Rust tries to adopt now.

But, in any case, this needs to be documented.

One more thing: we want to allow one-past-the-end pointers. <*const T>::add says this:

vec.as_ptr().add(vec.len()) (for vec: Vec<T>) is always safe.

I just ran into this while looking at split_at_unchecked.

We need a weaker form of evidence than PtrOwn to model this.

#[requires(self.offset_logic(offset@) == own_offset.ptr() || self.offset_logic(offset@-1) == own_offset.ptr())] ?

That is still too restrictive because you might not have access to the PtrOwn of the last element. In split_at_unchecked the add happens after the first slice is created, consuming all of its PtrOwn.

You're right. And more specifically, if we have a pointer to a zero-sized slice (this is a corner case, but this needs to be handled), then doing a 0-offset should be allowed, and my proposal forbids it.

So... it seems like the only remaining possibility is to add some informatin in the pointers.

For example, fn min_offset (p: *const T) -> Int and fn max_offset (p: *const T) -> Int, indicating the minimum and maximum value allowed for an offset. This is sound, because we can consider that these values are contained in the "origin" field of pointers. Some additional postconditions need to be added to offset_logic to tell how min_offset and max_offset are affected.

Alternatively, we could add logical functions fn chunck_begin_addr(p: *const T) -> Int and fn chunck_end_addr(p: *const T) -> Int, and then compare addresses (it's important to compare addresses and not pointers, because then we can leverage the support for arithmetics in SMT solvers).

The idea is the same, it's not clear to me what's the best variant. The advantage of chunck_begin_addr is that these stay constant when offseting, while the advantage of offset ranges is that we do not use the notion of addresses.

I'm experimenting with a SliceOwn token, a variant of PtrOwn, that contains information about allowed offsets. In particular we can create zero-sized SliceOwn that point one-past-the-end of an allocated slice.

I think that adding such information in the pointers themselves is not a solution because the range of possible offsets depends on if the allocated object has been freed:

it’s always UB to offset a pointer derived from something that is now deallocated, except if the offset is 0. --- https://doc.rust-lang.org/std/ptr/index.html#provenance

Does it makes sense to be able to convert &SliceOwn into &PtrOwn of each element, and conversely, to convert &PtrOwn into a singleton &SliceOwn? (And of course, without the & this won't be allowed.) It's certainly convenient to reuse PtrOwn's method to dereference pointers, but I wonder if there may be some extra information in a &PtrOwn that can't come from &SliceOwn.

How having a SliceOwn for a type T would be different to having a PtrOwn for [T]?

It really seems bad to me to have two different token types for owning a single slot and owning a slice. For example, how would we offset into an which is stored in a struct, of which we only have a PtrOwn (I assume that in your proposal, it is impossible to split these ownership tokens, because that would essentially revert to the original proposal)?

To synthesize my current view of things:

On the one hand, we need to be able to split ownership, both because it is much easier to specify reading/writing from a raw pointer, and because we can very much imagine that a large block of memory is split into smaller thunks, themselves used in very different part of a program (different threads...). Being able to split tokens is crucial for proving things like Bumpalo (I'm not saying that you should prove it, be not having the tools to do it seems like a red flag to me).

On the other hand, in order to guarantee that some offsets are allowed, and to guarantee that we never partially deallocate a block by giving back only some of its tokens, we need another token that says "this block is allocated". This other token could not be split and would not provide read/write access to anything, but a shared borrow of it would be needed to offset pointers and full ownership of it would be needed to deallocate memory.

creusot-contracts/src/std/ptr.rs

creusot-contracts/src/logic/seq.rs

creusot-contracts/src/std/ptr.rs

jhjourdan · 2025-05-19T11:49:39Z

creusot-contracts/src/std/ptr.rs

+    #[trusted]
+    #[logic]
+    #[open(self)]
+    fn offset_logic(self, offset: Int) -> RawPtr<T> {


Shouldn't we ensure something about the address of the pointer (i.e., the value when casted to usize).

creusot-contracts/src/std/ptr.rs

jhjourdan · 2025-05-19T13:04:43Z

creusot-contracts/src/std/ptr.rs

+    #[trusted]
+    #[requires(self.offset_logic(offset@) == own_offset.ptr())]
+    #[ensures(self.offset_logic(offset@) == result)]
+    unsafe fn add_own(self, offset: usize, own_offset: Ghost<&PtrOwn<T>>) -> Self {


Well, we can imagine that we could compute offsets between fields of a struct, so arrays are not the only option here.

But I basically agree with you that this requirement should be enough. Basically, this correspond to assuming that the pointer type ghostly contains the identifier of the allocation: offset_logic would keep this provenance information, so that self and own_offsef would have the same provenance. So own_offset would guarantee that both pointers have the same block, and that this block has not been deallocated.

This matches the provenance semantics that Rust tries to adopt now.

But, in any case, this needs to be documented.

jhjourdan · 2025-05-19T13:06:07Z

creusot-contracts/src/std/ptr.rs

+    // TODO: The offset in bytes, `count * size_of::<T>()`, must fit in an `isize`.
+    #[trusted]
+    #[requires(self.offset_logic(offset@) == own_offset.ptr())]
+    // #[ensures(result as RawPtr<T> == self.offset_logic(offset))] // TODO: cast *mut to RawPtr ?


Is this forbidden in Pearlite? We should allow these casts, and translate them by identity (using the Coerce construct).

Lysxia · 2025-05-21T12:44:53Z

TODO:

Handle size_of
Allow conversions in logic between *mut and *const
Finish the pointer specs and document their rationale

jhjourdan · 2025-05-24T12:26:44Z

creusot-contracts/src/std/ptr.rs

+    #[trusted]
+    #[logic]
+    #[open(self)]
+    #[ensures(self.addr_logic()@ + offset < usize::MAX@ ==> result.addr_logic()@ == self.addr_logic()@ + offset)]


This complicated post-condition let me think that, finally, addr_logic should return an Int, with a type invariant on pointer types stating that the logical address of pointers appearing in programs are between 0 and usize::MAX.

In addition, we should have an associativity lemma stating p.offset_logic(x).offset_logic(y) = p.offset_logic(x+y).

Lysxia added 2 commits May 16, 2025 12:55

Add pointer specs: add and slice as_ptr

2084ac9

update tests

a42ad41

Lysxia commented May 16, 2025

View reviewed changes

Lysxia added 2 commits May 16, 2025 16:47

Add Resolve for Seq

6c87e4a

fmt

781acee

Lysxia changed the title ~~Add pointer specs~~ Add pointer specs needed to verify swap on slices May 16, 2025

Lysxia added 2 commits May 16, 2025 16:52

Add extern_specs for ptr add_own

bf2fd36

update tests

dede862

Lysxia force-pushed the ptr-slice-specs branch from 1053d24 to dede862 Compare May 16, 2025 15:21

Lysxia commented May 16, 2025

View reviewed changes

creusot-contracts/src/std/ptr.rs Outdated Show resolved Hide resolved

Lysxia commented May 16, 2025

View reviewed changes

creusot-contracts/src/logic/seq.rs Show resolved Hide resolved

Lysxia added 2 commits May 19, 2025 11:20

Remove swap_disjoint

9da75c2

Update creusot-contracts test

7a0ea90

jhjourdan reviewed May 19, 2025

View reviewed changes

Lysxia added 2 commits May 21, 2025 14:41

address comments

6f67adb

Update tests

c9bb597

Lysxia added 6 commits May 21, 2025 15:04

Merge remote-tracking branch 'origin/master' into ptr-slice-specs

9b7901c

Merge remote-tracking branch 'origin/master' into ptr-slice-specs

6d3537a

Add requires on add_own signature

aa1a547

tests

ba1cc75

Experimental contract for offset_logic

95f83be

tests

1e4f99d

jhjourdan reviewed May 24, 2025

View reviewed changes

Lysxia added 5 commits June 10, 2025 10:30

Merge remote-tracking branch 'origin/master' into ptr-slice-specs

c279a3f

New specs for pointers to blocks

c65768e

wip

61f5502

Merge remote-tracking branch 'origin/master' into ptr-slice-specs

4b17004

Merge branch 'validate-fix' into ptr-slice-specs

121ff91

Lysxia force-pushed the ptr-slice-specs branch from cf52923 to 011aa04 Compare June 12, 2025 09:44

Lysxia added 13 commits June 12, 2025 11:44

wip specs

011aa04

wip specs

adf0dcb

rename BlockOwn to SliceOwn

32aadef

doc

e2fcebc

Update creusot-contracts tests

f72dd0d

doc

6be58dd

tests

a6c4401

wip

d535344

ord specs

78670c3

wip PtrOwn<[T]>

57c3cc5

update specs

c848fac

Merge remote-tracking branch 'origin/master' into ptr-slice-specs

8765ae8

Merge branch 'ptr-cast-vcgen' into ptr-slice-specs

ac3aa37

Lysxia force-pushed the ptr-slice-specs branch from f7b6780 to ac3aa37 Compare June 18, 2025 08:48

Lysxia added 9 commits June 18, 2025 11:20

Add offset_logic_assoc

36d022a

fix specs

81ca07d

Implement Copy for Ghost

e9fe89d

Updates

b0bd0b5

Add specs for unchecked arithmetic

f7d1841

Support *mut as *const and *const as *mut casts

18ab19e

wip Prep pass

787cdd8

Add from_ref and from_mut

d8ab746

extern specs for len of slice pointers

fbed23c

Lysxia changed the title ~~Add pointer specs needed to verify swap on slices~~ Add pointer specs needed to verify slices Jul 30, 2025

Add pointer specs needed to verify slices #1534

Are you sure you want to change the base?

Add pointer specs needed to verify slices #1534

Uh oh!

Conversation

Lysxia commented May 16, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Lysxia commented May 21, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!