Ownership in the type system

Ben Clifford

London Haskell, June 2018

benc@hawaga.org.uk

What is Rust?

closures
traits / typeclasses
parametric polymorphism
concurrency
pattern matching
single-assignment variables
nice macros


void f() {
  char *r = malloc(10);
  mutate(r);
  free(r);
}


do
  r <- openFile "foo" WriteMode
  hPutStrLn r "hello world"
  hClose r

use-after-free


void f() {
  char *r = malloc(10);
  free(r);
  mutate(r);
}


do
  r <- openFile "foo" WriteMode
  hClose r
  hPutStrLn r "hello world"

resource leak


void f() {
  char *r = malloc(10);
  mutate(r);
}


do
  r <- openFile "foo" WriteMode
  hPutStrLn r "hello world"


let r = FooConstructor
    in process r


withFile "myfile.txt" WriteMode $
  \r -> hPutStrLn r "hello world"


void f()
{
  int x;
  mutate(&x);
}


do
  r' <- withFile "myfile.txt" WriteMode $
          \r -> return r
  hPutStrLn r' "hello world"  -- r' has already been closed by withFile


int * f()
{
  int x;
  return &x;
}


int f(int *r)
{
  return 7;
}


f :: (Handle, Handle) -> IO ()
f (_,b) = hClose b


void f() {
  char *r = malloc(10);
  mutate(r);
  free(r);
}


do
  r <- openFile "foo" WriteMode
  hPutStrLn r "hello world"
  hClose r

That's usually combined with being able to duplicate references. We've passed a reference into mutate or hPutStrLn, and those have done some work and then forgotten the reference, but that's fine, because we've also kept a copy for ourselves to use to free/close later. Those two everyday operations, forgetting and duplicating, are part of what makes it quite hard to reason about what's going on. Controlling these inherently dangerous operations is a big part of Rust's resource management story.


fn main()
{
  let r = vec![1, 2, 3];
  process(&r);
}


fn main()
{ 
  let r = vec![1, 2, 3];
  process(&r);
  drop(r);
}


fn main()
{
  let r = vec![1, 2, 3];
  process(&r);
  drop(r);
  process(&r);
}

5 |   drop(r);
  |        - value moved here
6 |   process(&r);
  |            ^ value used here after move

OK, so here's some buggy code, with a use-after-free bug. Rust rejects this! What has happened here is that I've transferred ownership of that vector: instead of being owned by r in this current scope, it has been handed over to be owned by drop, and so it can't also be owned by r any more. So we have this novelty that a variable can effectively go out of scope early, depending on how it is used. It turns out that the implementation of drop just releases all the resources and can then safely forget about the value.
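To make the "handed over to drop" idea concrete, here is a minimal sketch. The names my_drop and Noisy are made up for illustration; my_drop mirrors the shape of the standard library's drop (a generic function with an empty body that takes its argument by value), and Noisy is a hypothetical resource that records its own release in a flag.

```rust
use std::cell::Cell;

// Hypothetical stand-in for the standard library's drop: it takes its
// argument by value, so the caller gives up ownership, and the value is
// released when `_x` goes out of scope at the end of the empty body.
fn my_drop<T>(_x: T) {}

// Hypothetical resource that records its own release in a shared flag.
struct Noisy<'a>(&'a Cell<bool>);

impl Drop for Noisy<'_> {
    fn drop(&mut self) {
        self.0.set(true);
    }
}

fn main() {
    let released = Cell::new(false);
    let r = Noisy(&released);
    println!("released before my_drop: {}", released.get());
    my_drop(r); // ownership of `r` moves into my_drop
    println!("released after my_drop: {}", released.get());
    // let again = r; // would not compile: use of moved value `r`
}
```

Nothing in my_drop releases anything explicitly: the release is a consequence of the owner going out of scope.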


fn main()
{
  let r = vec![1, 2, 3];
  process(&r);
  process(&r);
}

fn process(s : &Vec<i32>) {
  println!("vector size: {}", (*s).len());
}

$ ./a 
vector size: 3
vector size: 3


fn main()
{
  let r = vec![1, 2, 3];
  process(&r);
  process(&r);
}

fn process(s : &Vec<i32>) {
  println!("vector size: {}", s.len());
  drop(*s)
}

error[E0507]: cannot move out of borrowed content
  --> a.rs:10:8
   |
10 |   drop(*s)
   |        ^^ cannot move out of borrowed content

What happens if I try to release the vector inside the process function, trying to introduce a different use-after-free bug? As we might hope, we get a compiler error: "cannot move out of borrowed content". What does that mean? When we call drop, we transfer ownership of the value to drop. But inside "process" we don't actually own s, so it isn't ours to give away. We've just "borrowed" it from the caller, and part of that calling contract is that at the end, we have to give it back. We can't release it; we can't transfer the ownership to someone else. We can, however, allow another function call to borrow it from us, deeper and deeper: we know statically that the next level of function will give it back, because it's only borrowing, and so we know that at the end of process, we'll be able to give it back.
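The "deeper and deeper" borrowing can be sketched like this. The helper total is hypothetical, and process returns a value here only so the result can be checked; otherwise it follows the earlier slides.

```rust
// Hypothetical helper: borrows the vector one level deeper.
fn total(s: &Vec<i32>) -> i32 {
    s.iter().sum()
}

fn process(s: &Vec<i32>) -> i32 {
    // Lend our borrow onwards; `total` only borrows, so it has to give
    // the vector back when it returns.
    let sum = total(s);
    println!("vector size: {}, total: {}", s.len(), sum);
    sum
}

fn main() {
    let r = vec![1, 2, 3];
    process(&r);
    process(&r); // still valid: `r` was only ever lent out
}
```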


fn main()
{
  let r = vec![1, 2, 3];
  process(r);
  process(r);
}

fn process(s : Vec<i32>) {
  println!("vector size: {}", s.len());
} // release happens here

error[E0382]: use of moved value: `r`
 --> a.rs:5:11
  |
4 |   process(r);
  |           - value moved here
5 |   process(r);
  |           ^ value used here after move
  |
  = note: move occurs because `r` has type `std::vec::Vec<i32>`,
  which does not implement the `Copy` trait
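If process really does need to take ownership, there are a couple of ways to still call it twice. This is a sketch under assumptions: process_and_return is a hypothetical variant that hands ownership back, and process returns the length here only so it can be checked.

```rust
// Hypothetical variant: takes ownership, then hands it back to the caller.
fn process_and_return(s: Vec<i32>) -> Vec<i32> {
    println!("vector size: {}", s.len());
    s
}

// Takes ownership like process on the slide; the vector is released
// when `s` goes out of scope at the end of the function.
fn process(s: Vec<i32>) -> usize {
    s.len()
}

fn main() {
    let r = vec![1, 2, 3];
    let r = process_and_return(r); // ownership goes in and comes back out
    let n = process(r.clone()); // an explicit duplicate is moved; `r` stays ours
    println!("vector size: {}", n);
    process(r); // final use: `r` is moved and released
}
```

The duplication is explicit (clone), which is the point: copies of a non-Copy value never happen silently.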


fn main()
{
  let r = 10;
  process(r);
  process(r);
}

fn process(s : i32) {
  println!("integer is: {}", s);
}

$ ./b 
integer is: 10
integer is: 10
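The integer version works because i32 implements the Copy trait. A user-defined type can opt in as well, as long as all its fields are Copy. The Point type below is a made-up example, not something from the talk.

```rust
// Hypothetical plain-data type: deriving Copy means passing it by value
// duplicates it instead of moving it.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Point {
    x: i32,
    y: i32,
}

fn process(p: Point) -> i32 {
    p.x + p.y
}

fn main() {
    let r = Point { x: 3, y: 7 };
    println!("sum is: {}", process(r));
    println!("sum is: {}", process(r)); // fine: `r` was copied, not moved
}
```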


fn main()
{
  let r = vec![1, 2, 3];
  let q = vec![1, 2, 3, 4];
  let l = longest(&q, &r);
  process(l);
}

fn longest(s : &Vec<i32>, t : &Vec<i32>) -> &Vec<i32> {
  if (*s).len() > (*t).len() {
    return s;
  } else {
    return t;
  }
}

error[E0106]: missing lifetime specifier
 --> c.rs:9:45
  |
9 | fn longest(s : &Vec<i32>, t : &Vec<i32>) -> &Vec<i32> {
  |                       expected lifetime parameter ^
  |
  = help: this function's return type contains
    a borrowed value, but the signature does not say
    whether it is borrowed from `s` or `t`


fn main()
{
  let r = vec![1, 2, 3];
  let q = vec![1, 2, 3, 4];
  let l = longest(&q, &r);
  process(l);
}

fn longest <'a> (s : &'a Vec<i32>, t : &'a Vec<i32>)
     -> &'a Vec<i32> {
  if (*s).len() > (*t).len() {
    return s;
  } else {
    return t;
  }
}
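A sketch of what the lifetime annotation buys us. The function first_of is hypothetical: its result is borrowed only from its first argument, so it uses two lifetime parameters, and the second vector is allowed to disappear while the result is still in use.

```rust
fn longest<'a>(s: &'a Vec<i32>, t: &'a Vec<i32>) -> &'a Vec<i32> {
    if s.len() > t.len() { s } else { t }
}

// Hypothetical variant: the result is only ever borrowed from `s`,
// so it does not depend on how long `t` lives.
fn first_of<'a, 'b>(s: &'a Vec<i32>, _t: &'b Vec<i32>) -> &'a Vec<i32> {
    s
}

fn main() {
    let r = vec![1, 2, 3];
    let l;
    {
        let q = vec![1, 2, 3, 4];
        println!("longest has size: {}", longest(&r, &q).len());
        l = first_of(&r, &q); // fine: the result is tied to `r` only
        // l = longest(&r, &q); // would not compile: `q` does not live long enough
    } // `q` is released here
    println!("vector size: {}", l.len());
}
```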


use std::rc::Rc;

fn main()
{
  let r = Rc::new(vec![1, 2, 3]);
  let q = r.clone();
  process(&r);
  process(&q);
  drop(r);
  process(&q);
  drop(q);
}

fn process(s : &Vec<i32>) {
  println!("vector size: {}", (*s).len());
}
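To watch the shared ownership at work, the owner count can be printed along the way. This is a small sketch using Rc::strong_count from the standard library; process returns the length here only so it can be checked.

```rust
use std::rc::Rc;

fn process(s: &Vec<i32>) -> usize {
    s.len()
}

fn main() {
    let r = Rc::new(vec![1, 2, 3]);
    println!("owners: {}", Rc::strong_count(&r));
    let q = Rc::clone(&r); // duplicates the reference, not the vector
    println!("owners: {}", Rc::strong_count(&r));
    drop(r); // one owner gone, the vector is still alive
    println!("owners: {}", Rc::strong_count(&q));
    println!("vector size: {}", process(&q));
    drop(q); // last owner gone: the vector is released here
}
```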


use std::sync::Mutex;

fn main()
{
  let r = Mutex::new(vec![1, 2, 3]);
  {
    let r1 = &r.lock().unwrap();
    process(r1);
  } // first lock released here
  {
    let r2 = &r.lock().unwrap();
    process(r2);
  } // second lock released here
}
}
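The lock is itself an owned resource: it is held for exactly as long as the guard returned by lock() is alive. A sketch, where is_free is a hypothetical helper built on the standard library's try_lock:

```rust
use std::sync::Mutex;

// Hypothetical helper: reports whether the mutex could be locked right now.
fn is_free(m: &Mutex<Vec<i32>>) -> bool {
    m.try_lock().is_ok()
}

fn main() {
    let r = Mutex::new(vec![1, 2, 3]);
    {
        let mut guard = r.lock().unwrap();
        guard.push(4); // mutation goes through the guard
        println!("free while the guard is alive? {}", is_free(&r));
    } // guard goes out of scope here, which releases the lock
    println!("free after the block? {}", is_free(&r));
    println!("vector size: {}", r.lock().unwrap().len());
}
```

There is no unlock call to forget: releasing the lock is just the guard being dropped.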

accessing serialised data without copying it
sockets with type level state
pure bindings to impure APIs
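As a taste of the second item, here is a rough sketch of type-level state. Everything in it is hypothetical (the Socket type, its states and its methods), and no real networking happens: the point is only that send exists solely on a connected socket, and that connect and close consume the old value so it can't be reused.

```rust
use std::marker::PhantomData;

// Hypothetical state markers; they exist only at the type level.
struct Closed;
struct Connected;

// Hypothetical socket: just bookkeeping, no real I/O.
struct Socket<State> {
    addr: String,
    bytes_sent: usize,
    _state: PhantomData<State>,
}

impl Socket<Closed> {
    fn new(addr: &str) -> Self {
        Socket { addr: addr.to_string(), bytes_sent: 0, _state: PhantomData }
    }

    // Consumes the closed socket, so the old value cannot be used again.
    fn connect(self) -> Socket<Connected> {
        Socket { addr: self.addr, bytes_sent: self.bytes_sent, _state: PhantomData }
    }
}

impl Socket<Connected> {
    // Pretends to send; returns the running total of bytes "sent".
    fn send(&mut self, msg: &str) -> usize {
        self.bytes_sent += msg.len();
        self.bytes_sent
    }

    fn close(self) -> Socket<Closed> {
        Socket { addr: self.addr, bytes_sent: self.bytes_sent, _state: PhantomData }
    }
}

fn main() {
    let s = Socket::<Closed>::new("example.org:80");
    // s.send("hello"); // would not compile: no `send` on Socket<Closed>
    let mut s = s.connect();
    let n = s.send("hello");
    println!("sent {} bytes to {}", n, s.addr);
    let s = s.close();
    println!("closed connection to {}", s.addr);
}
```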