Daniel J. Harvey - Refine, fine, fine

About a month ago I gave a talk about Refined types at a React meetup. Needless to say, it was a resounding success so I thought I would share an adapted version of the slides so that you can all learn to be as learned as me when it comes to such a topic.

Let’s start by listing some things that we as programmers generally agree we don’t particularly like:

Runtime errors caused by Javascript YOLO

Here is some classic code written in the Javascript programming language:

function makeBig(thing) {
  return thing.toUpperCase()
}

At a casual glance, it would appear to be some rather convoluted code for making a string uppercase.

Therefore, this seems fine:

makeBig('horse') // 'HORSE'

But this isn’t so great:

makeBig(12) // Error: toUpperCase is not defined

What happened? Well it wasn’t really a function for making strings uppercase, it was a function that takes any piece of data, then makes it uppercase if it’s a string, and just breaks weirdly with anything else.

Another thing we don’t really like doing as programmers is…

Overly defensive code around user input

So if your start in programming involved more than a sprinkling of PHP, then you’ll be used to starting all your functions with the manual typechecking dance.

function printName(name) {
  if (name && name.length > 0 && typeof name === 'string') then {
    return name
  } else {
    return "No name!"
}

Defense against the dark arts

Another favourite is manually checking our values to check basic mathematical operators aren’t going to explode the whole computer.

function divide(a, b) {
  if (b == 0 || isNaN(a) || isNaN(b)) then {
    return 0; // would have caused error
  } else {
    return a / b
  }
}

How far does something like Typescript get us?

So we can change our weird uppercasing function…

function makeBig(thing) {
  return thing.toUpperCase()
}

…to only take a string like we intended.

function makeBig(thing: string) {
  return thing.toUpperCase()
}

Let’s give it a smash:

makeBig('horse') // 'HORSE'

Excellent stuff.

And now, if we try and do some wild type stupidity, our code doesn’t even compile:

makeBig(12) 
// Argument of type '6' is not assignable to parameter of type 'string'.

What about this potentially malformed user input?

This Wild West Cowboy Javascript…

function printName(name) {
  if (name && name.length > 0 && typeof name === 'string') then {
    return name
  } else {
    return "No name!"
}

…gets a string type, which means we don’t have to check that name exists…

function printName(name: string) {
  if (name.length > 0) then {
    return name
  } else {
    return "No name!"
}

…but we still need to check whether name is long enough and return a default if not.

What about that classic divide by zero problem?

What can basic types give us here?

function divide(a, b) {
  if (b == 0 || isNaN(a) || isNaN(b)) then {
    return 0; // would have caused error
  } else {
    return a / b
  }
}

We can get rid of the number checks…

function divide(a: number, b: number) {
  if (b == 0) then {
    return 0; // would have caused error
  } else {
    return a / b
  }
}

…but we’ve still got to check for that zero value. Better, but not great. What if I told you we could do better than this?

Refined

Enter Refined types. A Refined type looks like this:

newtype Refined predicate value = Refined value

As it is a newtype it is a wrapper around a value that is used for type purposes at compile time but then erased at run time (so when the program runs, Refined 100 is just 100 as far as memory etc is concerned)

value is the type of actual data we are refining, for example Int or Number.

predicate is a type that lets us better describe the value.

The most interesting thing to note here is that predicate only exists on the type side (ie before the =) and not after - this makes it a phantom type which is only used to add contextual information. Let’s see what that actually means…

Making Refined values

There are a few ways to make Refined values, especially in the Haskell library - we’ll concentrate on two. I’m going to use the types from the Purescript version because a) they’re simpler and b) I made them and am thus less likely to get it wrong.

refine 
  :: value 
  -> Either RefinedError (Refined predicate value)

This is the regular way to make Refined value - you pass it a plain value and it returns either Left with a RefinedError describing the problem, or Right with the Refined value inside.

unsafeRefine 
  :: value 
  -> Refined predicate value

This ignores the predicate and leaves it to the programmer to go full YOLO and decide whether the predicate will be fine. I have used this to make Monoid classes where I want to add two positive numbers without checking that the outcome will still be positive.

Id

The most basic predicate is id, which doesn’t really do anything.

Refined Id Int

It’s named after the id (or identity) function - the function that returns whatever it receives, basically doing nothing.

identity :: x -> x
identity x = x

For example, any value that is a value Int can be made into a valid Refined Id Int.

id1 :: Either RefinedError (Refined Id Int)
id1 = refine 11233
-- id1 == Right (Refined 11233)

id2 :: Either RefinedError (Refined Id Int)
id2 = refine (-213123)
-- id2 == Right (Refined (-213123)

Positive

The Positive predicate, which only allows numbers over 0.

Refined Positive Int

This refinement would pass the predicate:

positive1 :: Either RefinedError (Refined Positive Int)
positive1 = refine 10
-- positive1 == Right (Refined 10)

This clearly very negative number clearly won’t fly. Nice try, ding dongs!

positive2 :: Either RefinedError (Refined Positive Int)
positive2 = refine (-10)
-- positive2 == Left (GreaterThanError 0 (-10))

From

We can be even more specific with these types too. The From predicate takes an integer and only allows values equal to or above it.

Refined (From D10) Int

(A note here - that D10 is a type-level 10. It is provided by the purescript-typelevel package.)

Therefore this 9 is clearly taking the piss and totally won’t refine.

from1 :: Either RefinedError (Refined (From D10) Int)
from1 = refine 9
-- from1 == Left (FromError 10 9)

However this 100 is cool with me, and will happily refine.

from2 :: Either RefinedError (Refined (From D10) Int)
from2 = refine 100
-- from2 == Right (Refined 100)

To

Hopefully it should be fairly intuitive how the To predicate works…

to1 :: Either RefinedError (Refined (To D20) Int)
to1 = refine 21
-- to1 == Left (ToError 20 21)

to2 :: Either RefinedError (Refined (To D20) Int)
to2 = refine 17
-- to2 == Right (Refined 17)

SizeEqualTo, SizeGreaterThan, SizeLessThan

Refinements don’t have to just be about numbers - we can use them on foldable structures too, such as Lists. The refinements let us be specific about sizes of said structure. Therefore we could make a non-empty List of Boolean values with Refined (SizeGreaterThan D0) (List Boolean).

Refined (SizeGreaterThan 3) (List Number)

Therefore this list does not refine…

size1 :: Refined RefinedError (Refined (SizeGreaterThan D3) (List Number))
size1 = refine [1, 2]
-- size1 == Left (SizeGreaterThanError 3 [1, 2])

…but this one is fine.

size2 :: Refined RefinedError (Refined (SizeGreaterThan D3) (List Number))
size2 = refine [1, 2, 3, 4]
-- size2 == Right (Refined [1, 2, 3, 4])

And, Or

These type signatures are starting to get pretty hefty, but we can do better than that - we’ve also got And and Or for combining them.

Let’s only allow whole numbers from 1 to 100…

Refined ((From D1) And (To D100)) Int

Or indeed, allow all whole numbers EXCEPT 1 to 100.

Refined ((To D0) Or (From D101)) Int

This type describes the roll of a dice.

type Dice = Refined ((From D1) And (To D6)) Int

Or this one, which describes the first bunch of prime numbers, and is all a bit silly to be honest.

type Prime 
  = Refined 
      (Or (Equal D2) 
        (Or (Equal D3) 
          (Or (Equal D5) 
            (Or (Equal D7) 
              (Or (Equal D11) 
                (Or (Equal D13) (Equal D17))
              )
            )
          )
        )
      ) Int

Back to our stupid contrived problems…

Now with the power of Refined types, our defensive printName function is pretty much unnecessary…

type Name = Refined (SizeFrom 1) String

printName :: Name -> String
printName name = unrefine name

Plus we can make a type to make division safe from fear, at last..

type Divide = Refined (Not (Equal 0)) Number

divide :: Number -> Divide -> Number
divide a b = a / (unrefine b)

Automatic JSON validation

So let’s say we have this data type using Refined…

type AlcoholUser
  = { name :: Refined (SizeFrom 1) String
    , age  :: Refined (From 18) Int
    }

…if we want to use it as an API request, sounds like a lot of work right? Maybe not! Because refined instances have fromJSON and toJSON instances for Aeson (or for Argonaut in Purescript) then we can automatically decode them from JSON and make the decoding fail if the predicate does not pass.

This way, anywhere in our app, name will always be non-empty. and age will always be 18 or more.

Well, shit.

Yep. For more details, check out the Refined Haskell library or indeed the purescript-refined library which I ported from the Haskell one.