Improved contextual type inference errors with bidirectional type checking #4661

yoshi-monster · 2025-06-03T20:23:08Z

yoshi-monster
Jun 3, 2025

Background

During a discussion on Discord, we noticed two places where type error messages could potentially be improved (#4639, #4640). While looking at the compiler code trying to get an idea how I would implement this, I thought that it would be best instead to change how contextual information is handled during the entire type inference process, not just special-casing these 2 situations.

Right now, expressions are type-checked bottom-up; the untyped ast is traversed in a depth-first order, and nodes are recursively typed. The high-level algorithm for this looks like this:

type all nested/inner expression
unify the nested expressions with type information from the expression itself

as an example, when typechecking a function call, the arguments are typed first, before they are unified with their expected argument types. From this, the return type is inferred and returned to the called. The caller then (after the call has finished type-checking) unfies the result with its expected type. If those do not match, the entire call expression is marked as a type error.

Proposal

Instead, I propose passing known type information about an expression down to the infer function for tha expression; Most expreessions are composed of more nested expressions of different shapes. Instead of first producing a typed child epxression which then gets unified with that result, the individual inference functions could directly unify their result with all known information, providing more targeted type error messages regardless of code structure.

The new high-level overview of the algorithm would look like this:

collect all known type constraints from the parent expression and based on the expression shape
pass this context down to nested/inner expressions, type-checking them
the result is successfully typed

Current state

The compiler does something similar already in exactly one situation:

When inferring anonymous functions, the surrounding type context is passed down the tree to allow anonymous functions to access record and tuple fields without needing annotations. This proposal is basically to do something similar for all other expressions, too.

Examples

Here I've tried to simulate in my head what this might look like; Of course I haven't run this code, so maybe it doesn't actually work that way! I expect people to be quite opinionated here. I think a good result of this might be to try it in certain situations (for example case expressions) first and slowly expand that system over time if the resulting errors are indeed better.

1. `let` bindings

currently, the compiler first fully type-checks the expression on the right-hand side, and then tries to unify it with the type annotation on the left. If it doesn't match, the entire right-hand side is marked as a type error.

let list: List(Int) = list.try_map(["1.0", "2.0", "3.0"], float.parse)

Current Compiler Error

error: Type mismatch
   ┌─ /src/main.gleam:10:5
   │  
10 │ ╭     ["1.0", "2.0", "3.0"]
11 │ │     |> list.try_map(fn(str) { float.parse(str) })
12 │ │     |> result.unwrap([])
   │ ╰────────────────────────^

Expected type:

    List(Int)

Found type:

    List(Float)

New Compiler Error

error: Type mismatch
   ┌─ /src/main.gleam:11:51
   │
11 │     |> list.try_map(fn(str) { float.parse(str) })
   │                               ^^^^^^^^^^^^^^^^

The type of this returned value doesn't match the return type
annotation of this function.

Expected type:

    Result(Int, Nil)

Found type:

    Result(Float, Nil)

What's happening here? Let's first desugar the pipe to regular function calls. We can also directly add the types of functions defined in other modules here, so I will also do that.

Definitely too detailed explanation of the inference steps

  let list: List(Int) =
    result.unwrap(             // fn(Result(a, b), a) -> a
      list.try_map(            // fn(List(c), fn(c) -> Result(d, e)) -> Result(List(d), e)
        ["1.0", "2.0", "3.0"], // List(String)
        fn(str) { float.parse(str) }
      ),
      []                       // List(_)
  )

In the new algorithm, we pass type information down, so we immediately know what type result.unwrap needs to return, and therefore what a needs to be equal to:

  let list: List(Int) =
    result.unwrap(    // fn(Result(List(Int), b), List(Int)) -> List(Int)
      list.try_map(   // fn(List(c), fn(c) -> Result(d, e)) -> Result(List(d), e)
        ["1.0", "2.0", "3.0"],
        fn(str) { float.parse(str) }
      ),
      []
  )

The algorithm works recursively: we can now know that the list.try_map call has to return Result(List(Int), b), and we can pass this information down. Again, the when inferring the call, we know that this type has to unify with the return type of list.try_map and has to match Result(List(d), e). From this, d is inferred to be Int and e is inferred to be b:

  let list: List(Int) =
    result.unwrap(   // fn(Result(List(Int), b), List(Int)) -> List(Int)
      list.try_map(  // fn(List(c), fn(c) -> Result(Int, b)) -> Result(List(Int), b)
        ["1.0", "2.0", "3.0"],
        fn(str) { float.parse(str) }
      ),
      []
  )

we can immediately infer the type of the constant list as List(String). arguments are still unified from left-to-right, so nothing special happens here. c is equal to String. Now when inferring the anonymous function, we already know that the try_map argument has to be of type fn(String) -> Result(Int, b). Again, Instead of inferring the child expression and unifying, we pass that information down: str has to unify with String, and the function body (and return value) has to unify with Result(Int, b). float.parse is a function of type fn(String) -> Result(Float, Nil), so before even type-checking the arguments, the inference process would stop here, since the result types cannot be unified: The constraints we gathered up to this point are conflicting.

2. generic values

fn wibble() -> Result(Int, Nil) {
  Ok(5.0)
}

Current compiler error

error: Type mismatch
  ┌─ /src/main.gleam:8:3
  │
8 │   Ok("hello")
  │   ^^^^^^^^^^^

The type of this returned value doesn't match the return type
annotation of this function.

Expected type:

    Result(Int, Nil)

Found type:

    Result(String, a)

New compiler error

error: Type mismatch
  ┌─ /src/main.gleam:8:3
  │
8 │   Ok("hello")
  │      ^^^^^^^

Expected type:

    Int

Found type:

    String

I do not know what to do with the hint in that case.

case expressions

In a case expression, if all branches match except for one, only that specific branch (and even the specific expression inside that branch) can be annotated with the error.

fn wibble() -> Result(String, Nil) {
  case Ok(True) {
    Ok(_) -> Ok(2)
    Error(_) -> Error(Nil)
  }
}

Current Error Message

error: Type mismatch
   ┌─ /src/main.gleam:8:3
   │  
 8 │ ╭   case Ok(True) {
 9 │ │     Ok(_) -> Ok(2)
10 │ │     Error(_) -> Error(Nil)
11 │ │   }
   │ ╰───^

The type of this returned value doesn't match the return type
annotation of this function.

Expected type:

    Result(String, Nil)

Found type:

    Result(Int, Nil)

New Error Message

error: Type mismatch
   ┌─ /src/main.gleam:10:20
   │
 9 │     Ok(_) -> Ok(2)
   │                 ^

Expected type:

    String

Found type:

    Int

`use` Expressions

In use expressions, the return type of the function can be used to add constraints to the return type of the middleware function, in turn constraining the callback function type. Passing down these constraints means that use expressions:

pub fn wibble() -> Result(String, Nil) {
  use n <- result.try(float.parse("2.0"))
  Ok(n)
}

Current Error Message

error: Type mismatch
  ┌─ /src/main.gleam:8:3
  │
8 │   use n <- result.try(float.parse("2.0"))
  │   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The type of this returned value doesn't match the return type
annotation of this function.

Expected type:

    Result(String, Nil)

Found type:

    Result(Float, Nil)

New Error Message

error: Type mismatch
  ┌─ /src/main.gleam:8:3
  │
9 │   Ok(n)
  │      ^

Expected type:

    Int

Found type:

    Float

lpil · 2025-06-06T11:07:52Z

lpil
Jun 6, 2025
Maintainer

Very cool! Hayleigh shared this paper on bidirectional type checking here: https://www.cl.cam.ac.uk/~nk480/bidir.pdf

This would be more or less a full rewrite of the analyser, which seems like a very daunting bit of work.

I like the improved case expression message. I'd be interested in what impact this could have on the use error messages, as they can be our least clear currently.

0 replies

llakala · 2025-06-06T12:33:27Z

llakala
Jun 6, 2025

This would be incredible. Count me in as the biggest supporter of this idea.

As Louis says, this would be a substantial rewrite. While special-casing situations isn't perfect, I think having temporary logic for the worst offenders would be a big improvement over the status quo. Are there any incremental changes that can be put in place for syntax like use, to result in errors more like the ones shown here?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Improved contextual type inference errors with bidirectional type checking #4661

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

Uh oh!

Improved contextual type inference errors with bidirectional type checking #4661

Uh oh!

yoshi-monster Jun 3, 2025

Background

Proposal

Current state

Examples

1. let bindings

2. generic values

case expressions

use Expressions

Replies: 2 comments

Uh oh!

lpil Jun 6, 2025 Maintainer

Uh oh!

Uh oh!

llakala Jun 6, 2025

yoshi-monster
Jun 3, 2025

1. `let` bindings

`use` Expressions

lpil
Jun 6, 2025
Maintainer

llakala
Jun 6, 2025