Add recursive-type-unrestriction.md

2025-05-04 10:43:48 +01:00 · 2023-10-30 15:25:03 -05:00 · 2023-10-30 15:25:03 -05:00 · d31ed9665a
commit d31ed9665a
parent 2d619a8504
1 changed files with 222 additions and 0 deletions
--- a/docs/recursive-type-unrestriction.md
+++ b/docs/recursive-type-unrestriction.md
@ -0,0 +1,222 @@
 # Loosening the recursive type restriction
 ## Summary
 Luau supports recursive type aliases, but with an important
 restriction: users can declare functions of recursive types, but *not*
 recursive type functions. This has problems with sophisticated uses of
 types, for example ones which mix recursive types and nested generics.
 This RFC proposes loosening this restriction.
 ## Motivation
 Luau supports recursive type aliases, but with an important restriction:
 users can declare functions of recursive types, such as:
 ```lua
  type Tree<a> = { data: a, children: {Tree<a>} }
 ```
 but *not* recursive type functions, such as:
 ```lua
  type TreeWithMap<a> = { ..., map: <b>(a -> b) -> Tree<b> }
 ```
 These examples come up naturally in OO code bases with generic types, for example
 ```lua
 --!strict
 export type PromiseFor<T> = {
  andThen: <U>(
    self: PromiseFor<T>,
    onFulfilled: ((result: T) -> U)?,
    onRejected: ((err: string) -> U)?
  ) -> PromiseFor<U>,
  catch: <U>(
    self: PromiseFor<T>,
    onRejected: ((err: string) -> U)?
  ) -> PromiseFor<U>,
  finally: <U>(
    self: PromiseFor<T>,
    onResolvedOrRejected: ((wasFulfilled: boolean, resultOrErr: T | string) -> U)
  ) -> PromiseFor<U>,
 }
 ```
 as discussed at the [Roblox DevForum](https://devforum.roblox.com/t/regression-with-genericrecursively-defined-types/1616647).
 Examples like this are quite common in TypeScript code, for example in [ReduxJS](https://github.com/reduxjs/redux-thunk/blob/master/src/types.ts).
 ## Design
 The design is to continue to use strict type aliases, but to use a
 cache during type alias definition. Type aliases are handled as they
 currently are, except for recursive cases.
 When defining a type alias `T<a1,...,aN>`, if we encounter a recursive use
 `T<U1,...,UN>` we proceed as follows:
 * If every `UI` is `aI`, or if every `UI` does not contain any of the `aJ`s:
    * look `<U1,...,UN>` up in the cache,
    * if the cache lookup succeeds, return the cached result,
    * otherwise create a fresh type variable, add it to the cache, and return it.
 * Otherwise, produce a type error and return an error type.
 Once we are finished defining the type alias, iterate through the cache
 * for each entry with key `<U1,...,UN>` and value `X`, unify `X` with
  the result of expanding `T<U1,...,UN>`.
 * expanding `T<U1,...,UN>` may encounter a generic function,
  for example `<b>(S) -> R`, which needs a bit of care, since `a` may occur free in
  some of the `Ui`. We need to rename `b` to `c`, but this renaming may also
  introduce new cache entries, since any cache entry for `U` containing `c`
  needs a new cache entry for `U[c/b]`.
 For example, with type
 ```
  type TreeWithMap<a> = { data: a, children: {Tree<a>}, map: <b>(a -> b) -> Tree<b> }
 ```
 The type alias is `Z` where
 ```
  Z = { data: a, children: {Z}, map: <b>(a -> b) -> X }
 ```
 the cache contains
 ```
  <b> |-> X
 ```
 and after unification
 ```
  X = { data: b, children: {X}, map: <c>(b -> c) -> Y }
 ```
 which introduces a new cache entry
 ```
  <c> |-> Y
 ```
 and after unification
 ```
  Y = { data: b, children: {Y}, map: <b>(c -> b) -> X }
 ```
 This algorithm can be generalized to handle mutual recursion, by using a cache for each mutually recursive type.
 This design is the strictest one we came up with that copes with the examples we're interested in, in particular the `PromiseFor` example.
 ## Drawbacks
 Renaming can double the size of the type graph, with the result that nested type aliases can result in exponential blowup.
 This algorithm doesn't cope with examples which mix concrete and generic types
 ```
  type T<a, b> = { this: a, that; b, children : {T<number, b>} }
 ```
 This algorithm does not support type graphs with infinite expansions.
 ## Alternatives
 ### Lazy recursive type instantiations
 The most permissive change would be to make recursive type
 instantiation lazy rather than strict. In this approach `T<U>` would
 not be instantiated immediately, but only when the body is needed. In
 particular, during unification we can unify `T<U>` with `T<V>` by
 first trying to unify `U` and `V`, and only if that fails try to unify
 the instantiations.
 *Advantages*: this allows recursive types with infinite expansions like:
 ```lua
  type Foo<T> = { ..., promises: {Foo<Promise<T>>} }
 ```
 ### Lazy recursive type instantiations with a cache
 As above, but keep a cache for each type function.
 *Advantages*: reduces the size of the type graph.
 ### Strict recursive type instantiations with a cache
 Rather than lazily instantiating type functions when they are used, we
 could carry on instantiating them when they are defined, and use a
 cache to reuse them. In particular, the cache would be populated when the
 recursive types are defined, and used when types are used recursively.
 For example:
 ```
 type T<a,b> = { foo: T<b,number>? }
 ```
 would result in cache entries:
 ```
 T<a,b> = { foo: T<b,number>? }
 T<b,number> = { foo: T<number,number>? }
 T<number,number> = { foo: T<number,number>? }
 ```
 This can result in exponential blowup, for example:
 ```
 type T<a,b> = { foo: T<b,number>?, bar: T<b,string>? }
 ```
 would result in cache entries:
 ```
 T<a,b> = { foo: T<b,number>?, bar: T<b,string>? }
 T<b,number> = { foo: T<number,number>?, bar: T<string,number>? }
 T<b,string> = { foo: T<string,number>?, bar: T<string,string>? }
 T<number,number> = { foo: T<number,number>?, bar: T<number,string>? }
 T<number,string> = { foo: T<string,number>?, bar: T<string,string>? }
 T<string,number> = { foo: T<number,number>?, bar: T<number,string>? }
 T<string,string> = { foo: T<string,number>?, bar: T<string,string>? }
 ```
 Applying this to a type function with N type variables results in more than 2^N
 types. Because of blowup, we would need a bound on cache size.
 This can also result in the cache being exhausted, for example:
 ```
 type T<a> = { foo: T<Promise<a>>? }
 ```
 results in an infinite type graph with cache:
 ```
 T<a> = { foo: T<Promise<a>>? }
 T<Promise<a>> = { foo: T<Promise<Promise<a>>>? }
 T<Promise<Promise<a>>> = { foo: T<Promise<Promise<Promise<a>>>>? }
 ...
 ```
 *Advantages*: types are computed strictly, so we don't have to worry about lazy types
 producing unbounded type graphs during unification.
 ### Strict recursive type instantiations with a cache and an occurrence check
 We can use occurrence checks to ensure there's less blowup. We can restrict
 a recursive use `T<U1,...,UN>` in the definition of `T<a1...aN>` so that either `UI` is `aI`
 or contains none of `a1...aN`. For example this bans
 ```
 type T<a> = { foo: T<Promise<a>>? }
 ```
 since `Promise<a>` is not `a` but contains `a`, and bans
 ```
 type T<a,b> = { foo: T<b,number>? }
 ```
 since `a` is not `b`, but allows:
 ```
 type T<a,b> = { foo: T<a,number>? }
 ```
 This still has exponential blowup, for example
 ```lua
 type T<a1,a2,a3,...,aN> = {
  p1: T<number, a2, a3, ..., aN>,
  p2: T<a1, number, a3, ..., aN>,
  ...
  pN: T<a1, a2, a3, ..., number>,
 }
 ```
 *Advantages*: types are computed strictly, and may produce better error messages if the occurs check fails.