66

Eeek! GHCi found Skolems in my code!

...
Couldn't match type `k0' with `b'
  because type variable `b' would escape its scope
This (rigid, skolem) type variable is bound by
  the type signature for
    groupBy :: Ord b => (a -> b) -> Set a -> Set (b, [a])
The following variables have types that mention k0
...

What are they? What do they want with my program? And why are they trying to escape (the ungrateful little blighters)?

53

To start with, a "rigid" type variable in a context means a type variable bound by a quantifier outside that context, which thus can't be unified with other type variables.

This works a great deal like variables bound by a lambda: Given a lambda (\x -> ... ), from the "outside" you can apply it to whatever value you like, of course; but on the inside, you can't simply decide that the value of x should be some particular value. Picking a value for x inside the lambda should sound pretty silly, but that's what errors about "can't match blah blah, rigid type variable, blah blah" mean.

Note that, even without using explicit forall quantifiers, any top-level type signature has an implicit forall for each type variable mentioned.

Of course, that's not the error you're getting. What an "escaped type variable" means is even sillier--it's like having a lambda (\x -> ...) and trying to use specific values of x outside the lambda, independently of applying it to an argument. No, not applying the lambda to something and using the result value--I mean actually using the variable itself outside the scope where it's defined.

The reason this can happen with types (without seeming as obviously absurd as the example with a lambda) is because there are two notions of "type variables" floating around: During unification, you have "variables" representing undetermined types, which are then identified with other such variables via type inference. On the other hand, you have the quantified type variables described above which are specifically identified as ranging over possible types.

Consider the type of the lambda expression (\x -> x). Starting from a completely undetermined type a, we see it takes one argument and narrow that to a -> b, then we see that it must return something of the same type as its argument, so we narrow it further to a -> a. But now it works for any type a you might want, so we give it a quantifier (forall a. a -> a).

So, an escaped type variable occurs when you have a type bound by a quantifier that GHC infers should be unified with an undetermined type outside the scope of that quantifier.


So apparently I forgot to actually explain the term "skolem type variable" here, heh. As mentioned in comments, in our case it's essentially synonymous with "rigid type variable", so the above still explains the idea.

I'm not entirely sure where the term originated from, but I would guess it involves Skolem normal form and representing existential quantification in terms of universal, as is done in GHC. A skolem (or rigid) type variable is one that, within some scope, has an unknown-but-specific type for some reason--being part of a polymorphic type, coming from an existential data type, &c.

25

As I understand it, a "Skolem variable" is a variable which does not match any other variable, including itself.

This seems to pop up in Haskell when you use features like explicit foralls, GADTs, and other type system extensions.

For example, consider the following type:

data AnyWidget = forall x. Widget x => AnyWidget x

What this says is that you can take any type that implements the Widget class, and wrap it into an AnyWidget type. Now, suppose you try to unwrap this:

unwrap (AnyWidget w) = w

Um, no, you can't do that. Because, at compile-time, we have no idea what type w has, so there's no way to write a correct type signature for this. Here the type of w has "escaped" from AnyWidget, which is not allowed.

As I understand it, internally GHC gives w a type which is a Skolem variable, to represent the fact that it must not escape. (This is not the only such scenario; there's a couple of other places where a certain value cannot escape due to typing issues.)

  • 20
    I disagree with the "including itself". A Skolem variable (or perhaps better Skolem constant) represents an unknown fixed type during type inference. As such, a Skolem constant does match itself, as well as a (unification) variable, but it won't match any concrete type. Skolem constants indeed arise from existential bindings, usually. They're quite different from normal unification variables that arise from universal bindings and match any concrete type. – kosmikus Oct 4 '12 at 9:00
  • @kosmikus I am by no means an expert in the intricasies of how type inferrence actually works. Your explanation seems logically self-consistent, so... – MathematicalOrchid Oct 4 '12 at 9:28
  • 4
    kosmikus' point is demonstrated by how data AnyEq = forall a. Eq a => AE a permits the definition reflexive (AE x) = x == x. The call to == is valid because x is the same type as itself, even though we don't know what that type is. – Ben Millwood Oct 11 '12 at 16:41
  • Why can't the type for unwrap be "Widget x" in this case? – egdmitry May 30 '14 at 23:34
  • 2
    @egdmitry Widget x isn't a type - it's a type constraint. And if the type was unwrap :: Widget x => AnyWidget -> x then that would mean "given AnyWidget, I can hand you any possible widget you ask me for". Well, clearly this function can't actually do that; it can only hand you the widget type that was originally wrapped. But at compile-time, we don't know what that is. Hence, it cannot be typed. – MathematicalOrchid May 31 '14 at 19:32
10

The error message pops up when a type variable tries to escape its scope.

It took me a while to figure out this, so I'll write an example.

{-# LANGUAGE ExistentialQuantification #-}
data I a = I a deriving (Show)
data SomeI = forall a. MkSomeI (I a)

Then if we try to write a function

 unI (MkSomeI i) = i

GHC refuses to type-infer/type-check this function.


Why? Let's try to infer the type ourselves:

  • unI is a lambda definition, so it's type is x -> y for some types x and y.
  • MkSomeI has a type forall a. I a -> SomeI
    • MkSomeI i has a type SomeI
    • i on the LHS has a type I z for some type z. Because of forall quantifier, we had to introduce new (fresh) type variable. Note, that it's not universal, as it's bound inside (SomeI i) expression.
    • thus we can unify type variable x with SomeI, this is ok. So the unI should have type SomeI -> y.
  • i on the RHS thus have type I z too.
  • At this point unifier tries to unify y and I z, but it notices that z is introduced in the lower context. Thus it fails.

Otherwise the type for unI would have type forall z. SomeI -> I z, but the correct one is exists z. SomeI -> I z. Yet that one GHC cannot represent directly.


Similarly, we can see why

data AnyEq = forall a. Eq a => AE a
-- reflexive :: AnyEq -> Bool
reflexive (AE x) = x == x

works.

The (existential) variable inside AE x doesn't escape into outer scope, so everything is ok.


Also I encountered a "feature" in GHC 7.8.4 and 7.10.1 where RankNTypes on itself is ok, but adding GADTs triggers the error

{-# LANGUAGE RankNTypes #-}
{-# LANGUAGE GADTs #-}

example :: String -> I a -> String
example str x = withContext x s
  where
    s i = "Foo" ++ str

withContext :: I a -> (forall b. I b -> c) -> c
withContext x f = f x

So it might be nothing wrong with your code. It might be GHC, which cannot figure everything out consistently.

EDIT: The solution is to give a type to s :: forall a. I a -> String.

GADTs turn on MonoLocalBinds, which makes inferred type of s to have skolem variable, so the type is not forall a. I a -> String, but t -> String, were t gets bound in the wrong context. See: https://ghc.haskell.org/trac/ghc/ticket/10644

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service, privacy policy and cookie policy

Not the answer you're looking for? Browse other questions tagged or ask your own question.