Typechecker #53

yc2454 · 2021-07-29T18:27:02Z

Untested type checker. Might need a lot more debugging.

Pending tasks:

testing the type checker
create an environment to store the variables etc.

j-hui

I know you still have yet to complete this PR with the type context/environment, but a couple of nits:

If you don't need an argument/pattern match discriminee, "name" it _ to indicate that it isn't needed. For instance, case something Just _ -> True ; Nothing -> False.
If you find yourself manually reimplementing a basic list pattern like map or fold to iterate through it, there's probably already a library function to do that for you.
When constructing messages for your Left TypeErrors, try to include additional information (are there any relevant variable names? what is the context in which you encountered the type error, e.g., while checking a Fork statement or checking a BOp?)

ssm/SSM/Core/TypeCheck.hs

Rewbert · 2021-08-02T08:20:22Z

I can give some general tips if it's of interest:

A nice monad transformer stack for type checking:

import Control.Monad.Reader  -- Reader for context
import Control.Monad.Except  -- Except for error handling (synonymous with manually throwing around Either e a, as you are doing now)

type TC a = ReaderT Context (Except TypeError) a

runTC :: TC a -> Either TypeError a
runTC tca = runExcept $ runReaderT tca context
  where
    context = -- some initial context containing the procedure signatures --

I like using Reader instead of State for type checking because we then don't need to worry about restoring the state once we leave a scope. The reader monad takes care of that with local.

We need some type of errors:

data TypeError = UnboundVariable Ident   -- ^ variable @Ident@ is not ins cope
               | TypeError SSMExp t1 t2  -- ^ The expression failed to type check, found t1 when t2 was expected
               -- more variants as needed

When you type check a language like this, you need to (as you point out yourself) maintain a context that gives you access to the types of any identifiers in scope. This context really consists of two parts - that which does not change (function signatures) and that which is dynamic (identifiers in scope).

import qualified Data.Map as Map

data Context = Context { procedures :: Map Ident Type  -- ^ types of procedures
                       , scopes     :: [Map Ident Type]  -- ^ types of variables (head of this list is the youngest scope)
                       }

lookupProcedure :: Ident -> TC Type
lookupProcedure id = do
  e <- ask
  case Map.lookup id (procedures e) of
    Just t -> return t
    Nothing -> throwError $ UnboundVariable id

{- | Look up the type of a variable. As our language has scopes (e.g if-branches & while loops), we must
dig through all the scopes to search for a type for the variable. We allow shadowing variables, so we start
looking for the type in the youngest scope and work our way towards the oldest. If no scope contains the variable,
it is clearly unbound and we encountered an error. -}
lookupVar :: Ident -> TC Type
lookupVar id = do
  e <- ask
  lookupVar' id (scopes e)
  where
    lookupVar' :: Ident -> [Map Ident Type] -> TC Type
    lookupVar' id [] = throwError $ UnboundVariable id
    lookupVar' id (c:cs) = case Map.lookup id c of
      Just t -> return t
      Nothing -> lookupVar' id xs

We also need some way of extending this environment with additional type information. We also want to be very clear with the lifetime of this variable, so we include a computation that the variable should remain alive for.

withVar :: Ident -> Type -> TC a -> TC a
withVar id t tca = local (\c -> c { withVar' id t (scopes e) }) tca
  where
    withVar' :: Ident -> Type -> [Map Ident Type] -> [Map Ident Type]
    withVar' id t (c:cs) = Map.insert id t c : cs

Now when we are type checking a procedure body, e.g, I would type check NewRef like this:

assertType :: SSMExp -> Type -> Type -> TC ()
assertType e t1 t2 =
  if t1 == t2
    then return ()
    else throwError $ TypeError e t1 t2

stmts :: [Stmt] -> TC ()
stmts (x:xs) = case x of
  NewRef id t e -> do
    t' <- checkExp e -- typecheck e
    assertType e t' t -- check that the type of the expression is the one we expect it to be (t)
    withVar id t' $ stmts xs -- here we say to call @stmts xs@ with the extended environment
    -- but here, after that call returned, the environment is just as it was before! :)

The final crux is scoping. If we are type checking a while-statement, the body of the while should be evaluated in a new scope. Any local variables must be removed from the typing context when we leave the while. local seems like just the thing!

-- | perform a computation with a new scope in the context. Context is restored once the computation terminates.
withNewScope :: TC a -> TC a
withNewScope tca = local (\c -> c { scopes = Map.empty : (scopes s) }) tca

stmts :: [Stmt] -> TC ()
stmts (x:xs) = case x of
  If c thn els -> do
    t <- checkExp c
    assertType c t TBool
    withNewScope $ stmts thn
    withNewScope $ stmts els

If I were to write the type checker I think I would take an approach like this one. It is possible to use State instead of Reader, but it comes with some extra care needed to make sure the state is restored after scoped computations, etc. Maybe there are some issues in my code above (I only wrote it here, I didn't type check it), but it should be mostly correct I hope.

I think writing a type checker for a language like ours (no polymorphic types etc, which makes checking quite straightforward) is a good exercise to get comfortable with monads. It was for me when I took a course in programming language technology :)

Yalu Cai added 4 commits July 21, 2021 16:13

starting files

fe0d6a4

the backbone is formulated

b890059

prototype done

2873007

prototype add

5febc3f

yc2454 requested review from Rewbert and j-hui July 29, 2021 18:27

using Either monad

7c33d5c

j-hui requested changes Aug 2, 2021

View reviewed changes

Yalu Cai and others added 21 commits August 5, 2021 13:36

saving work

10609bc

prototype done

c4d8389

added environment

dffb84d

added environment

3915ced

better error messages

87fdbb6

save cabal file for merging

8748c72

merge

0290503

matching the new syntax

6cddd6c

put ref into the env

20dce4b

Merge remote-tracking branch 'origin/master' into typechecker

cd866fd

commented out

009738d

uncommented

8adbea7

ssm time

32a44e6

ssm time

cf3fc2a

adding progs

c89e305

adding progs

4c7c95b

adding progs

6d2ea26

adding progs

e152d69

adding progs

7169bb6

adding progs

eb801a1

adding progs

952f770

Yalu Cai added 30 commits September 15, 2021 13:44

unbound var test

82c664e

unbound var test

b9b7668

arg length

053a0a0

arg length

5f17103

mis type lit

95b97ef

mis type lit

6b23214

print passing msg

cf23e8e

back to before

03476d3

empty line

d80a8b3

wrong proc name

9c04e16

Spec at end

ea81504

Spec at end

c35ee36

Spec at end

e03fc20

three more test cases

124eaf8

correct specs

da89554

merge master

dbfc51b

merging

2a5f05e

merging

732c2e0

merging

724e5d5

merging

6bd8e8f

merging

2f54c9a

merging

701bab0

merging

4379407

merging

c877e5f

typechecking with peripheral

ceff638

change imports for wrong specs

7ac8f4b

change imports type

3a191ee

changed to new definition

aaad5a7

merging typecheck into code

89b36ab

added typechecker to compile

16a8cb6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Typechecker #53

Typechecker #53

yc2454 commented Jul 29, 2021

j-hui left a comment

Rewbert commented Aug 2, 2021 •

edited

Loading

Typechecker #53

Are you sure you want to change the base?

Typechecker #53

Conversation

yc2454 commented Jul 29, 2021

j-hui left a comment

Choose a reason for hiding this comment

Rewbert commented Aug 2, 2021 • edited Loading

Rewbert commented Aug 2, 2021 •

edited

Loading