Embedding MicroHs

Posted on August 30, 2025 by Thomas Mahler

Abstract

This post shows how to use Lennart Augustsson’s MicroHs as an execution backend for a small combinator compiler and how to embed the MicroHs compiler and runtime into GHC-built programs.

It covers generating MicroHs‑compatible combinator expressions, emitting valid out.comb format and executing the result with the MicroHs runtime. Benchmarks demonstrate substantial speedups over a self-made graph‑reduction engine. The results suggest possible further gains from bulk combinators and optimization of the mhseval-runtime.

The post also outlines contributions that expose MicroHs as an embeddable library and FFI wrapper, enabling compilation and execution from GHC programs and making embedded graph reduction practical in larger applications.

Introduction

Attentive readers of my blog will have noticed that I am a big fan of combinatory logic and graph reduction in the implementation of functional languages.

Some time ago, I became aware of Lennart Augustsson’s MicroHs project, which provides an alternative Haskell compiler that targets a runtime environment based on combinatory logic and graph reduction.

MicroHs is an awesome project for several reasons:

It demonstrates how a complex high-level language like Haskell can be translated and executed efficiently using bracket abstraction, Scott encoding, combinatory logic and a graph reduction runtime.
It outlines an approach for bootstrapping a Haskell compiler using only a C compiler as a prerequisite. This represents a significant step for bootstrapping the GHC compiler in the sense of the Bootstrappable project.
The entire codebase remains compact and “hackable”, making it an ideal textbook example for a functional language compiler.

While studying the MicroHs codebase I noticed that it uses compilation techniques, combinatory logic expressions and graph reduction mechanics that are quite close to the concepts that I presented in some of my previous blog posts on this topic. I was particularly impressed by the MicroHs graph-reduction runtime implemented in C.

So I came up with the idea of adjusting my toy compilation system to generate object code that can be executed with the MicroHs runtime system.

In this blog post I’ll explain what I did to achieve this goal. I’ll also explain briefly the two pull requests that I added to MicroHs, which allow to embed MicroHs into Haskell code copiled with GHC.

Using the MicroHs Runtime as a backend for my toy compiler

MicroHs is using a set of combinators that is quite close to those used in my toy language implementation.

The only difference I noticed was in the handling of conditional expressions. My toy compiler was using a dedicated IF combinator, whereas MicroHs is providing a much more generic and flexible system that makes clever use of combinators A and K.

This gave me confidence that it shouldn’t be too difficult to use the MicroHs runtime as a target to my compiler.

Getting rid of the IF combinator

As my toy language is just dealing with functions over integers I tried to keep thngs as easy as possible and thus modelled True as 1 and False as 0. So for example in the HhiReducer the equality test eql is defined as:

eql :: (Eq a, Num p) => a -> a -> p
eql n m = if n == m then 1 else 0

The IF-Combinator takes three arguments condition, thenExpr and elseExpr. The semantics is simple: if condition evaluates to 1, thenExpr is evaluated alse elseExpr:

CFun (\(CInt condition) -> CFun $ \thenExpr -> CFun $ \elseExpr -> 
    if condition == 1 
        then thenExpr 
        else elseExpr)

MicroHs is encoding boolean values quite differently:

True = A
False = K

Where A and K are Combinators defined as follows:

K x y = x 
A x y = y

So in this Encoding True and False can be used as selector functions to either pick the thenExpr or the elseExpr for evaluation.

We can apply this feature to eliminate the IF-combinator. We will achieve this by using a new function desugarIf before performing bracket abstraction. This function will desugar If-expressions to Scott encoded boolean applications. It will detect sourcecode patterns if condition thenExpr elseExpr and transforms it to: condition elseExpr thenExpr.

desugarIf :: Expr -> Expr
desugarIf (((Var "if" `App` condition) `App` thenExpr) `App` elseExpr) =
  (desugarIf condition `App` desugarIf elseExpr) `App` desugarIf thenExpr
desugarIf (App e1 e2) = App (desugarIf e1) (desugarIf e2)
desugarIf (Lam x e) = Lam x (desugarIf e)
desugarIf expr = expr  -- Var, Int remain unchanged

When condition evaluates to True (i.e. A) the second argument (thenExpr) will be selected. When conditionevaluates to False (i.e. K) the first argument (elseExpr) will be selected.

To make this work we will have to change all comparison functions to return Aand K, like in the following snippet from the HhiReducer:

eql :: (Eq a) => a -> a -> CExpr
eql n m = if n == m then trueCExpr else falseCExpr

-- | Helper functions for Scott-encoded booleans
trueCExpr :: CExpr
trueCExpr = link primitives (translate (Com A))

falseCExpr :: CExpr
falseCExpr = link primitives (translate (Com K))

Let’s have a closer look how this can be very handy when compiling conditional expressions to efficient code. Let’s illustrate this with an example of my improved toy compiler. Let’s say we have a very simple main-expression:

main :: Int
main = if (eql 0 1) 23 42

With the new desugaring this will compiled to the following combinator expression. Please note that thenExpr and elseExpr have been swapped by desugarIf:

EQL 0 1 42 23

Now let`s have a look at the combinator-reduction of this expression:

EQL 0 1 42 23 
K 42 23.       -- by reducing EQL 0 1 to K (representing 'False')
42             -- by reducing K x y to x

-- likewise for the 'True' case:
EQL 0 0 42 23
A 42 23.       -- by reducing EQL 0 0 to A (representing 'True')
23             -- by reducing A x y to y

producing MicroHs compatible combinator code from my toy compiler

After fixing the incompatibility in the handling of conditional expressions there is is only one task left: We’ll have to translate the combinator expressions generated by my compiler to a valid MicroHs combinator program. This is done in the MicroHsExp module.

The first step is to map from my CLTerm.CL-terms to MicroHs.Exp-terms. By looking at the type definitions we can see that the mapping will be straightforward:

-- CLTerm.CL:
data CL = 
    Com Combinator 
  | INT Integer 
  | CL :@ CL 
  deriving (Eq, Data)

-- MicroHs.Exp:
data Exp
  = Var Ident
  | App Exp Exp
  | Lam Ident Exp
  | Lit Lit
  deriving (Eq)

The Exp data type is used to store desugared λ-Terms as well as combinator terms. That is the reason why it allows free variables (Var) and λ-terms (Lam). But after running bracket abstraction over such a term it will only App and Lit constructors.

With this knowledge we can define a toMhsExp :: CL -> Exp function:

import CLTerm (CL(..))
import MicroHs.Exp (Exp(..))
import MicroHs.Expr (Lit(..))

toMhsExp :: CL -> Exp
toMhsExp (Com c) = Lit (LPrim (combToMhscomb c))
toMhsExp (INT i) = Lit (LInt (fromIntegral i))     -- LInt only works with Int
toMhsExp (t :@ u) = App (toMhsExp t) (toMhsExp u)

The interesting part here is that both integers and combinators are treated as Lit instances with specific constructors LInt and LPrim.

In order to be able to import MicroHs code I contributed a PR which exposes the MicroHs source code as a library in the MicroHs.cabal file. Now we can simply embed the MicroHs compiler (or parts of it) in any Haskell program by adding MicroHsas a dependency to our .cabal or package.yaml files.

Mapping the combinators and primops of my compiler to MicroHs can simply be achieved by show, only for a few arithmetic and comparison primops we need specific translations as MicroHs uses other names for them:

import CLTerm (Combinator(..))

combToMhscomb :: Combinator -> String
combToMhscomb ADD = "+"
combToMhscomb SUB = "-"
combToMhscomb MUL = "*"
combToMhscomb DIV = "/"
combToMhscomb REM = "rem"
combToMhscomb EQL = "=="
combToMhscomb GEQ = ">="
combToMhscomb c = show c

translating my combinator expressions into MHS out.comb format

The final step of producing a valid MicroHs out.comb file is even simpler. MicroHs defines a function toStringCMdl, which takes a list of definitions (i.e. all functions, expressions and CAFs defined in a haskell program) and an expression representing the main-expression of a Haskell program as input. The result is a tuple with the number of definitions, a list of all foreign export identifiers, and the program as a string. This last element of the tuple will contain the contain the actual combinator code, i.e. the contents of an out.combfile.

Even when compiling a program with many top-level definitions my toy compiler just returns the compiled main expression (with all calls to other toplevel definitions expanded to combinator code). As we we don’t have any top-level definitions left after compilation we just hand over an empty list

With this knowledge we can can compile our CL-term to a valid MicroHs program with very little effort:

import MicroHs.ExpPrint (toStringCMdl)

toMhsPrg :: CL -> String
toMhsPrg cl = 
  let
    definitions = [] 
    (n, exps, prg) = toStringCMdl (definitions, toMhsExp cl)
   in prg

A first test drive

Now it’s time for a first test drive of our solution:

factorial :: String
factorial = [r| 
  fact = y(\f n. if (eql n 0) 1 (* n (f (- n 1))))
  main = fact 10
|]

main :: IO ()
main = do
  let source = factorial
      env = parseEnvironment source
      expr' = compileEta env.  -- compileEta is a good default for dense combinator code
      prg = toMhsPrg expr'     -- use MicroHs.ExpPrint.toStringCMdl to produce MicroHs code    

  putStrLn $ "Factorial compiled to combinator expression:\n" ++ show expr'
  putStrLn $ "The resulting MicroHs program: \n" ++ prg
  writeFile "out.comb" prg.    -- out.comb is the default file name for code executed by mhseval

ghci> main
Factorial compiled to combinator expression: 
Y(R 1(B C(B(S(C EQL 0))(B(S MUL)(R(C SUB 1) B))))) 10
The resulting MicroHs program: 
v8.2
0
Y R #1 @B C @B S C == @#0 @@@B S * @@R C - @#1 @@B @@@@@@#10 @ }

Please note that Int literals are encoded with a leading #in MicroHs code format, like #0, #1and #10 in the program above.

Now we use the MicroHs evaluator mhseval to run this program written to the file out.comb:

bash> mhseval 
#3628800

I think this is quite impressive: MicroHs knows all the combinators my toy compiler is emmitting, including the Y-combinator and the reduction works in the expected way and produces the correct result!

So using the MicroHs evaluator as a runtime environment for combinator code generated by other compilers seems quite feasible!

Using an FFI Wrapper to call `mhseval` from Haskell

My intial idea was to use the benchmark suite presented in my earlier posts to find out how MicroHs compares to my toy implementations.

As a first attempt I encapsulated the generation of MicroHs code and calling mhseval like follows:

microHsevalTest :: CL -> IO String
microHsevalTest expr = do
  let prg = toMhsPrg expr
  readProcess "mhseval" [] prg

This worked, but due to the overhead caused by executing mhseval as an external process I did not see any performance gains.

So I came up with a new plan: let’s write an FFI wrapper around mhseval to avoid spawning external processes.

I won’t go into the details of this wrapper. The good news is that the respective PR was accepted and is now part of the official MicroHs codebase. If you are interested you can study the code of the C-wrapper here: mhseval.h and mhseval.c. The Haskell wrapper is in MhsEval.hs.

import MicroHsExp (toMhsPrg)
import MhsEval (withMhsContext, eval, run)

main :: IO ()
main = do
  let source = factorial
  let env = parseEnvironment source
  let expr = compileEta env
  putStrLn $ "Factorial compiled to combinator expression:\n" ++ show expr

  let prg = toMhsPrg expr
  putStrLn $ "The resulting MicroHs program: \n" ++ prg
  
  result <- withMhsContext $ \ctx ->
    eval ctx prg
  putStrLn $ "Result: " ++ result

The resulting output does not bring any surprises:

ghci> main
Factorial compiled to combinator expression: 
Y(R 1(B C(B(S(C EQL 0))(B(S MUL)(R(C SUB 1) B))))) 10
The resulting MicroHs program: 
v8.2
0
Y R #1 @B C @B S C == @#0 @@@B S * @@R C - @#1 @@B @@@@@@#10 @ }
Result: #3628800

The function withMhsContext :: (MhsContext -> IO a) -> IO a executes an action (like eval or run) with a MicroHs context. It initializes a context, runs the action, and cleans up afterwards. It is useful for one-off evaluations without needing to manage the context manually.

The function eval :: MhsContext -> MhsCombCode -> IO String takes a string containing MicroHs combinator code, evaluates it, and returns the result as a string. If evaluation fails, it throws an MhsEvalError. The type MhsCombCode is just an alias for String. This currently works properly only for results of type Int.

The function run :: MhsContext -> MhsCombCode -> IO () takes a string containing MicroHs combinator code, and runs it without returning any result.

In a scenario like a performance benchmark it is not a good idea to create a new context in the tight benchmark loop. For such use cases I have also provided functions for explicitely managing the context: createMhsContext :: IO MhsContext and closeMhsContext :: MhsContext -> IO ().

Benchmarking MhsEval against my toy runtime.

In my last blog post on this matter I focussed on comparing different bracket abstraction algorithms. This time I will have a closer look to execution speed of graph-reduction based approaches versus GHC compiled code.

I will consider three execution runtimes in four scenarios:

The HHI-Reducer. This is the fastest of my graph-reduction implementations. Compilation with compileEta.
The MicroHs Runtime mhseval. Compilation with compileEta.
The GHC Runtime. With GHC compiled Haskell code
The MicroHs Runtime mhseval. With MicroHs compiled Haskell code.

As I am focussing on backend performance I will not vary the compilation algorithm. I will use the compileEta algorithm (introduced in the above mentioned post) which will produce the most compact combinator code for standard combinators. (As of now MicroHs does not support Bulk-Combinators which would allow even more compact code.)

I added the fourth option (execution time for mhs compiled code on the mhs runtime) upon a request in the reddit discussion on this blog post. Having this benchmark will show whether the MHS compiler produces combinator object code that is better optimized than the output of my compileEta algorithm. This is also useful as a sanity check for my toy compiler.

I will benchmark execution of the following programs of my toy language:

fibonacci

fibonacci = [r| 
  fib  = y(λf n. if (leq n 1) 1 (+ (f (- n 1)) (f (- n 2))))
  main = fib 37
|]

compiled by compileEta to:

Y(R 1(B C(B(S(C LEQ 1))(S(B S(B(B ADD)(R(C SUB 1) B)))(R(C SUB 2) B))))) 37

ackermann

ackermann = [r|
  ack  = y(λf n m. if (eql n 0) (+ m 1) (if (eql m 0) (f (- n 1) 1) (f (- n 1) (f n (- m 1)))))
  main = ack 3 9
|]

compiled to:

Y(B(R(C ADD 1))(B(B S)(B(S(B B(R 0 EQL)))(S(B S(B(B C)(B(B(S(C EQL 0)))(S(B S(B(B B)(R(R 1 SUB) B)))(B(R(C SUB 1))(B B))))))(B(R 1)(R(R 1 SUB) B)))))) 3 9

tak

tak = [r| 
  tak  = y(λf x y z. (if (geq y x) z (f (f (- x 1) y z) (f (- y 1) z x) (f (- z 1) x y ))))
  main = tak 18 6 3
|]

compiled to:

Y(B(B(R I))(B(B(B S))(B(S(B S(B(B B)(C GEQ))))(S(B S(B(B S)(B(B(B S))(S(B S(B(B S)(B(B(B S))(S(B B(B B B))(R(R 1 SUB) B)))))(B C(B(B C)(R(R 1 SUB) B)))))))(B(B C)(B C(R(C SUB 1) B))))))) 18 6 3

In order to compare these programs against GHC I’m using the following Haskell equivalents for GHC. These implementeations will also be used to benchmark the performance of Haskell programs compiled with MicroHs:

fib  = fix (\f n -> 
  if n <= 1 
    then 1 
    else f (n-1) + f (n - 2))

ack  = fix (\f n m ->
  if n == 0
    then m + 1
    else (if m == 0
      then f (n-1) 1
      else f (n-1) (f n (m-1))))

tak  = fix (\f x y z -> 
  if y >= x 
    then z 
    else f (f (x-1) y z) (f (y-1) z x) (f (z-1) x y ))

Benchmarking these programs with the Criterion micro-benchmarking suite yields the following results:

program	HHI-Reducer	MicroHs	Haskell native	MHS native
fib 37	8.641 s	6.296 s	727.3 ms	6.135 s
ackermann 3 9	4.170 s	1.575 s	105.6 ms	1.239 s
tak 18 6 3	1.225 ms	1.101 ms	42.01 μs	858.8 μs

Interpretation of the results

The numbers show a clear hierarchy: MicroHs consistently outperforms the baseline HHI-Reducer, but native GHC code remains an order of magnitude faster. There is some variation between the different programs. For example, MicroHs can handle the ack-function significantly better (factor 2.6) than the HHI-Reducer. For the tak-function on the other hand the factor is just 1.1. A complete overview is given in the following table:

factor in direct comparison	fib	ack	tak
MicroHs vs HHI-Reducer	1.4	2.6	1.1
GHC native vs MicroHs	8.7	14.9	26.2
MicroHs native vs MicroHs Execute	1.03	1.27	1.28

Before doing this benchmarking exercise I had two expectations:

MicroHs will outperform my toy reducers.
GHC will be about ten times faster then MicroHs.

My first expection was met, however for a 3-argument function like tak MicroHs is only 10% faster.

The second expectation was clearly not met for the tak-function! In my post on optimizing bracket abstraction I have demonstrated how the code size of classic abstraction algorithms will grow quadratically with the number of variables of a function. In that post I have also shown how advanced approaches like Kiselyovs Bulk-Combinators can significantly improve this behaviour.

Evaluating whether bulk combinators can speed up MicroHs is a promising next step.

For the. fib 37 program, the compiler already emits quite compact code:

Y(R 1(B C(B(S(C LEQ 1))(S(B S(B(B ADD)(R(C SUB 1) B)))(R(C SUB 2) B))))) 37

Even so, the GHC version runs about 8.7× faster. For programs like this, increasing mhseval’s reductions per second would be a beneficial undertaking!

The third row in this table compares the mhseval execution time for the haskell equivalents compiled with MHS versus my toy language programs compiled with compileEta.

This comparison shows that the MicroHs compiled programs run up to 28% faster. This indicates that the MicroHs compiler produces combinator object code that is better optimized than the compileEta algorithm. This is not surprising as compileEtais not the best abstraction algorithms according to my previous research. I am using it here because it produces the code that is compatible with the mhseval runtime.

Using the FFI wrapper to compile and execute Haskell programs

In this blog post, I have so far only used the parts of MiscroHs that deal either directly with the generation of Combinator code or with the execution of Combinator code.

But my two pull requests allow to embed MicroHs in GHC compiled Haskell programs in a much more complete way:

Let’s assume we have a file Example.hs with the following code:

module Example where

fac :: Int -> Int
fac 0 = 1
fac n = n * fac(n - 1)

main :: IO ()
main = do
  putStrLn "computing some factorials"
  print $ map fac [0..10]

Now let’s use MicroHs to compile and execute this code from some arbitrary GHC compiled Haskell program:

import           MhsEval (withMhsContext, eval, run)
import qualified MicroHs.Main as MHS (main)
import           System.Process (withArgs)

main :: IO ()
main = do
  -- use microHs to compile 'Example.hs' to 'out.comb'
  withArgs ["Example.hs"] MHS.main
  -- read the program 'out.comb' into a string
  prg' <- readFile "out.comb"
  -- use the MicroHs runtime to execute the program
  withMhsContext $ \ctx ->
    run ctx prg'

The output of this program in GHCI looks like follows:

ghci> main
computing some factorials
[1,1,2,6,24,120,720,5040,40320,362880,3628800]

Ok, this works as expected but it feels a bit clumsy to send the compiler output to a file and then read in that file to be able to execute it. In order to improve this sketchy solution I integrated the MhsEval wrapper more tightly into the mhs compiler by implementing the mhs -r option also for GHC based compiles. With this goody the compile and execute cycle can be unified in a single command, as shown in the following snippet:

import qualified MicroHs.Main as MHS (main)
import           System.Process (withArgs)

main :: IO ()
main = do
  -- use MicroHs to compile AND execute the 'Example.hs' program
  withArgs ["-r", "Example.hs"] MHS.main

Conclusion

In this post, I’ve demonstrated how to successfully integrate MicroHs as a backend for my toy combinator compiler, showcasing the elegance and power of combinatory logic as a compilation target. By adapting my compiler to emit MicroHs-compatible code, I was able to leverage the robust MicroHs graph-reduction runtime as an execution backend.

The performance benchmarks reveal that MicroHs consistently outperforms my toy runtime implementations, with speedups ranging from 10% (for tak) to 2.6x (for Ackermann). There is still a huge gap to GHC compiled code. It could be promising to study whether techniques like Bulk-Combinators could improve execution speed of MicroHs. In addition to this substantial optimizations to the mhseval runtime will be needed to reach the performance of GHC compiled programs.

The post also highlighted the remarkable achievement of the MicroHs project itself: a complete, bootstrappable Haskell compiler that fits in a compact, hackable codebase while demonstrating sophisticated compilation techniques. The presented FFI wrapper makes it possible to embed the complete MicroHs compiler and runtime within any GHC-compiled Haskell program. This opens up interesting possibilities for embedded haskell scripting and runtime code generation.

For anyone interested in compiler implementation, functional programming foundations, or combinatory logic, MicroHs provides an excellent playground for experimentation. The ability to embed it within larger Haskell applications makes it particularly valuable for research and teaching purposes.

Appendix: my earlier posts on combinatory logic and graph-reduction

In λ-Calculus, Combinatory Logic and Cartesian Closed Categories I explained the basic concepts of compiling λ-Calculus based languages to combinatory logic expressions. This post also explores the deep correspondence between simply‑typed λ‑calculus, combinatory logic, and cartesian closed categories (CCC).
In Implementing a Functional Language with Graph Reduction I’m describing a minimal functional language implemented in Haskell using three core components: (1) a parser for untyped λ‑calculus, (2) a compiler translating λ‑terms into a fixed combinator set (S,K,I,B,C,Y) via bracket abstraction with basic optimizations, and (3) a graph‑reduction engine using in‑place mutable STRef nodes to implement combinator rewrite rules efficiently
Evaluating SKI Combinators as Native Haskell Functions This follow‑up provides an alternative to the graph reduction backend with direct evaluation of combinators as first‑class Haskell functions. The result is a significantly simpler and more compact implementation that achieves performance improvements by factors of 10 to 100 over the previous graph‑reduction approach
Optimizing Bracket Abstraction for Combinator Reduction Building on the last two posts, this article focuses on improving the bracket‑abstraction phase. I’m implementing several different optimizations of bracket abstraction introduced by Kiselyov. In particular I perform benchmarks to study their impact on execution speed.