Speed Tests

When I learn a new programming language, I always implement the Münchausen numbers problem in the given language. The problem is simple but it includes a lot of computations, thus it gives an idea of the execution speed of a language.

Münchausen numbers

A Münchausen number is a number equal to the sum of its digits raised to each digit's power.

For instance, 3435 is a Münchausen number because 3³+4⁴+3³+5⁵ = 3435.

0⁰ is not well-defined, thus we'll consider 0⁰=0. In this case there are four Münchausen numbers: 0, 1, 3435, and 438579088.

Exercise

Write a program that finds all the Münchausen numbers. We know that the largest Münchausen number is less than 440 million.

Updates

Dates are in yyyy-month format.

2025-July: F# was added.

2025-April: Python 3 with Rust removed. Common LISP updated. C3 added.

Implementations

In the implementations I tried to use the same (simple) algorithm in order to make the comparisons as fair as possible.

All the tests were run on my home desktop machine (Intel Core i7-4771 CPU @ 3.50GHz with 8 CPU cores) using Manjaro Linux. Execution times are wall-clock times and they are measured with hyperfine (warmup runs: 1, benchmarked runs: 2).

The following implementations were received in the form of pull requests:

Clojure, Common LISP, Crystal, D, F#, FASM, Forth, Fortran, Haskell, JavaScript, Lua, Mojo, NASM, OCaml, Pascal, Perl, PHP, Python 3 with Numba, Racket, Ruby, Scala 3, Scheme, Swift, Toit, V, Zig

Thanks for the contributions!

If you know how to make something faster, let me know!

Languages are listed in alphabetical order.

The size of the EXE files can be further reduced with the command strip -s. If it's applicable, then the stripped EXE size is also shown in the table.

Below, you can find single-threaded implemetations. We also have some multi-threaded implementations, see here.

C

gcc (GCC) 13.2.1 20230801
clang version 16.0.6
Benchmark date: 2024-02-05 [yyyy-mm-dd]

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`gcc -O3 main.c -o main -lm`	3.893 ± 0.01	15,560	14,408
`gcc -O2 main.c -o main -lm`	3.892 ± 0.001	15,560	14,408
`clang -O3 main.c -o main -lm`	2.684 ± 0.013	15,528	14,416
`clang -O2 main.c -o main -lm`	2.672 ± 0.001	15,528	14,416

Notes:

No real difference between the switches -O2 and -O3. It's enough to use -O2.
clang is better in this case

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`g++ -O3 --std=c++2a main.cpp -o main`	3.865 ± 0.01	15,936	14,432
`g++ -O2 --std=c++2a main.cpp -o main`	3.849 ± 0.012	15,936	14,432
`clang++ -O3 --std=c++2a main.cpp -o main`	2.913 ± 0.01	15,904	14,440
`clang++ -O2 --std=c++2a main.cpp -o main`	2.827 ± 0.015	15,904	14,440

Execution	Runtime (sec)	compiled / transpiled output size (bytes)	--
`clj -M -m main`	5.631 ± 0.112	--	--
mkdir classes && java -cp `clj -Spath` main	5.339 ± 0.101	--	--

Execution	Runtime (sec)	--	--
`clisp -C main2.cl`	517.914 ± 1.032	--	--
`clisp -C main.cl`	322.324 ± 0.98	--	--
`sbcl --script main.cl`	7.277 ± 0.003	--	--
`sbcl --script main2.cl`	4.897 ± 0.007	--	--

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`dmd -release -O main.d`	9.987 ± 0.045	993,816	712,504
`ldc2 -release -O main.d`	3.089 ± 0.008	34,584	23,008

Execution	Runtime (sec)	compiled / transpiled output size (bytes)	--
`dart main.dart`	23.909 ± 0.581	--	--
`dart compile js main.dart -O2 -m -o main.js && node main.js`	10.509 ± 0.032	31,684	--
`dart compile exe main.dart -o main && ./main`	8.377 ± 0.009	5,925,856	--

Execution	Runtime (sec)	--	--
`elixir main.exs`	227.963 ± 0.543	--	--
`elixirc munchausen.ex && elixir caller.exs`	217.528 ± 0.762	--	--

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`# FASM x64, see v1 in Makefile`	15.792 ± 0.018	532	532
`# FASM x86, see v2 in Makefile`	15.207 ± 0.023	444	444

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`# using int, see v1 in Makefile`	4.122 ± 0.034	2,137,820	1,391,192
`# using uint and uint32, see v2 in Makefile`	3.5 ± 0.045	2,137,756	1,391,192

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`# basic, see v1 in Makefile`	93.816 ± 0.043	3,175,704	754,008
`# optimized, see v2 in Makefile`	3.517 ± 0.009	6,324,936	3,183,648

Execution	Runtime (sec)	--	--
`node main1.js`	17.789 ± 0.009	--	--
`node main2.js`	6.819 ± 0.001	--	--

Compilation	Runtime (sec)	--	--
`lua main1.lua`	112.412 ± 0.03	--	--
`luajit main1.lua`	16.854 ± 0.013	--	--
`luajit main2_goto.lua`	15.737 ± 0.007	--	--

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`nelua main.nelua --release -o main`	3.519 ± 0.02	15,704	14,432
`nelua main.nelua --release --cc=clang -o main`	3.215 ± 0.011	15,616	14,432

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`nim c -d:release main.nim`	3.773 ± 0.017	73,696	63,936
`nim c --cc:clang -d:release main.nim`	3.645 ± 0.014	57,440	47,608
`nim c --cc:clang -d:danger main.nim`	3.41 ± 0.021	42,808	35,152
`nim c -d:danger main.nim`	3.098 ± 0.022	54,808	47,328

Execution	Runtime (sec)	--	--
`perl main.pl`	494.71 ± 4.649	--	--
`perl -Minteger main.pl`	423.805 ± 2.471	--	--

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`# using Int, see v1 in Makefile`	3.844 ± 0.011	1,160,400	302,952
`# using UInt32, see v2 in Makefile`	3.125 ± 0.043	1,160,400	302,952

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`# NASM x86, see v2 in Makefile`	15.19 ± 0.012	9,228	8,428
`# NASM x64, see v1 in Makefile`	15.186 ± 0.034	9,656	8,552

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`# using int32, see v3 in Makefile`	3.728 ± 0.02	57,440	47,608
`# using int64, see v2 in Makefile`	3.644 ± 0.023	57,440	47,608
`# using int, see v1 in Makefile`	3.623 ± 0.001	57,440	47,608
`# using uint64, see v5 in Makefile`	3.427 ± 0.033	57,496	47,608
`# using uint32, see v4 in Makefile`	3.248 ± 0.026	57,496	47,608

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`# see v1 in Makefile`	17.391 ± 0.009	531,056	531,056
`# see v2 in Makefile`	5.828 ± 0.024	531,056	531,056

Execution	Runtime (sec)	--	--
`python3 main.py`	313.333 ± 8.03	--	--
`pypy3 main.py`	19.911 ± 0.054	--	--

Execution	Runtime (sec)	--	--
`racket main1.rkt`	107.486 ± 0.5	--	--
`racket main2.rkt`	43.847 ± 1.932	--	--

Execution	Runtime (sec)	--	--
`ruby main.rb`	199.632 ± 3.2	--	--
`ruby --jit main.rb`	75.863 ± 1.174	--	--

Execution	Runtime (sec)	EXE (bytes)	--
`guile -s main.scm`	148.423 ± 1.773	--	--
`chez --compile-imported-libraries --optimize-level 3 -q --script main.scm`	69.826 ± 0.387	--	--
`gambitc -:debug=pqQ0 -exe -cc-options '-O3' main.scm && ./main`	21.718 ± 0.229	9,098,392	--
`stalin -architecture amd64 -s -On -Ot -Ob -Om -Or -dC -dH -dP\ && ./main`	4.599 ± 0.017	25,472	--
`stalin -architecture amd64 -s -On -Ot -Ob -Om -Or -dC -dH -dP\ && ./main`	4.012 ± 0.014	25,512	--

Execution	Runtime (sec)	EXE (bytes)	--
`toit.run main.toit`	120.263 ± 0.069	--	--
`toit.compile -O2 -o main main.toit && ./main`	118.63 ± 0.774	1,254,784	1,254,784

Compilation	Runtime (sec)	EXE (bytes)	stripped EXE (bytes)
`v -prod main.v`	4.056 ± 0.004	209,392	187,728
`v -cc clang -prod main.v`	3.936 ± 0.018	212,720	191,736

README

Speed Tests

Münchausen numbers

Exercise

Updates

Implementations

C

C++

C#

C3

Clojure

Codon

Common LISP

Crystal

D

Dart

Elixir

F#

FASM

Forth

Fortran

Go

Haskell

Java

JavaScript

Julia

Kotlin

Lua

Mojo

NASM

Nelua

Nim Tests #1

Nim Tests #2

OCaml

Odin

Pascal

Perl

PHP

Python 3

Python 3 with mypyc

Python 3 with Nim

Python 3 with Numba

Racket

Ruby

Rust

Scala 3

Scheme

Swift

Toit

V

Zig