Performance of Wasm tail calls is lacking on aarch64
compared to x86
#9690
Labels
bug
Incorrect behavior in the current implementation that needs fixing
In Wasmi's benchmark suite I have the following Wasm test case:
cc @alexcrichton
It is a simple fibonacci routines based on Wasm's
call_return
tail calls.When I ran those benchmarks on my Macbook M2 Pro I saw that Wasmi is roughly 10-15x slower than Wasmtime on
aarch64
usually. However, for this particular test-case it is just ~4x slower than Wasmtime. Back then I found this suspicious which is why I didn't mention this in the article I wrote about Wasmi.After having had a short discussion with @alexcrichton he told me to open an issue since this kind of performance gap is considered a bug for Wasmtime maintainers.
Feel free to clone Wasmi benchmarks and test it out on your own hardware. Unfortunately I only have a Macbook M2 Pro and nothing else, so I cannot rerun those benchmarks on different hardware for this issue.
Benchmarks from my machine:
The text was updated successfully, but these errors were encountered: