My guess is that for some simple benchmarks call/1 (without meta_predicate) it is taking more than the benchmark itself (of that even a small consumption of heap during the ntimes loop is causing some unexpected GC pause). I’ve sent you another PR that adds the meta_predicate decl in the ciao.pl file.
I’ll try to run locally and try to reproduce the results locally.