Improve reproducibility by documenting the steps needed to run the benchmarks and generate the plots. Also simplify plot generation a bit.