Spark: Inconsistent performance number in scaling number of cores
Theoretical limitations I assume you are familiar Amdahl’s law but here is a quick reminder. Theoretical speedup is defined as followed : where : s – is the speedup of the parallel part. p – is fraction of the program that can be parallelized. In practice theoretical speedup is always limited by the part that … Read more