Update ADACS: Scaling and Profiling notes authored by Conrad Chan
@@ -170,6 +170,8 @@ There are 2 interesting points to take away from this:
2. The barrier time (a measure of the dispersion of finishing times of the workers) starts quite high (20-40%) before decreasing to zero. This is the other reason for poor scaling. For the 256-core case, the job finishes before the dispersion drops to zero.
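
To make the barrier-time idea concrete, here is a minimal sketch of how such a fraction could be computed from per-worker finishing times. It assumes that every worker starts at the same moment and then idles until the slowest one finishes. The function name `barrier_fraction` and the sample times are hypothetical, and the profiler used for these notes may define the metric differently.

```python
def barrier_fraction(finish_times):
    """Fraction of total worker time spent waiting at the barrier.

    Assumption (not taken from the profiler): all workers start at t=0
    and each one idles from its own finish time until the slowest
    worker finishes.
    """
    if not finish_times:
        raise ValueError("need at least one finish time")
    slowest = max(finish_times)
    if slowest <= 0:
        raise ValueError("finish times must be positive")
    # Total idle time summed over workers.
    waiting = sum(slowest - t for t in finish_times)
    # Normalise by the total wall time available to all workers.
    return waiting / (slowest * len(finish_times))


# Hypothetical finish times in seconds for four workers.
print(barrier_fraction([6.0, 7.0, 8.0, 10.0]))    # widely dispersed
print(barrier_fraction([10.0, 10.0, 10.0, 10.0]))  # no dispersion
```

With widely dispersed finishing times the fraction is large, and it falls to zero once all workers finish together, which matches the behaviour described above.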
## Reduced Order Quadrature
* Using config [pbilby_roq_4s_test.ini](uploads/fe387a8b33bce1c9f6e7b9928d7b335b/pbilby_roq_4s_test.ini)
ROQ should be a factor of 5-10 faster, but this is not the case: the run completed only 1.4x faster. Profiling shows that the time spent doing computations is similarly close to 100%. Although the fraction of barrier time is slightly higher, this does not account for the discrepancy.
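
One way to quantify the gap is a back-of-the-envelope Amdahl's law estimate. The sketch below assumes that ROQ accelerates only one part of the runtime by the expected factor of 5-10 and leaves everything else unchanged. That assumption is not established by the profiling, and the function name `accelerated_fraction` is hypothetical.

```python
def accelerated_fraction(overall_speedup, local_speedup):
    """Solve Amdahl's law for p, the accelerated fraction of the runtime.

    overall = 1 / ((1 - p) + p / local)
    =>  p = (1 - 1/overall) / (1 - 1/local)

    Assumption: only a fraction p of the original runtime is sped up,
    by exactly `local_speedup`; the remainder is unchanged.
    """
    if overall_speedup <= 0:
        raise ValueError("overall_speedup must be positive")
    if local_speedup <= 1:
        raise ValueError("local_speedup must be greater than 1")
    return (1 - 1 / overall_speedup) / (1 - 1 / local_speedup)


# Observed overall speedup of 1.4x against the expected 5-10x.
for expected in (5, 10):
    p = accelerated_fraction(1.4, expected)
    print(f"expected {expected}x -> accelerated fraction {p:.2f}")
```

Under this assumption, a 1.4x overall speedup would mean that only about a third of the original runtime benefited from ROQ. This is an illustrative estimate, not a measured result.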
![std_8](uploads/751dfa029a09b055c12d12d10fdb5aa4/std_8.png)
...