... | ... | @@ -109,7 +109,7 @@ To review the evidence (marginal likelihood) estimates produced by the nested sa |
|
|
Based on the tests above, we sign off the evidence evaluation as reviewed with the following criteria:
|
|
|
|
|
|
* The evidences for the `dynesty`, `nestle` and `pypolychord` samplers appear valid when using more than 1000 live points. In these cases any small systematic bias on the evidence is well within the statistical variation of the evidence. We therefore recommend that evidences from these samplers only be quoted when more than 1000 live points are used.
|
|
|
* For these samplers, and using greater than 1000 live points, the evidence uncertainties output by bilby should be reliable and provide conservative bounds (i.e., they may be slight underestimates of the true uncertainty).
|
|
|
* For these samplers, and using greater than 1000 live points, the evidence uncertainties output by bilby should be reliable and provide conservative bounds (i.e., they may be slight overestimates of the true uncertainty).
|
|
|
* The evidences for other samplers can suffer significant systematic biases across a broad range of numbers of live points. If evidences for these samplers are required then the tests in this review would need to be reproduced to show specific settings that can reduce the bias.
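
The criteria above can be sketched as a small guard that a script could apply before quoting an evidence. This is only an illustrative sketch: `REVIEWED_SAMPLERS`, `MIN_LIVE_POINTS` and `evidence_is_quotable` are hypothetical names introduced here and are not part of bilby.

```python
# Hypothetical helper, not part of bilby: encodes the sign-off criteria above.
REVIEWED_SAMPLERS = {"dynesty", "nestle", "pypolychord"}
MIN_LIVE_POINTS = 1000  # the criteria require strictly more than this


def evidence_is_quotable(sampler: str, nlive: int) -> bool:
    """Return True if the evidence falls within the reviewed criteria.

    The sampler must be one of the reviewed samplers and the run must
    have used more than MIN_LIVE_POINTS live points.
    """
    return sampler.lower() in REVIEWED_SAMPLERS and nlive > MIN_LIVE_POINTS
```

For example, `evidence_is_quotable("dynesty", 2000)` passes the check, whereas a run with exactly 1000 live points, or a run using a sampler outside the reviewed set, does not.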
|
|
|
|
|
|
At the time of writing, consistency tests between bilby and LALInference for the evidence produced for a real gravitational-wave signal are underway. However, the tests performed so far are equivalent to those used to evaluate LALInference and are deemed sufficient. |
|
|
\ No newline at end of file |