Visually inspect the A posteriors to see if everything looks OK. Things to look out for: is GR recovered (within or close to 90% CI represented by vertical lines); weird posterior shape (like double peaks); how it compares to gwtc3 posterior (about the same, better, worse).
I propose following key for filling in the table (you can include brief text note after the symbol if you want to elaborate). Use 2 symbols. 1st represents how bilby posterior looks:
* :white_check_mark: nice single peak and GR recovered [`:white_check_mark:`]
* :ok: GR recovered but posterior looks weird (2nd peak, very broad, etc.) [:ok:]
* :x: GR not recovered (A=0 significantly beyond 90% CI) [:x:]
2nd symbol represents how well does bilby posterior compare to gwtc3 one:
* :arrow_up: bilby has noticebly better posterior (narrower, 2nd peaks damped/removed, closer to GR) [:arrow_up:]
* :arrow_down_small: bilby is noticeably worse (wider, more 2nd peaks, farther from GR) [:arrow_down_small:]
* :neutral_face: both posteriors look more or less the same [:neutral_face:]