Dirichlet Process Gaussian-mixture model: An application to localizing coalescing binary neutron stars with gravitational-wave observations

Where do gravitational waves like GW170817 come from? Using our network of detectors, we cannot pinpoint a source, but we can make a good estimate—the amplitude of the signal tells us about the distance; the time delay between the signal arriving at different detectors, and the relative amplitudes of the signal in different detectors, tell us about the sky position (see the excellent video by Leo Singer).

In this paper we look at full three-dimensional localization of gravitational-wave sources; we import a (rather cunning) technique from computer vision to construct a probability distribution for the source’s location, and then explore how well we could localise a set of simulated binary neutron stars. Knowing the source location enables lots of cool science. First, it aids direct follow-up observations with non-gravitational-wave observatories, searching for electromagnetic or neutrino counterparts. It’s especially helpful if you can cross-reference with galaxy catalogues, to find the most probable source locations (this technique was used to find the kilonova associated with GW170817). Even without finding a counterpart, knowing the most probable host galaxy helps us figure out how the source formed (have lots of stars been born recently, or are all the stars old?), and allows us to measure the expansion of the Universe. Having a reliable technique to reconstruct source locations is useful!

This was a fun paper to write [bonus note]. I’m sure it will be valuable, both for showing how to perform this type of reconstruction of a multi-dimensional probability density, and for its implications for source localization and follow-up of gravitational-wave signals. I go into details of both below, first discussing our statistical model (this is a bit technical), then looking at our results for a set of binary neutron stars (which have implications for hunting for counterparts).

Dirichlet process Gaussian mixture model

When we analyse gravitational-wave data to infer the source properties (location, masses, etc.), we map out parameter space with a set of samples: a list of points in the parameter space, with more around more probable locations and fewer in less probable locations. These samples encode everything about the probability distribution for the different parameters; we just need to extract it…

For our application, we want a nice smooth probability density. How do we convert a bunch of discrete samples to a smooth distribution? The simplest thing is to bin the samples. However, picking the right bin size is difficult, and becomes much harder in higher dimensions. Another popular option is to use kernel density estimation. This is better at ensuring smooth results, but you now have to worry about the size of your kernels.
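As a toy illustration (not this paper's method), here is how those two standard options look in Python; the histogram's bin count and the KDE's bandwidth rule are exactly the arbitrary choices complained about above:

```python
import numpy as np
from scipy.stats import gaussian_kde

rng = np.random.default_rng(42)
samples = rng.normal(loc=100.0, scale=15.0, size=2000)  # e.g. distance samples (Mpc)

# Option 1: histogram. The result depends on the arbitrary choice of bins.
counts, edges = np.histogram(samples, bins=30, density=True)

# Option 2: kernel density estimate. Smoother, but the bandwidth comes from a
# rule of thumb (Scott's rule by default), not from an extra layer of inference.
kde = gaussian_kde(samples)
grid = np.linspace(50.0, 150.0, 200)
density = kde(grid)  # smooth estimate of the probability density
```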

Our approach is in essence to use a kernel density estimate, but to learn the size and position of the kernels (as well as the number) from the data as an extra layer of inference. The “Gaussian mixture model” part of the name refers to the kernels—we use several different Gaussians. The “Dirichlet process” part refers to how we assign their properties (their means and standard deviations). What I really like about this technique, as opposed to the usual rule-of-thumb approaches used for kernel density estimation, is that it is well justified from a theoretical point of view.

I hadn’t come across a Dirichlet process before. Section 2 of the paper is a walkthrough of how I built up an understanding of this mathematical object, and it contains lots of helpful references if you’d like to dig deeper.

In our application, you can think of the Dirichlet process as being a probability distribution for probability distributions. We want a probability distribution describing the source location. Given our samples, we infer what this looks like. We could put all the probability into one big Gaussian, or we could put it into lots of little Gaussians. The Gaussians could be wide or narrow or a mix. The Dirichlet process allows us to assign probabilities to each configuration of Gaussians; for example, if our samples are all in the northern hemisphere, we probably want Gaussians centred around there, rather than in the southern hemisphere.
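If you want a feel for the Dirichlet process itself, here is a minimal stick-breaking sketch (one standard construction; the concentration parameter and base distribution are arbitrary choices for illustration) showing how it generates a random set of mixture weights:

```python
import numpy as np

rng = np.random.default_rng(1)
alpha = 2.0   # concentration parameter (arbitrary for illustration)
n_max = 20    # truncation: a finite stand-in for the infinite process

# Break a unit-length stick: each fraction is a Beta(1, alpha) share of what remains.
beta = rng.beta(1.0, alpha, size=n_max)
remaining = np.concatenate(([1.0], np.cumprod(1.0 - beta)[:-1]))
weights = beta * remaining  # mixture weights; they sum to (almost) 1

# Pairing each weight with a Gaussian drawn from a base distribution gives one
# random mixture—that is, a single draw from the Dirichlet process prior.
means = rng.normal(0.0, 10.0, size=n_max)
print(np.round(np.sort(weights)[::-1][:5], 3))  # a few weights dominate
```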

The resulting probability distribution for the source location is quick to evaluate at any single point. This means we can rapidly produce a list of most probable source galaxies—extremely handy if you need to know where to point a telescope before a kilonova fades away (or someone else finds it).
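A rough sketch of this workflow, using scikit-learn's variational Dirichlet process Gaussian mixture as a stand-in for our own implementation (the sample positions, galaxy positions, and component cap are all made up):

```python
import numpy as np
from sklearn.mixture import BayesianGaussianMixture

rng = np.random.default_rng(0)
# Fake 3D location samples (Mpc), bimodal as if the sky position were uncertain.
samples = np.vstack([
    rng.normal([100.0, 0.0, 0.0], 10.0, size=(1500, 3)),
    rng.normal([-80.0, 50.0, 0.0], 20.0, size=(500, 3)),
])

# Cap the mixture at 20 Gaussians; the Dirichlet process prior lets the data
# decide how many are actually used (surplus components get ~zero weight).
dpgmm = BayesianGaussianMixture(
    n_components=20,
    weight_concentration_prior_type="dirichlet_process",
    covariance_type="full",
    max_iter=500,
).fit(samples)

# Evaluating the smooth density at a list of galaxy positions is cheap, so
# candidate hosts can be ranked by probability density.
galaxies = np.array([[95.0, 5.0, -2.0], [-75.0, 45.0, 10.0], [0.0, -200.0, 0.0]])
log_density = dpgmm.score_samples(galaxies)
ranking = np.argsort(log_density)[::-1]  # most probable host galaxy first
```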

Gravitational-wave localization

To verify that our technique works, and to develop an intuition for three-dimensional localizations, we studied a set of simulated binary neutron star signals created for the First 2 Years trilogy of papers. This data set is now well studied; it illustrates performance in what we anticipated to be the first two observing runs of the advanced detectors, which turned out to be not too far from the truth. We have previously looked at three-dimensional localizations for these signals using a super rapid approximation.

The plots below show how well we could localise our binary neutron star sources. Specifically, the plots show the size of the volume which has a 90% probability of containing the source versus the signal-to-noise ratio (the loudness) of the signal. Typically, volumes are 10^4–10^5~\mathrm{Mpc}^3, which is about 10^{68}–10^{69} Olympic swimming pools. Such a volume would contain something like 100–1000 galaxies.

Volume versus signal-to-noise ratio

Localization volume as a function of signal-to-noise ratio. The top panel shows results for two-detector observations: the LIGO-Hanford and LIGO-Livingston (HL) network similar to that in the first observing run, and the LIGO and Virgo (HLV) network similar to that in the second observing run. The bottom panel shows all observations for the HLV network, including those with all three detectors, which are colour coded by the fraction of the total signal-to-noise ratio from Virgo. In both panels, there are fiducial lines scaling inversely with the sixth power of the signal-to-noise ratio. Adapted from Fig. 4 of Del Pozzo et al. (2018).

Looking at the results in detail, we can learn a number of things:

  1. The localization volume is roughly inversely proportional to the sixth power of the signal-to-noise ratio [bonus note]. Loud signals are localized much better than quieter ones!
  2. The localization dramatically improves when we have three-detector observations. The extra detector improves the sky localization, which reduces the localization volume.
  3. To get the benefit of the extra detector, the source needs to be close enough that all the detectors could get a decent amount of the signal-to-noise ratio. In our case, Virgo is the least sensitive, and we see that the best localizations are when it has a fair share of the signal-to-noise ratio.
  4. Considering the cases where we only have two detectors, localization volumes get bigger at a given signal-to-noise ratio as the detectors get more sensitive. This is because we can detect sources at greater distances.

Putting all these bits together, I think in the future, when we have lots of detections, it would make most sense to prioritise following up the loudest signals. These are the best localised, and will also be the brightest since they are the closest, meaning there’s the greatest potential for actually finding a counterpart. As the sensitivity of the detectors improves, it’s only going to get more difficult to find a counterpart to a typical gravitational-wave signal, as sources will be further away and less well localized. However, having more sensitive detectors also means that we are more likely to have a really loud signal, which should be really well localized.

Banana vs cucumber

Left: Localization (yellow) with a network of two low-sensitivity detectors. The sky location is uncertain, but we know the source must be nearby. Right: Localization (green) with a network of three high-sensitivity detectors. We have good constraints on the source location, but it could now be at a much greater range of distances. Not to scale.

Using our localization volumes as a guide, you would only need to search one galaxy to find the true source in about 7% of cases with a three-detector network similar to that at the end of our second observing run. Similarly, only ten galaxies would need to be searched in 23% of cases. It might be possible to get even better performance by considering which galaxies are most probable because they are the biggest or the most likely to produce merging binary neutron stars. This is definitely a good approach to follow.

Three-dimensional localization with galaxy catalogue

Galaxies within the 90% credible volume of an example simulated source, colour coded by probability. The galaxies are from the GLADE Catalog; incompleteness in the plane of the Milky Way causes the missing wedge of galaxies. The true source location is marked by a cross [bonus note]. Part of Figure 5 of Del Pozzo et al. (2018).

arXiv: 1801.08009 [astro-ph.IM]
Journal: Monthly Notices of the Royal Astronomical Society; 479(1):601–614; 2018
Code: 3d_volume
Buzzword bingo: Interdisciplinary (we worked with computer scientist Tom Haines); machine learning (the inference involving our Dirichlet process Gaussian mixture model); multimessenger astronomy (as our results are useful for following up gravitational-wave signals in the search for counterparts)

Bonus notes

Writing

We started writing this paper back before the first observing run of Advanced LIGO. We had a pretty complete draft on Friday 11 September 2015. We just needed to gather together a few extra numbers and polish up the figures and we’d be done! At 10:50 am on Monday 14 September 2015, we made our first detection of gravitational waves. The paper was put on hold. The pace of discoveries over the coming years meant we never quite found enough time to get it together—I’ve rewritten the introduction a dozen times. This is a shame, as it meant that this study came out much later than our other three-dimensional localization study. Still, it’s extremely satisfying to have it done, and the delay has the advantage of justifying one of my favourite acknowledgement sections.

Sixth power

We find that the localization volume \Delta V is inversely proportional to the sixth power of the signal-to-noise ratio \varrho. This is what you would expect. The localization volume depends upon the angular uncertainty on the sky \Delta \Omega, the distance to the source D, and the distance uncertainty \Delta D,

\Delta V \sim D^2 \Delta \Omega \Delta D.

Typically, the uncertainty on a parameter (like the masses) scales inversely with the signal-to-noise ratio. This is the case for the logarithm of the distance, which means

\displaystyle \frac{\Delta D}{D} \propto \varrho^{-1}.

The uncertainty in the sky location (being two dimensional) scales inversely with the square of the signal-to-noise ratio,

\Delta \Omega \propto \varrho^{-2}.

The signal-to-noise ratio itself is inversely proportional to the distance to the source (sources further away are quieter), so D \propto \varrho^{-1}. Since \Delta D \propto D \varrho^{-1}, we have \Delta V \sim D^2 \Delta \Omega \Delta D \propto D^3 \varrho^{-3}. Therefore, putting everything together gives

\Delta V \propto \varrho^{-6}.

Treasure

We all know that treasure is marked by a cross. In the case of a binary neutron star merger, dense material ejected from the neutron stars will decay to heavy elements like gold and platinum, so there is definitely a lot of treasure at the source location.


GW170814—Enter Virgo

On 14 August 2017 a gravitational wave signal (GW170814), originating from the coalescence of a binary black hole system, was observed by the global gravitational-wave observatory network of the two Advanced LIGO detectors and Advanced Virgo.  That’s right, Virgo is in the game!

A new foe appeared

Very few things excite me like unlocking a new character in Smash Bros. A new gravitational wave observatory might come close.

Advanced Virgo joined O2, the second observing run of the advanced detector era, on 1 August. This was a huge achievement. It has not been an easy route commissioning the new detector—it never ceases to amaze me how sensitive these machines are. Together, Advanced Virgo (near Pisa) and the two Advanced LIGO detectors (in Livingston and Hanford in the US) would take data until the end of O2 on 25 August.

On 14 August, we found a signal. A signal that was observable in all three detectors [bonus note]. Virgo is less sensitive than the LIGO instruments, so there is no impressive plot that shows something clearly popping out, but the Virgo data do complement the LIGO observations, indicating a consistent signal in all three detectors [bonus note].

Three different ways of visualising GW170814: an SNR time series, a spectrogram and a waveform reconstruction

A cartoon of three different ways to visualise GW170814 in the three detectors. These take a bit of explaining. The top panel shows the signal-to-noise ratio of the search template that matched GW170814. They peak at the time corresponding to the merger. The peaks are clear in Hanford and Livingston. The peak in Virgo is less exceptional, but it matches the expected time delay and amplitude for the signal. The middle panels show time–frequency plots. The upward sweeping chirp is visible in Hanford and Livingston, but less so in Virgo as it is less sensitive. The plot is zoomed in so that it’s possible to pick out the detail in Virgo, but the chirp is visible for a longer stretch of time than plotted in Livingston. The bottom panel shows whitened and band-passed strain data, together with the 90% region of the binary black hole templates used to infer the parameters of the source (the narrow dark band), and an unmodelled, coherent reconstruction of the signal (the wider light band). The agreement between the templates and the reconstruction is a check that the gravitational waves match our expectations for binary black holes. The whitening of the data mirrors how we do the analysis, by weighting noise at different frequencies by an estimate of their typical fluctuations. The signal does certainly look like the inspiral, merger and ringdown of a binary black hole. Figure 1 of the GW170814 Paper.

The signal originated from the coalescence of two black holes. GW170814 is thus added to the growing family of GW150914, LVT151012, GW151226 and GW170104.

GW170814 most closely resembles GW150914 and GW170104 (perhaps there’s something about ending with a 4). If we compare the masses of the two component black holes of the binary (m_1 and m_2), and the black hole they merge to form (M_\mathrm{f}), they are all quite similar:

  • GW150914: m_1 = 36.2^{+5.2}_{-3.8} M_\odot, m_2 = 29.1^{+3.7}_{-4.4} M_\odot, M_\mathrm{f} = 62.3^{+3.7}_{-3.1} M_\odot;
  • GW170104: m_1 = 31.2^{+5.4}_{-6.0} M_\odot, m_2 = 19.4^{+5.3}_{-5.9} M_\odot, M_\mathrm{f} = 48.7^{+5.7}_{-4.6} M_\odot;
  • GW170814: m_1 = 30.5^{+5.7}_{-3.0} M_\odot, m_2 = 25.3^{+2.8}_{-4.2} M_\odot, M_\mathrm{f} = 53.2^{+3.2}_{-2.5} M_\odot.

GW170814’s source is another high-mass black hole system. It’s not too surprising (now we know that these systems exist) that we observe lots of these, as more massive black holes produce louder gravitational wave signals.

GW170814 is also comparable in terms of black hole spins. Spins are more difficult to measure than masses, so we’ll just look at the effective inspiral spin \chi_\mathrm{eff}, a particular combination of the two component spins that influences how they inspiral together, and the spin of the final black hole a_\mathrm{f}:

  • GW150914: \chi_\mathrm{eff} = -0.06^{+0.14}_{-0.14}, a_\mathrm{f} = 0.70^{+0.07}_{-0.05};
  • GW170104: \chi_\mathrm{eff} = -0.12^{+0.21}_{-0.30}, a_\mathrm{f} = 0.64^{+0.09}_{-0.20};
  • GW170814: \chi_\mathrm{eff} = 0.06^{+0.12}_{-0.12}, a_\mathrm{f} = 0.70^{+0.07}_{-0.05}.

There’s some spread, but the effective inspiral spins are all consistent with being close to zero. Small values occur when the individual spins are small, if the spins are misaligned with each other, or some combination of the two. I’m starting to ponder if high-mass black holes might have small spins. We don’t have enough information to tease these apart yet, but this new system is consistent with the story so far.

One of the things Virgo helps a lot with is localizing the source on the sky. Most of the information about the source location comes from the difference in arrival times at the detectors (since we know that gravitational waves should travel at the speed of light). With two detectors, the time delay constrains the source to a ring on the sky; with three detectors, time delays can narrow the possible locations down to a couple of blobs. Folding in the amplitude of the signal as measured by the different detectors adds extra information, since detectors are not equally sensitive to all points on the sky (they are most sensitive to sources overhead or underneath). This can even help when you don’t observe the signal in all detectors, as you know the source must be in a direction that detector isn’t too sensitive to. GW170814 arrived at LIGO Livingston first (although it’s not a competition), then ~8 ms later at LIGO Hanford, and finally ~14 ms later at Virgo. If we only had the two LIGO detectors, we’d have an uncertainty on the source’s sky position of over 1000 square degrees, but adding in Virgo, we get this down to 60 square degrees. That’s still pretty large by astronomical standards (the full Moon is about a quarter of a square degree), but a fantastic improvement [bonus note]!
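To get a feel for how a time delay picks out a ring on the sky, here is a toy triangulation sketch (nothing like the real analysis, which uses the full data in all detectors). The detector coordinates are approximate geocentric positions, and the timing uncertainty is an assumption for illustration:

```python
import numpy as np

c = 299792458.0  # speed of light (m/s)
# Approximate geocentric positions (m) of LIGO Hanford and LIGO Livingston.
r_H = np.array([-2.161e6, -3.834e6, 4.600e6])
r_L = np.array([-0.074e6, -5.496e6, 3.224e6])
baseline = r_H - r_L

tau_obs = 8.0e-3    # Livingston-to-Hanford delay (s), as quoted for GW170814
sigma_tau = 0.2e-3  # assumed timing uncertainty (s), made up for illustration

# Scan a coarse grid of sky directions and keep those whose predicted delay
# is consistent with the measurement: the survivors trace a ring on the sky.
theta = np.linspace(0.0, np.pi, 181)
phi = np.linspace(0.0, 2.0 * np.pi, 361)
T, P = np.meshgrid(theta, phi, indexing="ij")
n_hat = np.stack([np.sin(T) * np.cos(P), np.sin(T) * np.sin(P), np.cos(T)], axis=-1)

tau_pred = n_hat @ baseline / c  # predicted delay for each direction
within_1sigma = np.abs(tau_pred - tau_obs) < sigma_tau
print(f"Fraction of sky directions consistent at 1 sigma: {within_1sigma.mean():.3f}")
```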

Sky localization of GW170814

90% probability localizations for GW170814. The large banana-shaped (and banana-coloured, but not banana-flavoured) curve uses just the two LIGO detectors; the area is 1160 square degrees. The green shows the improvement from adding Virgo; the area is just 100 square degrees. Both of these are calculated using BAYESTAR, a rapid localization algorithm. The purple map is the final localization from our full parameter estimation analysis (LALInference); its area is just 60 square degrees! Whereas BAYESTAR only uses the best matching template from the search, the full parameter estimation analysis is free to explore a range of different templates. Part of Figure 3 of the GW170814 Paper.

Having additional detectors can help improve gravitational wave measurements in other ways too. One of the predictions of general relativity is that gravitational waves come in two polarizations. These polarizations describe the pattern of stretching and squashing as the wave passes, and are illustrated below.

Plus and cross polarizations

The two polarizations of gravitational waves: plus (left) and cross (right). Here, the wave is travelling into or out of the screen. Animations adapted from those by MOBle on Wikipedia.

These two polarizations are the two tensor polarizations, but other patterns of squeezing could be present in modified theories of gravity. If we could detect any of these we would immediately know that general relativity is wrong. The two LIGO detectors are almost exactly aligned, so it’s difficult to get any information on other polarizations. (We tried with GW150914 and couldn’t say anything either way). With Virgo, we get a little more information. As a first illustration of what we may be able to do, we compared how well the observed pattern of radiation at the detectors matched different polarizations, to see how general relativity’s tensor polarizations compared to a signal of entirely vector or scalar radiation. The tensor polarizations are clearly preferred, so general relativity lives another day. This isn’t too surprising, as most modified theories of gravity with other polarizations predict mixtures of the different polarizations (rather than all of one). To be able to constrain all the mixtures with these short signals we really need a network of five detectors, so we’ll have to wait for KAGRA and LIGO-India to come on-line.

The six gravitational-wave polarizations

The six polarizations of a metric theory of gravity. The wave is travelling in the z direction. (a) and (b) are the plus and cross tensor polarizations of general relativity. (c) and (d) are the scalar breathing and longitudinal modes, and (e) and (f) are the vector x and y polarizations. The tensor polarizations (in red) are transverse; the vector modes and the longitudinal scalar mode (in green) are longitudinal. The scalar breathing mode (in blue) is an isotropic expansion and contraction, so it’s a bit of a mix of transverse and longitudinal. Figure 10 from (the excellent) Will (2014).

We’ll be presenting a more detailed analysis of GW170814 later, in papers summarising our O2 results, so stay tuned for more.

Title: GW170814: A three-detector observation of gravitational waves from a binary black hole coalescence
arXiv: 1709.09660 [gr-qc]
Journal: Physical Review Letters; 119(14):141101(16) [bonus note]
Data release: LIGO Open Science Center
Science summary: GW170814: A three-detector observation of gravitational waves from a binary black hole coalescence

If you’re looking for the most up-to-date results regarding GW170814, check out the O2 Catalogue Paper.

Bonus notes

Signs of paranoia

Those of you who have been following the story of gravitational waves for a while may remember the case of the Big Dog. This was a blind injection of a signal during the initial detector era. One of the things that made it an interesting signal to analyse was that it had been injected with an inconsistent sign in Virgo compared to the two LIGO instruments (basically it was upside down). Making this type of sign error is easy, and we were a little worried that we might make this sort of mistake when analysing the real data. The Virgo calibration team were extremely careful about this, and confident in their results. Of course, we’re quite paranoid, so during the preliminary analysis of GW170814, we tried some parameter estimation runs with the data from Virgo flipped. This was clearly disfavoured compared to the right sign, so we all breathed easily.

I am starting to believe that God may be a detector commissioner. At the start of O1, we didn’t have the hardware injection systems operational, but GW150914 showed that things were working properly. Now, with a third detector on-line, GW170814 shows that the network is functioning properly. Astrophysical injections are definitely the best way to confirm things are working!

Signal hunting

Our usual way to search for binary black hole signals is to compare the data to a bank of waveform templates. Since Virgo is less sensitive than the two LIGO detectors, and would only be running for a short amount of time, these main searches weren’t extended to use data from all three detectors. This seemed like a sensible plan: we were confident that this wouldn’t cause us to miss anything, and we can detect GW170814 with high significance using just data from Livingston and Hanford—the false alarm rate is estimated to be less than 1 in 27000 years (meaning that if the detectors were left running in the same state, we’d expect random noise to make something this signal-like less than once every 27000 years). However, we realised that we wanted to be able to show that Virgo had indeed seen something, and the search wasn’t set up for this.

Therefore, for the paper, we list three different checks to show that Virgo did indeed see the signal.

  1. In a similar spirit to the main searches, we took the best fitting template (it doesn’t matter in terms of results if this is the best matching template found by the search algorithms, or the maximum likelihood waveform from parameter estimation), and compared this to a stretch of data. We then calculated the probability of seeing a peak in the signal-to-noise ratio (as shown in the top row of Figure 1) at least as large as identified for GW170814, within the time window expected for a real signal. Little blips of noise can cause peaks in the signal-to-noise ratio; for example, there’s a glitch about 50 ms after GW170814 which shows up. We find that there’s a 0.3% probability of getting a signal-to-noise ratio peak as large as GW170814. That’s pretty solid evidence for Virgo having seen the signal, but perhaps not overwhelming.
  2. Binary black hole coalescences can also be detected (if the signals are short) by our searches for unmodelled signals. This was the case for GW170814. These searches were using data from all three detectors, so we can compare results with and without Virgo. Using just the two LIGO detectors, we calculate a false alarm rate of 1 per 300 years. This is good enough to claim a detection. Adding in Virgo, the false alarm rate drops to 1 per 5900 years! We see adding in Virgo improves the significance by almost a factor of 20.
  3. Using our parameter estimation analysis, we calculate the evidence (marginal likelihood) for (i) there being a coherent signal in Livingston and Hanford, and Gaussian noise in Virgo, and (ii) there being a coherent signal in all three detectors. We then take the ratio to calculate the Bayes factor. We find that a coherent signal in all three detectors is preferred by a factor of over 1600. This is a variant of a test proposed in Veitch & Vecchio (2010); it could be fooled if the noise in Virgo is non-Gaussian (if there is a glitch), but together with the above we think that the simplest explanation for Virgo’s data is that there is a signal.

In conclusion: Virgo works. Probably.

Follow-up observations

Adding Virgo to the network greatly improves localization of the source, which is a huge advantage when searching for counterparts. For a binary black hole, as we have here, we don’t expect a counterpart (which would make finding one even more exciting). So far, no counterpart has been reported.


Announcement

This is the first observation we’ve announced before being published. The draft made public at the time of the announcement was accepted, pending fixing up some minor points raised by the referees (who were fantastically quick in reporting back). I guess that binary black holes are now familiar enough that we are on solid ground claiming them. I’d be interested to know if people think that it would be good if we didn’t always wait for the rubber stamp of peer review, or whether they would prefer for detections to be externally vetted? Sharing papers before publication would mean that we get more chance for feedback from the community, which would be good, but perhaps the Collaboration should be seen to do things properly?

One reason that the draft paper is being shared early is because of an opportunity to present to the G7 Science Ministers Meeting in Italy. I think any excuse to remind politicians that international collaboration is a good thing™ is worth taking. Although I would have liked the paper to be a little more polished [bonus advice]. The opportunity to present here only popped up recently, which is one reason why things aren’t as perfect as usual.

I also suspect that Virgo were keen to demonstrate that they had detected something prior to any Nobel Prize announcement. There’s a big difference between stories being written about LIGO and Virgo’s discoveries, and having as an afterthought that Virgo also ran in August.

The main reason, however, was to get this paper out before the announcement of GW170817. The identification of GW170817’s counterpart relied on us being able to localize the source. In that case, there wasn’t a clear signal in Virgo (the lack of a signal tells us the source was in a direction Virgo wasn’t particularly sensitive to). People agreed that we really needed to demonstrate that Virgo can detect gravitational waves in order to be convincing that not seeing a signal is useful information. We needed to demonstrate that Virgo does work so that our case for GW170817 was watertight and bulletproof (it’s important to be prepared).

Perfect advice

Some useful advice I was given when a PhD student was that done is better than perfect. Having something finished is often more valuable than having lots of really polished bits that don’t fit together to make a cohesive whole, and having everything absolutely perfect takes forever. This is useful to remember when writing up a thesis. I think it might apply here too: the Paper Writing Team have done a truly heroic job in getting something this advanced in little over a month. There’s always one more thing to do… [one more bonus note]

One more thing

One point I was hoping that the Paper Writing Team would clarify is our choice of prior probability distribution for the black hole spins. We don’t get a lot of information about the spins from the signal, so our choice of prior has an impact on the results.

The paper says that we assume “no restrictions on the spin orientations”, which doesn’t make much sense, as one of the two waveforms we use to analyse the signal only includes spins aligned with the orbital angular momentum! What the paper meant was that we assume a prior distribution which has an isotropic distribution of spins, and for the aligned spin (no precession) waveform, we assume a prior probability distribution on the aligned components of the spins which matches what you would have for an isotropic distribution of spins (in effect, assuming that we can only measure the aligned components of the spins, which is a good approximation).
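A small Monte Carlo sketch of that construction (taking spin magnitudes uniform on [0, 1), which is an assumption for illustration, not necessarily the analysis’s choice): draw spins isotropically and look at the distribution of the aligned components, which is what the aligned-spin waveform’s prior is chosen to match.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 100_000
a = rng.uniform(0.0, 1.0, size=n)          # spin magnitudes (assumed uniform)
cos_tilt = rng.uniform(-1.0, 1.0, size=n)  # isotropic spin orientations
chi_z = a * cos_tilt                       # aligned (z) spin components

# For uniform magnitudes and isotropic directions, the implied prior is
# p(chi_z) = -ln|chi_z| / 2 on (-1, 1); compare the histogram against it.
hist, edges = np.histogram(chi_z, bins=50, range=(-1.0, 1.0), density=True)
centres = 0.5 * (edges[:-1] + edges[1:])
analytic = -np.log(np.abs(centres)) / 2.0
```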

Going the distance: Mapping host galaxies of LIGO and Virgo sources in three dimensions using local cosmography and targeted follow-up

GW150914 claimed the title of many firsts—it was the first direct observation of gravitational waves, the first observation of a binary black hole system, the first observation of two black holes merging, the first time we’ve tested general relativity in such extreme conditions… However, there are still many firsts for gravitational-wave astronomy yet to come (hopefully, some to be accompanied by cake). One of the most sought after is the first signal to have a clear electromagnetic counterpart—a glow in some part of the spectrum of light (from radio to gamma-rays) that we can observe with telescopes.

Identifying a counterpart is challenging, as it is difficult to accurately localise a gravitational-wave source: electromagnetic observers must cover a large area of sky before any counterparts fade. Then, if something is found, it can be hard to determine if it is from the same source as the gravitational waves, or something else…

To help the search, it is useful to have as much information as possible about the source. Especially useful is the distance to the source. This can help you plan where to look. For nearby sources, you can cross-reference with galaxy catalogues, and perhaps pick out the biggest galaxies as the most likely locations for the source [bonus note]. Distance can also help plan your observations: you might want to start with regions of the sky where the source would be closer and so easiest to spot, or you may want to prioritise points where it is further away and so you’d need to observe longer to detect it (I’m not sure there’s a best strategy; it depends on the telescope and the amount of observing time available). In this paper we describe a method to provide easy-to-use distance information, which could be supplied to observers to help their search for a counterpart.

Going the distance

This work is the first spin-off from the First 2 Years trilogy of papers, which looked at sky localization and parameter estimation for binary neutron stars in the first two observing runs of the advanced-detector era. Binary neutron star coalescences are prime candidates for electromagnetic counterparts as we think there should be an explosion as they merge. I was heavily involved in the last two papers of the trilogy, but this study was led by Leo Singer: I think I mostly annoyed Leo by being a stickler when it came to writing up the results.

3D localization with the two LIGO detectors

Three-dimensional localization showing the 20%, 50%, and 90% credible levels for a typical two-detector early Advanced LIGO event. The Earth is shown at the centre, marked by \oplus. The true location is marked by the cross. Leo poetically described this as looking like the seeds of the jacaranda tree, and less poetically as potato chips. Figure 1 of Singer et al. (2016).

The idea is to provide a convenient means of sharing a 3D localization for a gravitational wave source. The full probability distribution is rather complicated, but it can be made more manageable if you break it up into pixels on the sky. Since astronomers need to decide where to point their telescopes, breaking up the 3D information along different lines of sight should be useful for them.

Each pixel covers a small region of the sky, and along each line of sight, the probability distribution for distance D can be approximated using an ansatz

\displaystyle p(D|\mathrm{data}) \propto D^2\exp\left[-\frac{(D - \mu)^2}{2\sigma^2}\right],

where \mu and \sigma are calculated for each pixel individually. The form of this ansatz can be understood as follows: the posterior probability distribution is proportional to the product of the prior and the likelihood. Our prior is that sources are uniformly distributed in volume, which gives the D^2 factor, and the likelihood can often be well approximated as a Gaussian distribution, which gives the exponential piece [bonus note].
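A minimal sketch of evaluating this ansatz along one line of sight (the values of \mu and \sigma are made up; in practice they are computed per pixel from the localization):

```python
import numpy as np

def distance_ansatz(d, mu, sigma):
    """Unnormalized p(D|data) ~ D^2 exp[-(D - mu)^2 / (2 sigma^2)]."""
    return d**2 * np.exp(-0.5 * ((d - mu) / sigma) ** 2)

d = np.linspace(0.0, 500.0, 2001)             # distances (Mpc)
p = distance_ansatz(d, mu=180.0, sigma=40.0)  # made-up pixel parameters
p /= np.trapz(p, d)                           # normalize along this line of sight
```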

The ansatz doesn’t always fit perfectly, but it performs well on average. Considering the catalogue of binary neutron star signals used in the earlier papers, we find that roughly 50% of the time sources are found within the 50% credible volume, 90% are found in the 90% volume, etc. We looked at a more sophisticated means of constructing the localization volume in a companion paper.
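One way to run this sort of self-consistency check, sketched here with a KDE standing in for the real pipeline (a rough illustration, not the paper's implementation): estimate the credible level at which each injection's true location is found, and check that the levels are uniform.

```python
import numpy as np
from scipy.stats import gaussian_kde

def credible_level_of_truth(samples, truth):
    """samples: (N, 3) posterior samples; truth: (3,) true source location."""
    kde = gaussian_kde(samples.T)
    density_at_samples = kde(samples.T)
    density_at_truth = kde(np.reshape(truth, (3, 1)))[0]
    # Credible level = posterior mass at points denser than the true location.
    return np.mean(density_at_samples > density_at_truth)

# Over many simulated events, these levels should be uniform on [0, 1] if the
# volumes are well calibrated: ~50% of truths inside the 50% volume, and so on.
```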

The 3D localization is easy to calculate, and Leo has worked out a cunning way to evaluate the ansatz with BAYESTAR, our rapid sky-localization code, meaning that we can produce it on minute time-scales. This means that observers should have something to work with straight-away, even if we’ll need to wait a while for the full, final results. We hope that this will improve prospects for finding counterparts—some potential examples are sketched out in the penultimate section of the paper.

If you are interested in trying out the 3D information, there is a data release and the supplement contains a handy Python tutorial. We are hoping that the Collaboration will use the format for alerts for LIGO and Virgo’s upcoming observing run (O2).

arXiv: 1603.07333 [astro-ph.HE]; 1605.04242 [astro-ph.IM]
Journal: Astrophysical Journal Letters; 829(1):L15(7); 2016; Astrophysical Journal Supplement Series; 226(1):10(8); 2016
Data release: Going the distance
Favourite crisp flavour: Salt & vinegar
Favourite jacaranda: Jacaranda mimosifolia

Bonus notes

Catalogue shopping

The Event’s source has a luminosity distance of around 250–570 Mpc. This is sufficiently distant that galaxy catalogues are incomplete and not much use when it comes to searching. GW151226 and LVT151012 have similar problems, being at around the same distance or even further.

The gravitational-wave likelihood

For the professionals interested in understanding more about the shape of the likelihood, I’d recommend Cutler & Flanagan (1994). This is a fantastic paper which contains many clever things [bonus bonus note]. This work is really the foundation of gravitational-wave parameter estimation. From it, you can see how the likelihood can be approximated as a Gaussian. The uncertainty can then be evaluated using Fisher matrices. Many studies have been done using Fisher matrices, but it is important to check that this is a valid approximation, as nicely explained in Vallisneri (2008). I ran into a case where it wasn’t during my PhD.

Mergin’

As a reminder that smart people make mistakes, Cutler & Flanagan have a typo in the title of the arXiv posting of their paper. This is probably the most important thing to take away from this paper.

Parameter estimation for binary neutron-star coalescences with realistic noise during the Advanced LIGO era

The first observing run (O1) of Advanced LIGO is nearly here, and with it the prospect of the first direct detection of gravitational waves. That’s all wonderful and exciting (far more exciting than a custard cream or even a chocolate digestive), but there’s a lot to be done to get everything ready. Aside from remembering to vacuum the interferometer tubes and polish the mirrors, we need to see how the data analysis will work out. After all, having put so much effort into the detector, it would be a shame if we couldn’t do any science with it!

Parameter estimation

Since joining the University of Birmingham team, I’ve been busy working on trying to figure out how well we can measure things using gravitational waves. I’ve been looking at binary neutron star systems. We expect binary neutron star mergers to be the main source of signals for Advanced LIGO. We’d like to estimate how massive the neutron stars are, how fast they’re spinning, how far away they are, and where in the sky they are. Just published is my first paper on how well we should be able to measure things. This took a lot of hard work from a lot of people, so I’m pleased it’s all done. I think I’ve earnt a celebratory biscuit. Or two.

When we see something that looks like it could be a gravitational wave, we run code to analyse the data and try to work out the properties of the signal. Working out some properties is a bit trickier than others. Sadly, we don’t have an infinite number of computers, so it means it can take a while to get results. Much longer than the time to eat a packet of Jaffa Cakes…

The fastest algorithm we have for binary neutron stars is BAYESTAR. This takes the same time as maybe eating one chocolate finger. Perhaps two, if you’re not worried about the possibility of choking. BAYESTAR is fast as it only estimates where the source is coming from. It doesn’t try to calculate a gravitational-wave signal and match it to the detector measurements, instead it just looks at numbers produced by the detection pipeline—the code that monitors the detectors and automatically flags whenever something interesting appears. As far as I can tell, you give BAYESTAR this information and a fresh cup of really hot tea, and it uses Bayes’ theorem to work out how likely it is that the signal came from each patch of the sky.

To work out further details, we need to know what a gravitational-wave signal looks like and then match this to the data. This is done using a different algorithm, which I’ll refer to as LALInference. (As names go, this isn’t as cool as SKYNET). This explores parameter space (hopping between different masses, distances, orientations, etc.), calculating waveforms and then working out how well they match the data, or rather how likely it is that we’d get just the right noise in the detector to make the waveform fit what we observed. We then use another liberal helping of Bayes’ theorem to work out how probable those particular parameter values are.

It’s rather difficult to work out the waveforms, but some are easier than others. One of the things that makes things trickier is adding in the spins of the neutron stars. If you made a batch of biscuits at the same time you started a LALInference run, they’d still be good by the time a non-spinning run finished. With a spinning run, the biscuits might not be quite so appetising—I generally prefer more chocolate than penicillin on my biscuits. We’re working on speeding things up (if only to prevent increased antibiotic resistance).

In this paper, we were interested in what you could work out quickly, while there’s still a chance to catch any explosion that might accompany the merging of the neutron stars. We think that short gamma-ray bursts and kilonovae might be caused when neutron stars merge and collapse down to a black hole. (I find it mildly worrying that we don’t know what causes these massive explosions). To follow up on a gravitational-wave detection, you need to be able to tell telescopes where to point to see something and manage this while there’s still something that’s worth seeing. This means that using spinning waveforms in LALInference is right out; we just use BAYESTAR and the non-spinning LALInference analysis.

What we did

To figure out what we could learn from binary neutron stars, we generated a large catalogue of fake signals, and then ran the detection and parameter-estimation codes on this to see how they worked. This has been done before in The First Two Years of Electromagnetic Follow-Up with Advanced LIGO and Virgo, which has a rather delicious astrobites write-up. Our paper is the sequel to this (and features most of the same cast). One of the differences is that The First Two Years assumed that the detectors were perfectly behaved and had lovely Gaussian noise. In this paper, we added in some glitches. We took some real data™ from initial LIGO’s sixth science run and stretched this so that it matches the sensitivity Advanced LIGO is expected to have in O1. This process is called recolouring [bonus note]. We now have fake signals hidden inside noise with realistic imperfections, and can treat it exactly as we would real data. We ran it through the detection pipeline, and anything which was flagged as probably being a signal (we used a false alarm rate of once per century) was analysed with the parameter-estimation codes. We looked at how well we could measure the sky location and distance of the source, and the masses of the neutron stars. It’s all good practice for O1, when we’ll be running this analysis on any detections.

What we found

  1. The flavour of noise (recoloured or Gaussian) makes no difference to how well we can measure things on average.
  2. Sky-localization in O1 isn’t great, typically hundreds of square degrees (the median 90% credible region is 632 deg²); for comparison, the Moon is about a fifth of a square degree. This’ll make things interesting for the people with telescopes.

    Sky localization map for O1.

    Probability of a gravitational-wave signal coming from different points on the sky. The darker the red, the higher the probability. The star indicates the true location. This is one of the worst localized events from our study for O1. You can find more maps in the data release (including 3D versions); this is Figure 6 of Berry et al. (2015).

  3. BAYESTAR does just as well as LALInference, despite being about 2000 times faster.

    Sky localization for binary neutron stars during O1.

    Sky localization (the size of the patch of the sky that we’re 90% sure contains the source location) varies with the signal-to-noise ratio (how loud the signal is). The approximate best fit is \log_{10}(\mathrm{CR}_{0.9}/\mathrm{deg^2}) \approx -2 \log_{10}(\varrho) + 5.06, where \mathrm{CR}_{0.9} is the 90% sky area and \varrho is the signal-to-noise ratio (see the short sketch after this list). The results for BAYESTAR and LALInference agree, as do the results with Gaussian and recoloured noise. This is Figure 9 of Berry et al. (2015).

  4. We can’t measure the distance too well: the median 90% credible interval divided by the true distance (which gives something like twice the fractional error) is 0.85.
  5. Because we don’t include the spins of the neutron stars, we introduce some error into our mass measurements. The chirp mass, a combination of the individual masses that we’re most sensitive to [bonus note], is still reliably measured (the median offset is 0.0026 of the mass of the Sun, which is tiny), but we’ll have to wait for the full spinning analysis for individual masses.

    Mean offset in chirp-mass estimates when not including the effects of spin.

    Fraction of events with difference between the mean estimated and true chirp mass smaller than a given value. There is an error because we are not including the effects of spin, but this is small. Again, the type of noise makes little difference. This is Figure 15 of Berry et al. (2015).
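For convenience, the fit quoted in the caption above can be turned into a little helper for estimating the expected 90% sky area at a given loudness (just evaluating the fit, not a new result):

```python
import math

def sky_area_90(snr):
    """Approximate 90% credible sky area (deg^2) from the Berry et al. (2015) fit."""
    return 10.0 ** (-2.0 * math.log10(snr) + 5.06)

print(f"{sky_area_90(12):.0f} deg^2 at SNR 12")  # roughly 800 deg^2
print(f"{sky_area_90(30):.0f} deg^2 at SNR 30")  # louder signals: smaller areas
```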

There’s still some work to be done before O1, as we need to finish up the analysis with waveforms that include spin. In the meantime, our results are all available online for anyone to play with.

arXiv: 1411.6934 [astro-ph.HE]
Journal: Astrophysical Journal; 804(2):114(24); 2015
Data release: The First Two Years of Electromagnetic Follow-Up with Advanced LIGO and Virgo
Favourite colour: Blue. No, yellow…

Notes

The colour of noise: Noise is called white if it doesn’t have any frequency dependence. We made ours by taking some noise with initial LIGO’s frequency dependence (coloured noise), removing the frequency dependence (making it white), and then adding in the frequency dependence of Advanced LIGO (recolouring it).
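A bare-bones sketch of that recipe in the frequency domain (the amplitude spectral densities here are placeholder callables, not real detector noise curves; the real procedure is more careful, for example about behaviour at very low frequencies):

```python
import numpy as np

def recolour(strain, sample_rate, asd_old, asd_new):
    """Whiten a noise time series with its original ASD, then apply a new one.

    strain: time series; asd_old, asd_new: callables returning the amplitude
    spectral density at the given frequencies (placeholders, not real curves).
    """
    freqs = np.fft.rfftfreq(len(strain), d=1.0 / sample_rate)
    spectrum = np.fft.rfft(strain)
    ratio = asd_new(freqs) / asd_old(freqs)  # divide out old colour, apply new
    return np.fft.irfft(spectrum * ratio, n=len(strain))
```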

The chirp mass: Gravitational waves from a binary system depend upon the masses of the components, we’ll call these m_1 and m_2. The chirp mass is a combination of these that we can measure really well, as it determines the most significant parts of the shape of the gravitational wave. It’s given by

\displaystyle \mathcal{M} = \frac{m_1^{3/5} m_2^{3/5}}{(m_1 + m_2)^{1/5}}.

We get lots of good information on the chirp mass; unfortunately, this isn’t too useful for turning back into the individual masses. For that we need extra information, for example the mass ratio m_2/m_1. We can get this from less dominant parts of the waveform, but it’s not typically measured as precisely as the chirp mass, so we’re often left with big uncertainties.
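As a worked example of why the chirp mass alone doesn’t fix the individual masses, here is the formula above and its inversion given a mass ratio (a straightforward rearrangement, in any consistent mass units):

```python
def chirp_mass(m1, m2):
    """Chirp mass from the component masses."""
    return (m1 * m2) ** 0.6 / (m1 + m2) ** 0.2

def component_masses(mc, q):
    """Invert (chirp mass, mass ratio q = m2/m1 <= 1) to component masses."""
    m1 = mc * (1.0 + q) ** 0.2 / q ** 0.6
    return m1, q * m1

print(f"{chirp_mass(1.4, 1.4):.3f}")  # ~1.219: a typical binary neutron star
print(component_masses(1.219, 1.0))   # recovers roughly (1.4, 1.4)
```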