Beginning in mid-February 2008, the 1997-2007 online version of the Science Watch® newsletter, ESI-Topics.com, and in-cites.com, will all be featured together on the redesigned ScienceWatch.com. All previous content from the three sites will be permanently archived, and remain accessible from any existing bookmarks to the archived pages. No new content will be added to this site. Updates and new content (updated biweekly) are available at ScienceWatch.com now.
The Thomson Corporation inin-cites logoites
ScientistsPapersInstitutionsJournalsCountriesH O M ERSS feeds


S E A R C H
incites



PAPERS

Scientists
Papers
Institutions
Journals
Countries
 

The Top 10...
Analysis of...
Site Map by Fields
Overview Menu of all Interviews
Podcasts
Hot Papers published within the last 2 years
Current Classics
SCI-BYTES - What's New in Research
What's New in Research

in-cites, September 2005
 http://www.in-cites.com/papers/PeterGreen.html

Papers

             
An interview with:
Professor Peter Green
           

In the interview below, Professor Peter Green discusses his paper, "Reversible jump Markov chain Monte Carlo computation and Bayesian model determination," (Biometrika 82[4]: 711-732, December 1995). This paper has been cited 490 times to date and is the fourth most-cited paper in the field of Mathematics in the ISI Essential Science Indicators Web product. Sixteen of Professor Green’s papers in this field have a total of 823 citations to date. Professor Green is a member of the Statistics Group in the Department of Mathematics at the University of Bristol, United Kingdom.

  Why, in your view, is your paper highly cited?

The paper introduces Reversible jump Markov chain Monte Carlo, a new simulation-based methodology for fitting statistical models that have variable-dimension parameters. It caught on simply because people found this useful in their statistical modelling work, and indeed even now it must be the most widely used method for treating these problems. The paper appeared in 1995, just the right time to make an impact, since the extraordinary rise of interest in computational Bayesian methods that started in the late ‘80s had by that time reached just about every area of application of statistical analysis, and researchers were starting to tackle more challenging problems. Curiously, the problems that Reversible jump solves were probably not very widely regarded as interesting before the paper appeared, but the example applications in the paper, and early applications of the ideas by other authors, seem to have generated broader interest in these problems and showed people that solutions were possible, so the topic became self-reinforcing. Not very much of this was foreseen when I wrote the paper!

  What are the circumstances which led you to your work?


Curiously, the problems that Reversible jump solves were probably not very widely regarded as interesting before the paper appeared, but the example applications in the paper, and early applications of the ideas by other authors, seem to have generated broader interest in these problems and showed people that solutions were possible, so the topic became self-reinforcing.”

In October 1993, Ulf Grenander and Michael Miller presented a paper on a computer vision problem at a Royal Statistical Society discussion meeting in London. Papers for such meetings are available a week or two in advance so that others can prepare contributions for the discussion, which are eventually published along with the authors’ rejoinder, and the paper itself. I was studying the paper the night before the meeting, and was particularly intrigued by the computational algorithm they developed, involving stochastic simulation, for inference about objects in microscopy images. It occurred to me both that (1) this kind of algorithm could equally well address a huge class of inference problems of which theirs was just one special example, and (2) the algorithm itself could be made both more general, and easier to use and understand. I presented these points in a five-minute slot in the discussion (well, I probably ran over time by a minute or two).

It didn’t make a very big impact, but I worked on the idea over the next month or two, verified the mathematical details, attempted to codify the method into a recipe that was easier to use, and implemented three fairly substantial examples – to multiple change-point analysis for point processes, to image segmentation, and to a partition problem for binomial data. I thought these would help convince a broad range of readers that the ideas would be relevant to them. This package formed the paper that I submitted to Biometrika, which appeared in 1995. By the time the paper had appeared, however, the work had already being quite widely taken up. The idea was propagated both through preprint servers, and through conference and workshop talks; there was a lot of workshop activity in the general area of stochastic systems in Europe at the time, thanks to the European Science Foundation funded network on Highly Structured Stochastic Systems. It was a very exciting time for research in this area, and I was lucky that my paper was ready at exactly the right time to get noticed.

  Would you describe the significance of this work for your field?

Many problems in statistical inference, in all sorts of field of application, have the character that "the number of things you don’t know is one of the things you don’t know." In microscopy on cell tissue, you want to count the number of cells and measure the properties of each. In the analysis of genetic data on quantitative trait loci you want to estimate the number of loci and the quantitative effect at each. In multiple regression you want to select relevant variables and estimate effects for the selected variables. Problems like this are ubiquitous. I was interested in how you could make simultaneous inference about all unknowns in such problems, given noisy data. This is a bit of a challenge technically if you want a statistically rigorous solution, since most statistical methods deal with vectors of unknowns—parameters—of fixed dimension. The philosophically appealing way to do this inference is through fully probabilistic, or Bayesian, analysis. Bayes theorem tells you the theoretical solution, but you need to compute it in practice.

Reversible jump Markov chain Monte Carlo is a class of methods for doing that, which is feasible even in very complex models, and which is not too cumbersome to set up and use. So the immediate significance of the work was that people could now solve these problems. The interesting thing is that secondarily but ultimately more significantly, people started noticing that these "variable dimension" problems were all around them, and started thinking that estimating all the unknowns simultaneously, and doing other tricks like Bayesian model averaging, might be good things to do. The methodology embraces both model-choice situations, where several competing explanations for the data are being entertained and the analysis needs to simultaneously choose and fit a model, and settings where there is really a single model which has a variable dimension parameter—for example, a series representation of a function where the number of terms is not fixed.

  Where has this research gone since the publication of your paper?

Most of the citations of the paper are by researchers who are implementing the method to solve their own inference problems—the citations appear in over 160 different journals, in many different fields. They will often have tailored details of the algorithm to their specific situation, and so particular idioms and rules of thumb have emerged over time. Almost all of the papers citing mine that do not deal with specific applications introduce novel generic statistical methodology, for example, in model selection or nonparametric regression, building on Reversible jump. Only a very few authors have really developed the idea itself and made significant additions to the simulation methodology. The most significant papers to mention here are by Tierney (Ann. Appl. Probab., 1998), Godsill (J. Comput. Graph. Stat., 2001), by Mira and myself (Biometrika, 2001), by Brooks, Giudici, and Roberts (J. Roy. Statist. Soc. B, 2003), and by Cappé, Robert, and Rydén (J. Roy. Statist. Soc. B, 2003).

  Where do you see it going 10 years from now?

On the practical side, the methods are not yet easy enough to use. You can make mistakes setting them up, and even when you do get it right, the methods usually need to be tuned, and that can be laborious. So there would be tremendous benefit if the process of implementation could be automated, and I hope we see work in this direction. It has started; for example, my Ph.D. student David Hastie has devised a very nice adaptive/automatic approach based on normal mixtures.

More fundamentally, there are a host of challenging theoretical questions about how to summarize inference in complex variable-dimension models: Reversible jump helps you compute what you want, but can’t tell you what to compute. I hope my work will assist researchers in testing out in practical contexts their developing ideas about these important issues.End of interview

Professor Peter J. Green, FRS
Department of Mathematics
University of Bristol
Bristol, UK

in-cites, September 2005
 http://www.in-cites.com/papers/PeterGreen.html


ScienceWatch.com - Tracking Trends and Perfomance in Basic Research
Go to the new ScienceWatch.com

Home | Search | Disclaimer | Terms of Use | Privacy Policy | Copyright
Contact Webmaster with questions/comments |
(c) 2008 The Thomson Corporation.