|

 |
 |
An
essay by:
Dr. David R. Nelson |
 |

 |
n
this in-cites essay, Dr. David R. Nelson discusses the history
of cytochrome P450 nomenclature and his role in it. Dr. Nelson
is the lead author of the paper, "P450 superfamily:
update on new sequences, gene mapping, accession numbers and
nomenclature," (Pharmacogenetics 6[1]: 1-42 February
1996), which ranks among the top 5 papers in the field of
Pharmacology & Toxicology in the ISI
Essential Science Indicators
Web product, with 1,004 citations to date. Dr. Nelson is an
Associate Professor in the Department of Molecular Sciences at
the University of Tennessee.
|
Cytochrome P450
research is a field that has benefited by having a systematic
standardized nomenclature. The need for such a system only appeared
gradually, since early work in the ‘60s debated if there was more
than one P450. The purification of multiple forms from rat and rabbit
created a de facto nomenclature started by the labs that were
writing the papers on these forms. The tendency was to make the forms
either a, b, c or 1, 2, 3 with some prefix to identify the source or
inducer, like RLM for rat liver microsomes or PB1 for Phenobarbital
form 1. This quickly became unwieldy as every lab had their own naming
convention. The ‘80s saw cloning of many P450s so the problem got
worse. The visionary who saw where this was leading and proposed a
solution was Dan Nebert. Dan, along with Frank Gonzalez, put together
the first P450 nomenclature in a chapter for Annual Reviews of Biochemistry
in 19871. An official nomenclature proposal was published
that same year with a large number of influential P450 researchers as
co-authors2. This system has been revised and expanded four
more times with the last being the 1996 Pharmacogenetics paper.
I remember I had to work on the 1993 update during a vacation to
Washington DC. While my wife Susan took a few naps, I was working on
the tables. We had to make a deadline and she was a good sport about
it. After 1996 the size of the nomenclature posed a problem for
publication, so the current version resides on my Mac as a web server
(http://drnelson.utmem.edu/CytochromeP450.html). This server gets
1,800-2,500 hits a day or about 800,000 per year. The server and the
nomenclature are a service to the P450 community and the broader
scientific community.
The nomenclature has evolved over time. The first
edition used roman numerals for family names, and that was dropped
quickly. Newer developments are requiring adequate solutions, such as
how do alternative splicing variants get named? We have not answered
that one yet. The definition of families and subfamilies has shifted
toward lower cutoffs for inclusion. A more realistic view today is
that a cutoff window is better than an absolute value like the
original 40% identity or higher for family inclusion. More problematic
is the merging or blurring of subfamilies that used to be distinct.
The CYP4 family is a prime example. It is now so full of subfamilies
that they are hard to keep from overlapping. This makes naming new
members very difficult.
My history in this area goes back to my first
post-doc with Henry Strobel at the University of Texas Medical School
in Houston. I joined Henry's lab in 1985 when the first P450 sequences
were just being published. I remember writing these sequences out by
hand on graph paper to make alignments of about a dozen sequences.
These were color coded with highlighters and posted on the wall in the
hall outside the lab. They took up about eight feet of wall. When the
alignment reached 34 sequences I sought some help from the Center for
Demographic and Population Genetics at UT. Masatoshi Nei, who was
chairman there, introduced me to Joel Claiborne "Clay"
Stephens who gave me a FORTRAN program for doing UPGMA trees. I
converted it to BASIC to run on an IBM PC and made my first P450 tree.
That was the basis for my first P450 publication "Evolution of
Cytochrome P-450 Proteins"3. That paper prompted Dan
Nebert to ask me to join the nomenclature committee, and I have been
doing P450 naming ever since.
Where will this lead? The immediate goal is to find
every member of the P450 superfamily in the completed genome sequences
from as many species as possible. The ability to point to complete
sets of genes is powerful. It defines history by saying we know
absolutely what is here now in all these different lineages, so how
did we get here? That is the more interesting question. I am now
caught up with the genome projects on Dictyostelium (45 P450
genes and counting), Neurospora crassa (38 P450s), Phanerochaete
chrysosporium (white rot fungus, about 150 P450 genes), Fugu
and Tetraodon (pufferfishes), rice (about 290 P450 contigs) and
soon Ciona intestinalis (a tunicate). Human, fly, worm, and Arabidopsis
are already done, with mouse to follow soon, so the next phase will be
comparative genomics of all these P450s. I am optimistic that the
evolutionary holes in the data set will be filled. It would be grand
to have an echinoderm set and a sponge, as well as many protist
genomes, especially a Stramenopile and a non-parasitic alveolate (like
Tetrahymena). It was a bit of a shock this summer when the Plasmodium
falciparum genome was searchable and I found no P450s in this
aerobic parasite, not even a CYP51.
I would like to praise all the genome sequencers
who have worked so hard to fill gaps in genomes and put the data out
in a timely fashion, sometimes daily. I use it every day and I
appreciate it being there to search, even if it is not at Genbank, but
on private genome project servers. One of my biggest frustrations is
knowing that genomes are done already for rice and mouse and these are
not publicly available. I have signed private legal agreements to see
some of this (rice) data, but I doubt the utility of holding this back
and imposing restrictions. It will all come out in the end, so why
hold it hostage now? Fortunately, there is no shortage of
blast-searching to be done at NCBI and elsewhere. I wish they all had
faster computers.
David R. Nelson, Ph.D.
Department of Molecular Sciences
University of Tennessee
Memphis, TN, USA
REFERENCES:
1) Nebert, D.W. and
Gonzalez, FJ. P450 genes. Structure, evolution and regulation.
Annual Reviews of Biochemistry 56, 945-993 (1987)
2) Nebert, D.W.,
Adesnik, M., Coon, M.J., Estabrook, R.W., Gonzalez, F.J., Guengerich,
F.P., Gunsalus, I.C., Johnson, E.F., Kempewr, B., Levin, W.,
Phillips, I.R., Sato, R. and Waterman, M.R. The P450 gene
superfamily: Recommended nomenclature. DNA 6, 1-11 (1987)
3) Nelson, D.R. and
Strobel, H.W. Evolution of Cytochrome P-450 Proteins Molec. Biol.
Evol. 6, 572-593 (1987)
|
|