Gene Duplication Data

Michael Lynch
John S. Conery
Computational Science Institute
University of Oregon

This web page contains information related to our paper on the evolution of duplicate genes [1].

We analyzed the coding sequences of nine species, using BLAST to locate pairs of similar sequences and then subjecting each pair to further analysis. The process is described in more detail in a companion paper [2], and the software we developed is available from another web page at this site [3].

  1. Lynch, M. and Conery, J.S. The evolutionary fate and consequences of duplicate genes.
    Science 290:1151-1155 (Nov 10, 2000).

  2. Conery, J.S. and Lynch, M. Nucleotide substitutions and the evolution of duplicate genes.
    Pacific Symposium on Biocomputing 6:167-178 (2001).

  3. Software for Analyzing Nucleotide Substitutions. [http://www.csi.uoregon.edu/projects/genetics/ntdiffs].

Data Files

The files listed below are all plain text files. See the file format page for details of the data format.
Data sets containing physical map locations for each gene:

Caenorhabditis elegans (nematode)
Drosophila melanogaster (fruit fly)
Saccharomyces cerevisiae (yeast)

Data sets without physical map locations:

Arabidopsis thaliana (thale cress)
Danio rerio (zebrafish)
Gallus gallus (chicken)
Homo sapiens (human)
Mus musculus (mouse)
Oryza sativa (rice)

Supplementary Figure

A figure that is a supplement to the Science article can be viewed by selecting one of the links below. The figure is a dot-plot of the number of substitutions per silent site vs. the physical distance between members of a duplicate pair on the same chromosome.


  accesses since November 11, 2000.