Nervenet.org

Note to the Reader Quantitative Neuroanatomy: A Picture is Worth a Thousand Words, but a Number is Worth a Thousand Pictures. Organizers: John H. Morrison and Patrick R. Hof Society for Neuroscience Education Committee Saturday, November 7, 1998 (2:30–3:00 pm) Westin Bonaventure Hotel and Suites, Los Angeles, CA pp. 66–78.

Neuroscience Meets Quantitative Genetics: Using Morphometric Data to Map Genes that Modulate CNS Architecture

Robert W. Williams
Center for Neuroscience, Department of Anatomy and Neurobiology, University of Tennessee, Memphis, Tennessee 38163

Email questions and comments to rwilliam@nb.utmem.edu

Contents

Prelude to Mapping
Mapping QTLs that Modulate CNS Traits
PCR protocols for mapping microsatellites
Types of Genetic Crosses used to Map QTLs
The Highs and Lows of QTL Mapping
References

Introduction

Quantitative methods now pervade most aspects of neuroanatomical research, whether the focus is on dendritic spines or the topography of human neocortex. Sophisticated techniques to study cells in the CNS have gained in importance as the field of neuroanatomy has matured beyond qualitative analysis. Efficient methods to quantify cells in thick or thin sections, particularly direct three-dimensional counting (Williams & Rakic, 1988) and several adaptations of the disector (Gundersen et al., 1988), make it easier to obtain unbiased estimate of neuroanatomical traits. As a practical matter, the most important advance in quantitative techniques has been the rapid hybridization of microscopes, video systems, fast microcomputers, and powerful application programs such as NIH Image (http://rsb.info.nih.gov/nih-image/Default.html). These advances have given neuroscience a major boost, and investigators can now use more trustworthy quantitative techniques to study large numbers of cases.

The promise of quantitative genetics. This tutorial illustrates how to exploit quantitative neuroanatomical datasets to map and positively identify (clone) genes that generate the astonishing, but nonetheless normal, variation in CNS architecture. The types of genes that I am referring to are called quantitative trait loci or QTLs, and these QTLs are responsible for much of the inheritable variation among individuals within a species. For example, the volume and number of cells in visual cortex of humans and other primates vary nearly 3-fold (Gilissen & Zilles, 1996; Suner & Rakic, 1996), and most of this remarkable variation is generated by sets of still completely unknown neurogenic QTLs. You don't need expertise in molecular biology to map these genes. In the near future it should even be possible to clone QTLs without running gels. A solid background in statistics and expertise in phenotyping is more helpful at present than expertise in molecular biology. What you must have to pursue complex trait analysis to study the nervous system is the ability, patience, and resources to quantify large numbers of cases.

The QTL methods I describe in this paper have become available only in the last few years (Tanksley, 1993; Lander & Schork, 1994, Kearsey & Pooni, 1996) and their applications in neuroscience are still largely restricted to mice and human populations. These methods depend on the polymerase chain reaction (PCR), high-density genetic maps, and sophisticated statistics programs. The application of these methods in the next decades will revolutionize our understanding of normal genetic mechanisms controlling CNS development, susceptibility to disease, and even CNS evolution (Williams 2000; Greenspan 2001). Progress in QTL mapping is just as relentless as in any other branch of genetics, and the application of new methods will soon allow reseachers to achieve extremely high precision QTL mapping—precision of better than 0.2 cM—with relatively modest total phenotyping requirements and no genotyping at all within the next ten years (cf., Williams et al., 2001). The terrific feature of these new genetic tools is that specific categories of genes can be targeted for analysis using a forward genetic approach. A forward genetic approach starts with well-defined phenotypes and moves toward single genes that contribute to those phenotypes (Takahashi et al., 1994). It is therefore an ideal approach for neuroscientists, who typically begin with specific problems and questions about specific CNS traits. Forward genetics is not fishing; instead it is highly directed, like harvesting fruit from a chosen tree in an orchard. The genes discovered using QTL methods will usually be key controllers in molecular pathways that normally influence CNS traits.

This chapter summarizes key principles and methods involved in a quantitative genetic dissection of the mouse CNS. After reading this chapter you may be interested in several papers that provide concrete examples of how and why we are using these methods to map genes that control variation in neuron number. A set of closely related papers on this topic (covering olftactory bulb, striatum, hippocampus, and cerebellum) are available without restriction on this web site and at www.jneurosci.org/cgi/content/full/16/22/7193, www.jneurosci.org/cgi/content/full/18/1/138, and www.jneurosci.org/cgi/content/full/16/22/7193. A more general overview of the genetics of brain weight is available at http://www.nervenet.org/papers/BrainRev99.html.

What are QTLs? Quantitative trait loci are normal genes in every sense of the word, but they are referred to using this somewhat unwieldy term to highlight the fact that variant forms—or alleles—of QTLs have relatively subtle quantitative effects on phenotypes (Lynch & Walsh, 1998). In studies of human populations in clinical settings, QTLs are also referred to as susceptibility genes, because humans who carry certain alleles are at greater risk of developing disease. QTLs are often contrasted with Mendelian loci that have pronounced, and usually discontinuous, qualitative effects on phenotypes, but there is actually no sharp demarcation between quantitative and qualitative traits. QTLs that have particularly large effects that verge on producing Mendelian segregation patterns (e.g., 1:2:1) are major-factor or major-effect QTLs. Their large effects make them the easiest QTLs to map. Conversely, allelic variants at Mendelian loci can have graded effects, and given a particular population or environment, they are also QTLs. The effect size of a QTL is not in any way a measure of the importance of a gene in a biological process or network; in fact some of the most important genes may have only minor sequence variants.

In some instances a QTL may encompass two or more polymorphic genes that are close to each other on a chromosome. "Close" in this sense means that the two neighboring genes that modulate a trait have not been dissociated sufficiently by recombination events. The collective effect of such linked genes may be quite striking, and some effort may be required to dissect the individual contributions of two of more QTLs apart. There are some statistical short-cuts to help dissociate two polymorphic genes that contribute to a "single" composite QTL, but the ultimate solution is simply to use high resolution mapping panels (that is to say, groups of mice that collectively harbor many recombination events) to resolve one composite QTL into two or more constituents. An apt analogy is the differences of image resolution achieved by light and electron microscopes—the higher the frequency of the light source the better the resolution. In gene mapping, the greater the number of recombination events, the better the resolution. The ultimate goal is of course to resolve the effects of the single genes that underlie QTLs; to turn QTLs into QTGs (quantitative trait genes) or even QTNs (quantitative trait nucleotides).

Prelude to mapping

Four conditions. Methods used to map QTLs affecting the nervous system can be applied widely, with the following provisions:

The CNS trait must be variable. For example, to discover what genes control variation in brain weight or in numbers of Purkinje cells in mice (Fig. 1), we would first need to show that there are significant differences among inbred strains.

This variation must be heritable. Estimates of heritability should usually be above 10% to begin mapping QTLs. Fortunately, there are methods to boost the heritability of genetically controlled traits to very high values using certain types of mapping resources,; partciularly recombinant inbred strains.

Methods used to phenotype CNS or behavioral traits must be efficient. It will usually be necessary to phenotype several hundred cases to map one or more QTLs.

There must be a well-established linkage map for the species or genus that is to be the focus of research. This currently restricts analysis to laboratory mice and rats, and a half-wild outbred mammalian species (Homo sapiens).

Fig. 1. Variation in brain weight between two inbred strains and several generations of progeny can be used to estimate the minimum number of genes that affect brain weight. Brain weights of two inbred strains are shown to the far left. The upper left box is labeled BALB/cJ, and the lower left box is labeled CAST/Ei. These are names of two very different strains of mice. Each dot represents the brain weight of an individual mouse; the short horizontal lines indicate group means; the faint vertical bars within each box indicate the standard deviation; and the horizontal line at 445 mg indicates the mid-parental value (average of BALB/cJ and CAST/Ei). Brain weights of genetically identical BALB/cJ mice vary from a low of 445 mg to a high of 580 mg. Brain weights of the much smaller CAST/Ei mice range from 320 mg to 415 mg. When BALB/cJ females are mated to CAST/Ei males, the F1 progeny have brain weights between 450 and 560 mg. This does not differ greatly from the weights of the BALB/cJ parental strain and indicates that BALB/cJ alleles are likely to be dominant over CAST/Ei alleles (maternal effects may also be at work). F2 mice are generated by crossing F1 males and females. Brain weights of these F2 animals have a very wide range from 350 mg to 550 mg. The increase is because these F2 mice are genetically heterogeneous. Note that all F2 animals have genetically identical F1 mothers, so the variability is not from maternal effects. The F1 animals can also be crossed back to the parental strains, giving rise to the B₁ progeny shown to the right. The equation at the bottom of the figure is the Wright-Castle equation for estimating the minimum numbers of “effective factors” (genes) that contribute to the genetic variance in a trait. Delta P is the difference between parental strain means. V_F2 and V_F1 are the variances of the F2 and F1 values. For this particular set of data, we estimate that no fewer than 4 polymorphic genes account for the increases variance in the F2 relative to the isogenic F1. It is, of course, highly likely that far more than four genes contribute to the genetic control of brain weight. Click on the image to view a higher resolution copy in a separate window.

Getting a handle on the phenotype

"The importance of phenotype definition and delineation cannot be overemphasized. For what we refer to now as "gene mapping" is actually "phenotype mapping," with identificaiton of new gene loci merely a by-product in our current age of ignorance. After the year 2005, when most functional loci in the human genome will have been mapped and sequenced, there will still remain huge gaps in our knowledge about genotype-phenotype correlations. So as long as there is interest in identifying alleles that cause human phenotypes, there will be continuing, long-term interest in phenotype mapping." (from A. S. Aylsworth, 1998)

Sources of variance. Variation in the value of a trait or phenotype is often thought of as an experimental annoyance. However, when the aim is to map QTLs, the greater the genetic variance, the better the prospects of success (Figure 1). The variance that is initially measured in a quantitative neuroanatomical study (the square of the standard deviation) has many sources (Falconer & Mackay, 1996; Lynch & Walsh, 1998, chapters 5–7). For this reason, one of the first steps in a QTL analysis is to partition this variance into its major components. Three especially important components are (1) technical and sampling variance (Vt), (2) environmental variance (Ve), and (3) additive genetic variance (Va). There are several ways to separate them (Williams et al., 1996).

Technical error. Technical error can be estimated by repeated analysis of a subset of cases. In one recent study, we used a quantitative electron microscopic method to count retinal ganglion cells in mice. To estimate the reliability of this method, we simply recounted cases (Williams et al., 1996). Most cases were rephotographed and recounted by different lab members to estimate inter-observer reliability. We discovered that technical errors in our study were as important a source of variance as all environmental sources of variance combined. In this study to get a reliable strain average, we had to increase the number of cases to about 6 to 8 animals.

Box 1: The challenge of keeping records
   Keeping records on sources of animals, their dates of birth, litter size, sex, weight, and the many CNS traits that interest us can be difficult. Difficulties are compounded when reliance is placed on records scattered in different sets of notebooks kept by different collaborators, technicians, and students. Add a host of Excel spreadsheets and you have a recipe for data chaos. The efficiency and accuracy of record keeping become major issues when several investigators are collaborating and collecting data from as many as 100 animals per week. The solutionis to maintain these types of records in a relational database.
   We use an integrated relational database (FileMaker Pro: www.filemaker.com/) to keep track of almost all of these types of data. Data are entered directly into one of several interconnected FileMaker files, all of which reside on a network server (a Mac G4), backed up daily to a second hard drive, and archived on DVD-RAM at weekly intervals. Networked computers (primarily iMacs and iBooks) on small carts are kept in the mouse colony, near the hood, dissection table, and by the electronic balances. Entering data directly into a consolidated database greatly simplifies record keeping and minimizes data loss and data transcription errors.
   By typing in a case identification number we have immediate access to information in any of the related databases. These data can be exported easily for analysis in DataDesk, Excel, and Map Manager QTX.
   Converting our laboratory from notebooks and a set of disjointed Excel files to FileMaker took several months. Some of the more useful databases are available at www.nervenet.org.

Environmental and non-genetic developmental variance can be estimated by phenotyping related individuals. For example, in (Figure A) the variability in brain weight among BALB/cJ individuals is entirely due to environmental differences and technical error. Once technical errors have been factored out, the variance among inbred mice within a strain is almost purely environmental and is a good estimate of Ve. Variation in age, sex, body weight, litter size, age of mother and parity of her litters, exposure to pathogens, temperature and humidity variation, the sometimes amazing differences from bag to bag in non-synthetic mouse chows from major vendors (Michael Tordoff of the Monell Chemical Senses Center finds 10-fold variation in calcium content between bags), seasonal fluctuation in water quality, and a host of other environmental factors can often be minimized or carefully controlled in studies using mice. Failure to control for these factors can and will give rise to variation within colonies and between laboratories. For studies of robust Mendelian and quantitative traits, these sources of variation may not matter much, but if you are trying to track down a host of genes that affect the kinetics of neurogenesis in the hippocampus or of a sensitive behavioral trait then go to the effort to control as much as possible. For example, if practical feed animals a more expensive but much less variable synthetic chow and provide your colony with a consistent source of deionized water. (Box 1: The challenge of keeping records provides advice on building and maintaining a coherent and accurate laboratory database.) What we are left with after accounting for technical and environmental sources of variance is a large amount of unexplained or 'residual' variance. This is the genetic variance that we will be trying to split apart into a neatly defined set of precisely mapped QTLs.

Comparing strains. Before mapping QTLs, we need to know that the trait is variable and that the variation is heritable. An easy way to go about this is to phenotype 5–10 individuals from each of 10 or more common inbred strains of mice, all raised in the same colony under closely matched conditions. It is a good idea to gather cases from several different litters. If all mice of a particular strain come from a single litter, then the variance between strains could have as much to do with the health and experience of the mother as with her genotype.

In our electron microscopic study of ganglion cells, we found that 70–80% of the variance was heritable. Heritability, defined broadly to include all sources of genetic variance, was estimated by comparing levels of variance within and among 17 inbred strains. The average variance within a strain, Vw, was 13.7 (variance units are x10⁶cells2). When the technical error, Vt, was subtracted, the average environmental variance, Ve, was reduced to 4.65. In comparison, the additive genetic variance, Va, computed across strains was 27.9. From these values we computed the broad heritability, h2, using the equation

h² = 0.5Va / (0.5Va+Ve)     (Hegmann & Possidente, 1981).

Substituting values, heritability was estimated to be ~0.75—a relatively high value, but one that will probably be typical of many quantitative neuroanatomical traits (Wimer & Wimer, 1989).

Selecting strains. Which inbred strains or isogenic F1 hybrids are appropriate for a preliminary analysis of the heritability of a variable CNS trait in mice? My suggestions for an initial screen are in approximate order of priority:

C57BL/6J (the strains whose genome is being sequenced as part of a public NIH-supported effort; used to generate most major mapping resources)

DBA/2J (key strain in behavior genetics, used to make many recombinant inbred strains; this strain ihave been partly sequenced (1x coverage by Celera; a parental strain of the BXD series)

B6D2F1 (the F1 intercross of a female C57BL/6J and a male DBA/2J)

A/J (key strain used to make the AXB and BXA recombinant inbreds, an now consomic lines for QTL analysis; also sequenced 1X coverage by Celera)

B6AF1 (the F1 intercross of a female C57BL/6J and a male A/J)

BALB/cByJ (one of the parental strains of the CXB recombinant inbred strain set)

C3H/HeJ (one of the parental strains of the BXH recombinant inbred strain set)

129X1/SvJ (used to be known as 129/SvJ, widely used in generating transgenic and knockout lines, sequenced 1X by Celera)

B6129PF1 (an F1 intercross used extensively in transgenic mouse research)

BALB/cJ (key strain in behavior genetics, like A/J and BALB/cJ, this strains is an albino)

SM/J (parental strain of a very large RI set being made by Dr. James Cheverud, Wash U)

LG/J (parental strain of a very large RI set being made by Dr. Cheverud, Wash U)

AKR/J

SJL/J

CBA/J (one of the few strains with good hearing throughout life, key in auditory research)

FVB/NJ (key strain in transgenic research due to high fecundity and ease of injecting oocytes with DNA)

LP/J

PL/J

NZB/BINJ

NOD/LtJ (use with caution, this is a non-obese diabetic mouse strain

SWR/J (Swiss Webster inbred)

CAST/Ei (wild inbred strain, small and frisky, hard to breed)

SPRET/Ei (wild inbred strain, small and frisky, hard to breed)

PANCEVO/Ei (wild inbred strain, small and frisky, hard to breed, a colonial mound-building species from sourthern Eastern Europe and the Ukraine)

Note 1: As of January 2000, the genomes of four strains of mice have been partly sequenced (C57BL/6J, DBA/2J, A/J, and 129X1/SvJ). The collected sequence data should provide almost complete coverage of the mouse genome. Sequence data for three of these strains is still proprietary data belonging to Celera Genomics. You can keep up-to-date with the public efforts at http://www.ncbi.nlm.nih.gov/genome/seq/MmProgress.shtml (Pennisi 2000).

Note 2: If you need to determine the genetic difference between two strains, then visit http://pages.cidr.nhgri.nih.gov/mouse/mouse.html. The CIDR has genotypes at a set of 300 MIT markers for a growing number of strains. For a sophisticates comparison see Taylor (1972).

I have chosen these strains for four reasons: (1) most are readily available, (2) these strains differ greatly in both phenotypes and genotypes, (3) their genomes have been characterized extensively, and (4) many of these strains have been used to generate recombinant inbred strains that are proving to be especially useful to neurogeneticists. Some of these mice will not be suitable for certain studies–for example, C3H/HeJ, CBA/J, and SWR/J all carry the mutant rd1 allele of the Pdeb gene that causes photoreceptor degeneration, so none would not be appropriate for an analysis of retinal cell populations. For an excellent synopsis of strains of mice that are suitable for different types of neurogenetic/behavioral analysis see Crawley et al. (1997)

Level of genetic variation between strains. One key to successfully mapping QTLs is a high level of genetic variation among mice. Many strains listed above have now been genotyped ~6400 loci (Dietrich et al., 1994). It is therefore possible to determine at an early stage how feasible it will be to map QTLs using a cross between any two of these strains. For most pairs of strains, more than 30% of marker loci have reliable and easily scored differences (Fig. 1). In contrast, it would not be practical to map QTLs responsible for the heritable 100 mg difference in brain weight between BALB/cJ and BALB/cByJ because these strains are closely related, and just finding suitable marker loci would be a major undertaking. In this particular case a complementary analysis of differences in expression levels of genes or proteins might be extremely fruitful (Sandberg et al., (2000)).

Box 2: Resources for regression analysis
   Neuroanatomists are generally not familiar with regression analysis and its applications in quantitative neuroscience. Chapters 14–16 of Biometry provide a solid introduction to the topic. Genetics and Analysis of Quantitative Traits covers regression from a rigorous genetic perspective. Our group’s favorites are still The Cartoon Guide to Statistics and Data Reduction and Error Analysis for the Physical Sciences.
   There are numerous pitfalls in performing regression analysis. Mismeasured cases, runts, and hydrocephalic animals can ruin an analysis. It is essential to explore the data graphically, to handle or discard outliers, and to normalize distributions before plunging ahead. For advanced, yet practical, advice on applying regression analysis, see Data Analysis and Regression.
   The program DataDesk 6.0 is extremely well adapted for performing multiple regression analysis (linear and non-linear), as well as an armada of other statis-tical procedures. DataDesk allows sets of variables to be added or removed using a simple drag-and-drop interface. The program comes with an excellent and pragmatic statistics guide by Paul Velleman.
   The DataDesk web site at: http://www.lightlink.com/datadesk/ is a treat. Trial versions are available for Macintosh or Windows operating systems.

Sokal RR, Rohlf FJ (1981) Biometry. The Principle and Practice of Statistics in Biological Research, 2nd ed. WH Freeman, New York.

Lynch M, Walsh B (1998) Genetic Analysis of Quantitative Traits. Sinauer, Sunderland MA.

Mosteller F, Tukey JW (1977) Data Analysis and Regression. A Second Course in Statistics. Addison-Wesley, Reading MA.

Gonick L, Smith W (1993) The Cartoon Guide to Statistics. HarperCollins, New York.

Bevington PR, Robinson DK (1992) Data Reduction and Error Analysis for the Physical Sciences. 2nd ed. McGraw-Hill, New York.

Using multiple regression analysis to improve specificity of QTL mapping. Quantitative neuroanatomical methods can be used to generate unbiased estimates, but before these estimates can be employed to map genes, they will need to be fine-tuned using multiple regression techniques (Box 2: Resources for regression analysis). The reason is that a trait may be tightly correlated with other traits, and if care is not taken, it is possible to map the wrong QTLs or no QTLs at all. For example, assume that we are interested in mapping genes that specifically control normal variation in numbers of Purkinje cells in the cerebellum. If we count these neurons in a large sample of animals and use these raw numbers to map QTLs, we will almost certainly, and inadvertently, map genes that affect the entire cerebellum, the entire brain, or perhaps even the entire body. To sidestep this problem, and to map QTLs that have selective effects on Purkinje cells, we need to have data on body, brain, and cerebellar weight. The more data we have on what might be called “higher-order” phenotypes, the better.

In this particular example, we should perform a multiple regression analysis of Purkinje cell number against cerebellar weight, brain weight, body weight, sex, age, fixation quality, the identity of the investigator who carried out the dissection and counted the tissue, method of processing, and whatever other parameters we can demonstrate are statistically associated with our estimates of Purkinje cell number. If an individual parameter is not significant in this multiple regression, then it should be eliminated from the analysis. David Airey, Lu Lu, and I carried out an analysis like this, and after stripping away extraneous variance generated by many general factors we were able to map four QTLs that specifically control the size of the mouse cerebellum (Airey et al., 2001). The phenotype values used in this analysis were multiple regression residuals—the differences between expected and observed cerebellar weights. With this background on the genetics of cerebellar size, we are now poised to study particular cell populations within cerebellum. In almost any study of cell number in the nervous system, it will help to factor out global effects.

Mapping QTLs that modulate CNS traits

A QTL mapping study is in essence a search for statistically significant associations between variation in a quantitative trait such as Purkinje cell number and variation in genotypes at particular gene loci (Tanksley, 1993). If our aim is to map QTLs using a set of F2 intercross progeny, then we would compare the residuals computed by regression analysis to the genotypes that we have obtained at 70 to 100 marker loci (Fig. 1). To simplify the explanation of how we compare phenotypes and genotypes I will describe a hypothetical experiment and use this experiment to review some useful genetics. It will help those who are unfamiliar with the methods to spend a few minutes looking over Figure 3, which summarizes graphically, much of what follows.

The QTL mapping experiment starts with a morphometric analysis of Purkinje cells in two inbred strains of mice: C57BL/6 (B type; named for its black coat color) and DBA/2 (D type; named for its dilute brown coat color). These strains are fully inbred, meaning that alleles at all autosomal gene loci in mice of a given strain are identical. Pairs of chromosomes have precisely the same DNA sequence–imagine each chromosome as a string of 4000 Bs, with each B representing the B-type allele at each of 4000 genes on an average mouse chromosome.

Generating an F2 intercross. Assume that Purkinje cells in these two inbred strains have now been counted and that the majority of C57BL/6 mice have low counts (a residual -11,500 ± 2,700, as shown in Fig. 2), whereas the majority of DBA/2 mice have high counts (a residual of +12,300 ± 3,000, as shown in Fig. 2). There isn't much overlap between estimates, and our analysis demonstrates that heritability is high. We mate these mice, producing the first filial or F1 generation of progeny. Individual F1 mice are hybrids with BD genotypes, but these hybrids are also isogenic because each F1 mouse inherits one complete set of B chromosomes from its mother and one complete set of D chromosomes from its father. That is, all F1 offspring have precisely the same BD genotype at all autosomal loci. With the exception of sex differences, the quantitative variation among these F1 animals is purely environmental and technical (Ve + Vt).

We breed the F1 mice to generate an F2 generation. Half of the chromosomes packaged into gametes of the F1 mice are themselves hybrids, with long alternating stretches of B and D alleles generated during meiosis by reciprocal crossovers (recombinations) between aligned B and D chromatids. Furthermore, at any autosomal gene locus in the F2 generation, the ratio of BB, BD, and DD genotypes will be close to the expected Mendelian binomial ratio of 1:2:1. Now the total quantitative variation in cell number among F2 progeny is due both to environmental factors—the same environmental factors measured in parental strains and F1s—and to the random segregation of allelic variants at QTLs that affect Purkinje cell numbers. The total variance among F2 progeny—the variance we intend to assign to well-defined QTLs—is simply the total phenotypic variance, Vp, minus Ve + Vt.

Linkage and haplotypes. We do not know the genotype of a particular F2 animal at a particular gene locus unless we type that animal's DNA. However, we don't need to sequence or analyze each gene individually. We can infer genotypes of entire chromosomal regions quite reliably by typing only a few marker loci per chromosome. The reason this can be done is that during meiosis there is on average only one crossover per chromosome. In fact, the genetic length of a chromosome is defined by the frequency of recombination. A chromosome that is 50 centimorgans (cM) long experiences an average of one crossover event per meiosis, generating two recombinant chromosomes and two non-recombinant or parental-type chromosomes. (Note that in contrast to genetic length measured in cM units, the physical length of a chromosome is measured in bases or basepairs and because recombination frequencies are quite variable there is only a loose correspondence between physical and genetic lengths.) The closer two neighboring genes are to each other on a chromosome, the less likely it is that they will be separated by a recombination event (Tanksley, 1993). If genes are within 1 centimorgan (cM) of each other (about 2 million basepairs of DNA in mice but only 1 million basepairs in humans), they will be split apart only about 1 in 100 times, and as a result neighboring genes of particular types on a single chromosome (referred to as a haplotype) will tend to stay together from generation to generation. Geneticists refer to this association using the more imposing terms linkage disequilibrium or gametic phase disequilibrium. The greater the disequilibrium, the higher the probability that particular strings of genes or alleles present on a particular chromosome–the haplotype–will be inherited as a non-recombinant or parental-type unit in the F2 progeny.

The utility of linkage. This linkage between neighboring genes means that we do not need to analyze every gene in an individual F2 mouse to deduce the probable genotype at a particular locus. In fact, linkage disequilibrium is high enough in an F2 intercross that we can confidently infer the genotype at any of approximately 80,000 genes in the 3 billion basepair (bp) mouse genome by typing only 70 to 100 well-distributed marker loci that collectively sample less than 20,000 bp, or 0.006% of the genome. The closer a particular marker is to a QTL, the better the inference (Weiss and Terwilliger, 2000). A single marker locus will effectively sample a chromosomal interval of about 15 cM on either side. This region is called the marker's swept radius. As few as 3 to 4 markers will sweep an entire chromosome in an F2 intercross.

The first aim of QTL mapping is to find one or more marker loci, such as D5Mit294 shown in Figure 2, for which patterns of the three genotypes match those of phenotypes. We hope to find markers that are statistically associated and physically linked to QTLs that modulates

Fig. 2. Genotypes at a microsatellite locus on chromosome 5 (D5Mit294). The left lane is a DNA size standard, and the next two lanes are PCR samples from the parental strains B (higher band at 198 bp) and D (lowest band at 176 bp). Lanes 4 to 25 are the F2 samples. The bottom band—or bands, in the case of heterozygotes—define the genotype of each animal. The bands at the top of the figure are caused by DNA retained in the pipetting wells. Figure from work by G. Zhou (Zhou & Williams, 1997; see Detailed PCR protocols for mapping microsatellites.)

Purkinje cell numbers. Given that the parental strain, C57BL/6J, has fewer Purkinje cells, we expect that BB individuals will on average have fewer cells than DD individuals at marker loci linked to important QTLs. But for a trait controlled by as many as 10 or 20 QTLs we can also expect to find markers at which DD individuals have fewer cells. BD heterozygotes will typically have intermediate numbers of cells, but if either the B or D allele at a particular QTL is dominant, then heterozygotes as a group will have either high or low cell number.

In a QTL analysis it is important to realize that we are concerned with group means, not individuals. Some BB individuals may have more cells than BD or even DD individuals. In contrast to Mendelian traits, quantitative traits are controlled by numerous genes, and except in unusual circumstances, no single QTL is responsible for more than a relatively small fraction of the phenotypic variance. A major-effect locus may account for 20% of the genetic variance, but values less than 10% are far more common (Tanksley, 1993; Roff, 1997). Environmental and developmental perturbations will also generate exceptional phenotypes.

Interval mapping. If a QTL has a small effect or if a QTL has a large effect but is located far from flanking markers, then the statistical association between genotypes and phenotypes will be weak. In either case, we may fail to detect Purkinje cell QTLs (an error of omission, or Type II error). However, we often have data on markers that flank both sides of the QTL. Using genotype data at these flanking markers we can predict the most likely genotypes at any intermediate position between markers and compare these inferred genotypes with Purkinje cell numbers. This can greatly improve our ability to detect QTLs and it also allows us to statistically distinguish between nearby QTLs with small effects and more distant QTLs with large effects. The statistical method used to infer QTL genotypes and then compare them with actual phenotypes is called interval mapping (Lander & Botstein, 1989)

Figure 3. See the legend at the bottom of the figure.

Fig. 3. Summary of mapping QTLs using F2 intercross progeny. At the top of the figure I have illustrated pairs of homologous chromosomes from two inbred parental strains (C57BL/6J with thin black lines, DBA/2J with wider gray lines). The two chromosomes within each parent are identical and have identical alleles (B- or D-type) at each of the four marker loci, M1, M2, M3, and M4. A single QTL maps in the interval between M1 and M2. F1 hybrids are mated to produce a set of 500 F2 progeny. Some of the chromosomes in the F2s are recombinants and are made up of long segments of both parental strain chromosomes. The genotypes of the F2 at the four makers are shown at the bottom of the figure along with a simple scoring scheme based on numbers of B-type alleles each F2 animal has at each marker (also see Fig. 1). Gray regions throughout the figure contain data on phenotypes of the parental strains, the F1 hybrids, and the individual F2 cases. In the bottom right corner of the figure is a summary of correlations between genotype scores (0, 1, 2, or -1, 0, and +1) for the four F2 cases and their respective quantitative trait values (2.4, -1.9, -8.6, and 7.3). In this case the best correlation is to marker M1. As expected, the correlation is negative because B alleles are associated with lower values (compare the parental strains). Marker M4 has a reasonably high positive correlation, and it is possible that a second QTL in which a B allele is associated with a higher mean phenotype is located in this interval. With an n of only 4 even a correlation of 1.0 is insignificant. Unless a QTL has a very large effect it will usually be necessary to genotype and phenotype several hundred animals.

Genotyping microsatellite marker loci. We can determine the genotype of each F2 animal by measuring differences in the number of basepairs of DNA in highly variable repeat sequences called microsatellites (Fig. 2). Many thousands of these variable repeat sequences are scattered throughout the mammalian genome. Most microsatellites are in anonymous, non-coding stretches of DNA, but some map within introns, and more rarely, within exons. (This latter group can produce severe neurodegenerative diseases in humans, most notably spinocerebellar ataxia, Friedrich's ataxia, and Huntington's disease.)

A set of ~6700 highly polymorphic microsatellites called simple sequence length polymorphisms (SSLPs) have been mapped in mouse by Dietrich et al. (1994). When we say that microsatellites are polymorphic, we simply mean that different strains of mice—in our example, C57BL/6 and DBA/2—have alleles that differ in number of repeats. Fifty-two percent of all microsatellite loci in C57BL/6 and DBA/2 are polymorphic and differ by 2 bp or more (Dietrich et al, 1996). As illustrated in Figure 2, differences in the length of these DNA sequences are easy to detect and score by electrophoresing PCR products through agarose or polyacrylamide gels. The DNA is visualized using a fluorescent DNA–binding dye such as ethidium bromide.

Box 3: Programs for mapping QTLs
   Map Manager QTX is one of several programs that can be used to map QTLs. The program has a well-designed, almost intuitive, interface (Manly, 1993), and is available free at http://mapmgr.roswellpark.org/mmQT.html. Map Manager QTX is available for Macintosh and \Windows OS. It is one of the few programs that computes genetic maps for you; all you do is enter the genotypes for your cases. To take full advantage of the permutation test that is built into Map Manager QTX, install this program on a computer with a fast processor.
   Map Manager QTX is accompanied by an informative tutorial and manual (http://mcbio.med.buffalo.edu/mmrequest.html) that is well worth downloading, printing, and perusing. I also recom-mend highly a short review by Tanksley (1993) on QTL mapping strategies and their limitations.
   QGene at http://qgene.com links to a great QTL analysis program. You will need to create two text files for use with QGene. The easiest way to do this is to export the two files directly from Map Manager. QGene has a very flexible, fun, and powerful interface and is a perfect complement to Map Manager.
   The QTL Cafe at http://web.bham.ac.uk/g.g.seaton/ links to a Java applet that will help you carry out an online QTL analysis. You will need Netscape Navigator 4.05 or higher and you will need to have 3 data files before starting: a map file with information on the names and locations of markers, a file with genotypes, and a file with the case IDs and trait values.
   QTL Cartographer is a highly capable mapping program—one that may be particularly suitable for those with a background in UNIX and who are comfortable with advanced statistical analysis. The program is available for three operating systems at http://statgen.ncsu.edu/qtlcart/cartographer.html.
   MAPMAKER/QTL is a powerful program that implements a maximum likelihood method to map QTLs. This program and a tutorial are available from the Center for Genome Research at http://www-genome.wi.mit.edu.
   Numerous free or inexpensive programs for mapping QTLs are listed at http://www.stat.wisc.edu/biosci/linkage.html.

QTLs are defined by strong statistical associations. We are now ready to return to the question posed earlier: how do we compare variation in Purkinje cell number (or any other phenotype) with genotypes in the F2 generation? At any particular gene locus, each F2 animal will have BB, BD, or DD genotypes. In Figure 2, reading from left to right, the first 22 F2 genotypes, starting with lane 4, are HBHBD  HBBBH  HHDBH HHHBH  HH (H for heterozygotes). After all F2s have been genotyped at several markers on each chromosome, we need to evaluate the data and determine the likelihood that we have mapped one or more QTLs to particular chromosomal intervals. Information on several of the programs that are used for this purpose is provided in Box 3: Programs for mapping QTLs.

The initial mapping analysis will typically involve what is now referred to as simple interval mapping. Simple interval mapping assumes that we do not already know of any particular QTLs that influence variation in Purkinje cell number. However, if QTLs are successfully mapped in the first round of analysis (seeStatistical criteria), then we may advance to composite interval mapping.. This refinement is really nothing more than the extension of the multiple regression analysis we have already discussed, but instead of compensating for environmental effects or global variables like body size and brain weight, we compensate for the presence of one or more well defined QTLs.

The strength of association between variation in a phenotype and the F2 genotypes at marker loci can be assessed using a F statistic, the logarithm of the odds ratio (the LOD score), the likelihood ratio statistic (LRS), or a conventional probability of linkage. These statistics are interconvertable. LRS scores have a conventional chi-square distribution and are relatively easy to interpret. The LOD score is equal to the LRS divided by 4.61. Both statistics are ratios of probabilities (or likelihoods) that a QTL is, or is not, located within a tested chromosomal interval. But this really does not explain what these statistics are doing. Let me explain the idea using more familiar statistics: analysis of variance (ANOVA), Pearson product-moment correlations, and linear regression.

ANOVA. To perform a statistical analysis and to test the likelihood that there is a QTL linked to the locus D5Mit294 in Figure 2 above, we could simply divide cases into categories by genotype and perform a single-factor ANOVA. With two degrees of freedom among the three genotypes and a total of more than 100 progeny, we would need an F statistic above 3 to suspect that D5Mit294 might be linked to a QTL. In our example, we have been fortunate and the F statistic is 9.4, with an associated p of 0.00014. This would seem to be strong evidence that there is a QTL near this marker, but this statistic alone does not tell us much about differences among the genotypes.

Correlation and regression. To obtain additional information we could score genotypes by numbers of B alleles: BB = 2, BD = 1, and DD = 0 (computationally, it is sometimes preferable to score genotypes as -1, 0, and +1). It would then be possible to compute a correlation between phenotypes and genotypes to assess the strength of linkage. The correlation in this example is –0.28. The negative sign indicates that B alleles are typically associated with cases that have lower cell numbers. The amount of variance that this putative QTL generates can be estimated by squaring the correlation. In this example, r², the coefficient of determination, is 0.076. In other words, approximately 7.6% of the variance is associated with a QTL on chromosome (Chr) 5 near D5Mit294. We can also regress phenotype values against genotype values and determine whether the regression coefficient is significantly different from zero. In this case, the slope of the regression equation (–3700 cells per allele) is significantly different from zero. This analysis indicates that the substitution of a B allele for a D allele typically lowers the trait value in the F2 sample by 3700 Purkinje cells.

A series of these types of tests are carried out at many points along each chromosome generating a profile of the strength of the statistic as a function of position—a LOD score map. The region of the chromosome in which LOD scores are within 2 LOD of the peak defines a >95% confidence interval of the QTL's position (Darvasi, 1997a). In our imaginary Purkinje cell example, the LOD score reaches a peak of 4.2 between D5Mit294 and D5Mit346. The confidence interval is about 12 cM. Figure 4, is an example of a LOD score map in which a gene that specifically modulates brain weight has been discovered on the proximal half of Chr 11. The likelihood ratio statistic (4.6 times the LOD score) is shown on the y-axis. The linkage statistic reaches a peak of 22.8 at about 12 cM. The 2-LOD confidence interval for this QTL (Bsc1 for brain size control 1) extends from 10 to 20 cM.

Fig. 4. Interval mapping results for a QTL that controls brain weight in the mouse. The x-axis represents the full length of Chr 11, from the centromere at 0 cM to the left, to the telomere at about 72 cM to the right. The y-axis represents the strength of linkage of the brain weight phenotype data (BXD recombinant inbred strains) to different intervals on Chr 11. The brain weight data has a peak linkage with an LRS of 22.8 at about 12 cM. Fifteen genes or markers are labeled along the x-axis. Data are from Williams et al., (1997, in preparation).

In practice, QTL analysis is carried out along these lines but using statistical procedures that can assess more than just the statistical strength and position of QTLs (Lynch & Walsh, 1998; Liu, 1998). Several genetic models are usually tested to assess the ways in which alleles are likely to interact with each other (linear-additive interactions, non-linear dominance interactions, and combinations of both types). More sophisticated models may also explore whether QTLs on different chromosomes interact—again linearly or non-linearly. Non-linear interactions between loci is referred to as epistasis. It is premature to worry about epistasic interactions before any QTLs have been mapped, but it is worth stressing that a realistic understanding of genetic sources of variation in CNS structure and function will ultimately demand attention to complex interactions of networks of variant gene products.

Statistical criteria. What statistical criteria should be used to decide whether a QTL has been successfully mapped? Is the LOD score of 4.2 good enough? Lander and Schork (1994) deal with this issue at length and emphasize the need for stringent criteria. The main reason is that a genome-wide search of QTLs involves comparing phenotypes with many independent sets of genotypes. Remember that the gel shown in Figure 2 is just one of 50 or more gels of this type. This gel is used to test a single point in the genome (a point-wise test), but we actually test many points (or chromosomal intervals) and we are naturally much more interested in the genome-wide significance of our result. We have a classic multiple-comparisons problem, and a conventional point-wise criterion level of p = 0.05 is far too lenient. If we divide 0.05 by the 50 independent comparisons we are likely to make then we arrive at a safer estimate of the genome-wide criterion level we need to achieve with a single test. In other words, the point-wise p must to be less than 0.001 to be associated with an effective genome-wise p of less than 0.05. There has been a heated debate among quantitative geneticists regarding appropriate criteria, but there is general agreement that a QTL should not be claimed until the genome-wide significance of making a type I statistical error (an error of commission; declaring the presence of a QTL when there is really nothing there) is under 5% (Elston, 1998).

Permutation tests. In our own work we use a robust non-parametric method to determine the genome-wide 5% level for each trait that we map (Churchill & Doerge, 1994). The idea behind the permutation method is simple: we randomly reassign trait values to genotypes and then we treat these permuted datasets in the same way as the original data. We then see how often QTLs are "successfully" mapped with disordered data. If many of the permuted datasets produce LOD scores that are as good or better than that generated by the correctly ordered data, then we cannot place much, if any, confidence in our putative QTL. In contrast, if fewer than 1 in 100 of the permuted datasets reaches the level of correctly ordered data, then we can be confident that the probability of having made a type I error is under 0.01. In our studies, we typically run more than 10,000 permutations, and make a histogram of the single best scores from each permutation. Using these permutation histograms we can accurately estimate the strength of association between a trait and newly discovered QTLs (Fig. B). Returning to the Purkinje cell example one last time: the permutation analysis of cell counts reveals that the probability of obtaining a LOD score of 4.2 by chance is <0.008. We can now state with reasonable certainty that there is a QTL on proximal Chr 5 that controls Purkinje cell number. In a similar way, a permuation analysis for the brain weight dataset shown in Figure 4 demonstrates that a LOD score of 4.3 (or an LRS score of 19.7) is required to reach a genome-wide p at the 0.05 level.

Selective genotyping and phenotyping. The most extreme cases—those with highest and lowest trait values—are most informative for mapping QTLs (Taylor et al., 1994). These cases are more likely than others to be homozygous for alleles that increase or decrease trait values. It is therefore possible to improve the efficiency and power of detecting QTLs by initially genotyping only extreme cases. The choice of what fraction of cases to genotype depends on the relative cost of phenotyping and genotyping (Darvasi, 1997c). If phenotyping is relatively economical, one may be able to afford to generate large sets of progeny and initially genotype only 5% to 10% of cases from both tails of the distribution. In our laboratory the fraction of cases that we initially genotype is also influenced by the particular PCR and gel equipment that we use. Reactions are run in 96-well microtiter plates. We select 23 high and 22 low cases and run these with DNA from the two parental strains and the F1 hetrozygote (n = 48 samples). In this way, we are able to test two microsatellites per PCR run. Seventy markers can be analyzed in a week. Most of our F2 crosses contain 200 to 500 animals, so we are limiting our analysis to the extreme 5–10% in both tails. It is also possible to pool DNA from high and low groups and compare the relative intensity of alleles at markers that sample the entire genome (Taylor et al., 1994). When we find a chromosomal interval that harbors a suspected QTL, we then genotype all cases at these hot markers. This last step is particularly important because the initial analysis of the tails of the distribution may be influenced greatly by epistatic interactions between the main QTL and unlinked loci. Until all individuals have been typed, we will not have any good idea of the independent effects of the main QTL or of its actually chromosomal location (Darvasi, 1997c).

Once a QTL has been identified the procedure can be reversed–it is now possible to save time and money by phenotyping selectively. For example, it is possible to generate a very large sample of animals, genotype DNA taken from the tips of their tails when they are still pups, and then only phenotype particular genotypes as adults. Selective procedures like this have risks because they depend on the tails of the distribution being "well behaved." But tails of distributions are often the havens of unrecognized measurement errors, developmental flukes, and as mentioned above, of rare epistatic interactions. It is also good practice to sample from both tails of the distribution. An example of the power (and risk) of selective phenotyping is an interesting study by Chorney and colleagues (1998). They report that the insulin growth factor 2 receptor (IGF2R) is a marker for very slightly higher intelligence (4 IQ points) in a mixed population of Caucasians in the Cleveland area. Their analysis is based on a comparison of allelic differences between groups of very bright children (a selection as intense as 1 in 30,000 based on test scores) and groups of normal children. There are serious interpretive difficulties related both to the intense selection of individuals more than four standard deviations away from the mean and to the intentionally unbalanced sampling design that excludes humans with low or low-normal scores. What is often not appreciated is that QTLs can have their primary effects on the variance of a trait, rather than the mean of a trait, and obviously if only one tail of a distribution is sampled one runs the risk of confounding these types of QTLs. There is therefore still no direct evidence that a QTL near IGF2R actually influences mean intelligence. A simple analysis of patterns of IGF2R alleles in a large, normally distributed, sample would resolve this issue, but indications from the within-sample correlations performed by Chorney et al., do not give any cause for optimism (r = –0.07).

The major crosses for mapping QTLs

Box 4: Web Resources for Mapping QTLs
   http://www.nervenet.org and the Mouse Brain Library. My research group's web sites. Full texts of papers, grants, and an extended description of the 3D cell counting protocol. The Portable Dictionary of the Mouse Genome also available at http://www.nervenet.org, is a FileMaker database that includes current mapping information for more than 20,000 loci in mouse. The 10 MB database can be downloaded and used as a local resource. It contains data on all MIT microsatellite loci.
   http://www.nervenet.org/main/databases.html Our lab's database server. You can currently obtain data on fixed and fresh brain weights, sex, age, and body weights of >8000 animals belonging to >200 strains of mice.
   http://www.jax.org/ The Jackson Laboratory is the primary provider of mice for biomedical and genetic research. Their web site provides detailed information on a huge assortment of mice (inbred, transgenic, knockouts, recombinant inbred strains), gene mapping resources, and other tools. This site will also lead you to the Mouse Genome Database (MGD)—the definitive repository for mouse locus data.
   http://www.resgen.com/ Research Genetics Inc. is the primary source for microsatellite primers.
   http://www-genome.wi.mit.edu/ The Center for Genome Research at the Whitehead Institute includes a searchable databases of all MIT microsatellite loci and numerous other genetics resources for mapping in mouse and human populations.
   http://linkage.rockefeller.edu/ The Laboratory of Statistical Genetics at Rockefeller University includes a comprehensive links to linkage analysis software.
   http://nitro.biosci.arizona.edu/zbook/book.html A site associated with the superb text by Lynch and Walsh, Genetics and Analysis of Quantitative Traits. Many related QTL links. Supplementary material is organized by chapter.
   http://www.sfbr.org/nigms/Report.html A synopsis of the current status of quantitative genetics and prospects for funding at the federal level in the USA. The report highlights some of major conceptual problems we can expect to encounter, and perhaps resolve, in the next few decades.
   http://www.dartmouth.edu/~chance/ChanceLecture/contents.msri.html Bruce Weir's lecture at Dartmouth (Chance Lecture series) on the statistics of DNA forensic analysis.

What type of cross is best suited for different types of studies? There are currently five common types of line crosses that are commonly used to map QTLs in mice.

F2 intercross progeny. The F2 intercross (described briefly in our Purkinje cell example) is currently used in the first phase of the majority of QTL studies. The advantage of this cross between lines is that it is usually possible to scan the entire genome for QTLs by typing only 70 to 90 well-spaced marker loci. This used to be a major advantage, but genotyping is getting much easier, and other factors, such as precision and power of the analysis are now more important considerations. For most purposes a group of 200 or more F2 animals will need to be generated and typed. (One person working full time can genotype approximately 200 animals at 100 marker loci in one month. You would need four 96-well thermal cyclers; each run twice a day.) The more animals one can tolerate to type, the greater the power of the analysis and the greater the number of detected QTLs (Darvasi, 1998). In general, the efficiency of phenotyping is far more critical than that of genotyping, so be sure you spend time reviewing all data collection methods from start to finish and make those changes that will allow you to generated more numbers, more accurately, more quickly. Although the F2 is a very common ccorss, I don't recommend it anymore. It lacks adequate positional precision. One way to improve the F2 cross without much more work is to extend the cross for two more generations, out to generation 4 (F2 to G3 to G4). This will double the precision with which QTLs are mapped. the downside is that you will need to type twice as many marker loci, as many as 200 per animal. In carrying out a four generation cross be sure NOT to mate littermates when possible.

Backcross progeny. Mating F1 animals back to either or both parental strain generates one or two panels of backcross progeny. Backcross progeny are commonly used in mapping Mendelian traits, especially recessive mutations. There are only two genotype classes to contend with at any locus—the homozygote and the heterozgyote. The reduced number of genotypes simplifies scoring of genotypes and may allow QTLs to be detected more easily (Darvasi, 1998). An N2 cross may be particularly helpful in cases in which the F1 generation has an average phenotype very close to one of the parental strains; in other words, a case in which one set of parental alleles behaves in a dominant fashion. The F1 is then backcrossed to the "recessive" parental strain.

The main problem with both the F2 intercross and the backcross is that QTLs usually cannot be mapped precisely. Recombination events between nearby loci will be relatively uncommon in these animals, and even with large numbers of progeny it will often be difficult to pin down a QTL to a chromosomal interval of 10 cM—roughly 20 million bp of DNA. This is a long stretch of DNA that will typically contain hundreds of genes. Fishing for good candidate genes will be tough. If we can get down to a 1–2 cM interval then we may only have to contend with a few dozen candidate genes. There are now several ways to to greatly improve the precision with which QTLs are mapped (Darvasi & Soller, 1995; Darvasi, 1997b, 1998). The most simple method is the recombinant inbred intercross method that exploits a set of more than 102 very well mapped recombinant inbred strains now available from the Jackson Laboratory (Williams et al, 2001).

Advanced intercross progeny. An advanced intercross is generated by crossing F2 males and females to produce a third generation, referred to as G3 (G rather than F because filial matings are intentionally avoided). G3 individuals from different litters are then intercrossed in a way that minimizes inbreeding to produce a G4 generation (Darvasi, 1998). This process is repeated for several generation. Assuming that inbreeding can be minimized during this process, the cumulative amount of recombination between the original parental genomes doubles with each doubling of the generation number. Since all genetic maps are measured in units of recombination, not in basepairs, this process greatly increases the length of the genetic map. The consequence is a twofold improvement in the precision with which QTLs can be mapped for every twofold increase in generation number using a fixed number of offspring. Obviously, making an advanced intercross is a lot of work; in each generation 20 or more breeding cages need to be maintained (ideally 50 or more cages). The payoff is that at G8 the precision of mapping will be roughly four times greater than that of the F2 intercross. Because the genetic map is stretched so much, the number of marker loci that we need to use to scan the entire genetic map will also be increased four-fold. Each marker still only samples a region of 30 cM (the so-called swept radius), but the total length of the genome has now been stretched from 1400 cM to as much as 6000 cM. While we may have managed with 80 markers using an F2, we will need 320 markers to have the same assurance of detecting QTLs in a G8 cross. However, if we have already mapped QTLs using the F2 generation, or any of the subsequent generations, then we do not actually need to scan the entire genome of the G8 progeny. We need to remap only those chromosomal intervals that have been shown to harbor QTLs using earlier generations. The advanced intercross then can be used to map QTLs across the entire genome to a 95% confidence interval of ±2 cM. Even so this may require an analysis of 500 to 2000 cases (Darvasi, 1998). It is also practical to backcross an advanced intercross to one of the oroginal parental strains to generate an advanced backcross. The advanced backcross may have some advantages when traits are controlled by QTLs with dominant alleles or when the trait is influenced strongly by epistatic interactions.

A problem with an advanced intercross is that some chromosomal intervals will drift or be selected far from the Hardy-Weinberg genotype expectation. We expect a 1:2:1 ratio of BB, BD, and DD genotypes. But if B alleles on distal Chr 1 are significantly more advantageous then D alleles for fertility and survival then by the eighth generation there may be few animals with the DD genotype. This distrotion will result in non-syntenic (literally, "not on the same string or chromosome") gene loci being closely linked or associated in a statistical sense. For examle two regions on Chr 1 and Chr 9 that are both enriched for the BB genotype will be in linkage disequilibrium and thus seem to be linked despite their separation on different chromosomes. If uncorrected, such non-sytenic disequilibrium will generate spurious mapping results.

[A caution to anyone setting up an advanced intercross: Make sure to use as many possible parents in the penultimate generation as possible and only generate single litters from particular breeding pairs of mice. Rotate males in the the last generations rather than letting a single mating pair generate a large number of litters. Failure to rotate partners will result in marked inequities in the representation of haplotyhpes in the final generation used for mapping.]

Recombinant inbred strains: a neuroscientist's first choice. Recombinant inbred (RI) strains are the easiest means for neuroscientists to get started mapping QTLs. They have several significant advantages, the foremost being that isolating and typing DNA is not required. RI strains are generated in the same way as an advanced intercross with the important exception that at each generation, siblings are intentionally mated (Belknap et al., 1992; Silver, 1995, p. 207–213; Williams et al., 2001). By the 20th generation, progeny are almost entirely inbred. In each of these fully inbred strains, chromosomes of the original parental strains have recombined extensively: hence the name, recombinant inbred strains.

Unlike the case in the other genetic crosses we have considered, an entire strain rather than just a single mouse represents each recombinant genome. This feature is an obvious advantage for studies in which reliable neuroanatomical or behavioral traits are hard to generate from single animals. Accurate averages for each genotype can be obtained simply by phenotyping more animals. The absence of any heterozygotes also increases the phenotypic and genotypic variance for a given sample size, while at the same time reducing the number of genotype classes. This can make it easier to resolve QTLs.

Box 5: Mapping QTLs in human populations

Nuclear magnetic resonance imaging (MRI) makes it feasible to map QTLs that control variation in the size of CNS compartments in humans (Bartley et al., 1997). The general strategy of mapping QTLs is the same as that outlined in the main text, but obviously the complex genetic structure of human populations demands different tactics and is somewhat harder to accomplish since environmental factors cannot be eeasily controlled (Lynch & Walsh, 1998, chapter 16; Ott and Hoh, 2000; Weiss and Terwilliger, 2000). Kruglyak (1997) provides an introduction to one of several powerful new methods called linkage disequilibrium mapping.
   Several methods require information from related individuals, but the allele association method (Risch & Merikangas, 1996) can be used to map QTLs without any pedigree data. These mapping techniques could be readily combined with MRI and new chip-based methods to genotype single nucleotide polymorphisms (SNPs) to map quantitative neuroanatomical trait in humans. Allele association methods would be particularly adaptable to the analysis of postmortem brain tissues.

Bartley AJ, Jones DW, Weinberger DR (1997) Genetic variability of human brain size and cortical gyral patterns. Brain 120:257–269.

Kruglyak L (1997) What is significant in whole-genome linkage disequilibrium studies. Am J Hum Genet 61:810–812.

Risch N, Merikangas K (1996) The future of genetic studies of complex human disease loci in humans. Science 273:1516–1517.

The advantage of replicated recombinant genotypes. But for neuroscientists the most compelling advantage of RI strains is that it becomes possible to study relationships and correlations between many CNS traits. One can carry out immunocytochemical analyses using dozens of antibodies, then add receptor-binding studies, and finish with counts of cells using unbiased stereological methods. These structural parameters can then be compared with multiple behavioral traits (Dains et al., 1996). The accumulation of phenotypic data for these replicated genotypes, adds tremendous power to RI strains. If a QTL controls numbers of retinal ganglions, then does it also control variation in numbers of cells in the dorsal lateral geniculate nucleus, superior colliculus, or even a functionaly unrelated structure like the olfactory bulb? Do single QTLs have pleiotropic effects on multiple structures and cell populations? RI strains can be used to answer these questions. In comparison to other crosses in which each genotype is represented by a single mouse, RI strains are ideal for long-term correlative, collaborative, and corroborative studies of the genetic control of CNS structure.

At present there is little information on the CNS of RI strains (Belknap et al., 1992; Dains et al.; 1996, Williams et al., 1998a). My colleague Glenn Rosen and I have begun to redress this deficiency by systematically processing brains of many of RI strains with the idea of producing a library of sectioned material to be used specifically for 3D-counting of cell populations (see <www.mbl.org>). All the tissue is embedded in celloidin, cut in coronal or horizontal planes at 30 µm, and stained with cresyl violet. The current collection consists of ~350 cases, representing more than 40 strains.

If one of these recombinant inbred mice is genotyped, then the entire strain has been genotyped. For this reason, genotype data also accumulates as more researchers type these strains. We maintain a large database of error-checked and curated genotypes for five of the most common RI sets (AXB, BXA, BXD, BXH, and CXB). One of the largest and best-characterized RI sets consists of 35 BXD strains generated by Benjamin Taylor of the Jackson Laboratory by crossing C57BL/6J to DBA/2J (Taylor, 1989). Over 1750 gene and marker loci have now been genotyped in the majority of these strains. To map a QTL using these RI strains, we simply phenotype a sample from each of the 35 strains and then compare the pattern of strain averages or residuals with the distribution pattern of alleles at a subset of approximately 1000 loci. The procedure is explained in detail in Williams et al. (1998a) and complete genotype databases for BXD and other RI strains are available at www.nervenet.org. One last advantage of RI strains is that the genetic maps have a resolution that is 4-fold greater than that in an F2 cross. Like maps of an advanced intercross, those of RI strains are expanded, and in some cases it is possible to map a QTL with a precision of ± 2 cM (Williams et al., 1998a; Williams et al., 2001). This also means that QTLs mapped with RI strains will usually be associated with single genes, rather than sets of linked genes.

The main disadvantage of RI strains. At present the major disadvantage of RI strains is that there are still too few of them, and as a result only QTLs that have comparatively large effects can be mapped. This is not a serious problem at an early stage of analysis, before any QTLs have been mapped. In our work we have consistently been able to use the BXD strains to map at least one QTL controlling each of the following traits: ganglion cell number, eye size, brain weight, and cerebellar weight. But there are many other QTLs controlling each of these traits, and mapping QTLs with less robust effects will usually require larger numbers of recombinant genotypes. This has usually involved a second phase in which N2, F2 or advanced intercross progeny are used to confirm and extend the genetic analysis. To provide a better solution we have recently genotyped a set of 100 RI strains that all share one common parent (C57BL/6J). It is straightforward to pool data from multiple RI sets and to statistically assemble an RI super set to improve statistical power. Combining data from several sets of RI strains should double or triple the yield of QTLs. Another reported disadvantage of RI strains is that gene dominance effects can not be assayed since all of the RI strains are fully inbred. The absense of heterozygotes prevents one from determining the effects of a single allele on a trait. However, this is a trivial problem to overcome since it is easy to generate F1 intercrosses between RI strains and thereby make large numbers of obligate heterozygotes on any interval desired.

Congenic strains. Congenic strains are generated by crossing two inbred strains (generating an F1 hybrid) and then backcrossing repeatedly to one parental strains (Morel et al., 1997; Wakeland et al., 1997). The idea is to transfer and isolate a specific chromosomal interval from one strain (the donor) to another strain (the recipient). For example, we might wish to transfer an allele that is associated with a low Purkinje cell population from the genome of C57BL/6 onto the DBA/2J strain. To do this we would backcross the F1 progeny to DBA/2J. The backcross progeny—refered to as the N2 progeny—must then be genotyped at markers on both sides of the QTL and perhaps one additional marker half way between. (If the QTL is poorly localized three or even more markers may need to be typed to insure that the entire interval that may harbor the QTL is heterozygous in animals chosen for breeding.) On average 50% of the genome of the N2 progeny will be homozygous for DBA/2 alleles. Animals that have inherited one B allele at both markers M1 and M2 (Fig. 2) are then crossed back to the DBA/2 parental strain again. In this N3 backcross generation 75% of the genome will be homozygous for the DBA/2 genotype, but because we intentionally bred animals that were heterozygotes between M1 and M2, an average of half of the N3 progeny will still be heterozygous in this interval.

The process of genotyping a few specific markers in a small number of progeny and then crossing mice that carry B alleles back to DBA/2 is repeated for eight or more generations. Homozygosity for DBA/2 alleles increases from 75% at N3, to 87.5% at N4, and to 99.8% at N10. N10 progeny that are heterozygous for markers M1 and M2 are mated. Some of their offspring will be homozygous for C57 alleles in the M1–M2 interval and these mice and their offspring are used produce a constant stream of congenic animals that are homozygous for the C57 allele of our Purkinje cell QTL, but which are otherwise almost entirely DBA/2 type. We have transferred the low allele from one strain to another. We should probably also transfer the high Purkinje cell allele from DBA/2 to C57BL/6, generating what is called the reciprocal congenic strain. We have in essence turned a complex polygenic trait into a more tractable single gene trait. Now it is possible to phenotype many of these reciprocal congenic mice and compare Purkinje cells populations. Differences are due to the QTL in the M1–M2 interval. We can do this at any stage of development to determine when and where the allelic differences begins to affect Purkinje cell proliferation or survival.

Congenic strains are presently the main tool used to fine-map QTLs. This is accomplished by making sets of strains that are congenic for overlapping intervals that collectively define intervals that can be as short as 1 cM across the QTL's presumed location. Phenotypes of congenic strains are compared, and those that differ from parental strains are presumed to harbor the QTL (Darvasi, 1997b). Once a QTL has been mapped to this level of precision it become feasible to test candidate genes that map to the 2-LOD confidence interval.

There can be problems with a congenic approach to mapping QTLs. The primary problem is that as a QTL is introgressed into a recipient strain, it may lose its effects on phenotypes. To minimize frustration, the phenotype should be monitored during the process of introgressing the QTL (see Vadasz et al., 1998 for a beautiful example). QTLs can evaporate because key epistatic interactions upon which a phenotype depends are lost. In this predicament, it may be possible to identify and conserve intervals that appear to be critical for the penetrance of the QTL's effects.

Recombinant inbred intercross (RIX) progeny. This powerful yet simple QTL mapping method was devised by David Threadgill and colleagues and involves generating all or a subset of the many pairs of intercrosses (hence RIXs) between a set of recombinant inbred strains (Williams et al 2000). The derived RIX set extends the number of genomes available for phenotyping by a factor of n(n-1)/2, where n is the number of original RI strains. For example, the 13 CXB/By strains and the 35 BXD/Ty that have been used heavily can be supplemented by 6 and 17 times as many novel recombinant RIX genotypes, respectively. Each individual RIX strain has a unique but entirely predictable genome and like RI strains, many genetically defined RIX individuals can be phenotyped to greatly improve trait reliability. However, unlike an RI set, an RIX set closely resembles an F2 intercross, with a 1:2:1 segregation ratio of genotypes at marker loci. Furthermore, none of the RIX progeny are inbred, resulting in more robust mice from which more reliable traits can be extracted. Parental effects can be controlled to some extent by changing the polarity of the RIX; in other words, the maternal and paternal strains can be switched to produce reciprocal RIX progeny are genetically identical. Any significant differences between these reciprocal crosses are due to parental effects.

The statistical properties of RIX sets, that is, their power to detect QTLs, the positional precision of mapping data, and the accuracy of estimates of allelic effects of the trait of interest, are significantly better than those of the original RI set. For example, the statistical power of a complete RIX set (n = 561 RIXs) derived from the 34 BXD/Ty RI strains will typically be boosted 4- to 6-fold. QTLs that control as little as 5% of the phenotypic variance will often be detected with a power of better than 0.8. The 2-LOD confidence interval will also be somewhat smaller. Given the already extensive phenotypic data for RI strains of mice, the RIX method provides a highly effective means to confirm and extend existing QTL mapping data. And no genotyping is required! Selective analysis of specific RIX strains can also be a powerful method to test models of gene action and interaction effects among multiple QTLs. For an example of the application of this method see our recent abstract (Williams et al., 2000).

Like the other crosses, there are drawbacks to the RIX approach. The current problem with RIX mapping is that non-syntenic associations, that is, statistical linkage between different chromosomal segments, can be high. This problemin was already eluded to in the context of RI mapping (and see Williams et al., 2001), but in RIX mapping this problem has more obvious effects and will generate spurious QTLs. To exploit RIX mapping effectively will require carefully contructed RIX panels in which non-sytenic association has been controlled either by selection or by sophisticatedbx statistical control.

Combining results from two or more mapping studies. It is not difficult to perform a meta-analysis of multile independent set of mapping results using a method explained in Sokal and Rohlf (1996, section 18.1). In brief, one computes the combined probabilities of linkage in a number of crosses. This method was used to help map the Nnc1 locus (Williams et al., 1998, see equation in the methods) by combining data from two sets of RI strains, BXD and BXH. The same method can be used to combine an apparent hodge-podge of RI, F2, and N2 and advanced intercross data sets. This is most productive when the same sets of alleles are segregating in the different crosses (i.e., BXD, B6D2F2), but can even work in crosses that do not share the same parental genomes. The main difficulty in applying this method is deciding what values to actually combine. We have settled on a simple procedure in which all markers used in each cross are assigned a single "genome-wide" position value. These values have units of Morgans, and the range of values extends from 0.00 for the most centromeric marker on Chr 1 to 15.9 for the distal-most marker on Chr X. These uniform genome-position values provide a common framework map for every cross. The point-wise P values are compute for all markers and values between markers are interpolated in 1 cM bins. The P values of identical 1 cM bins are then combined. This pooling procedure (BXD plus and F2) was essential in mapping the first set of QTLs affecting olfactory bulb weight (Williams et al., 2001). (This paragraph added Oct 27, 2000 by RW)

The highs and lows of QTL analysis

QTL mapping is initially done at a level of analysis that is far removed from cellular and molecular mechanisms, and it may at times seem that the research has lost touch with biology. But keep in mind that this QTL analysis is itself just a prelude to a renewed molecular and cellular analysis—that the first aim is simply to determine where key regulatory genes are located. In some ways this first stage is like air reconnaissance: At high altitude, we hope to succeed in discerning the outlines of roads and walls marking lost cities in the desert. We now know the approximate locations, but we need to get on the ground to explore, to survey, and ultimately, to excavate these sites. The payoff can be great, and in QTL analysis the exploration on the ground does not need to be delayed for long. As soon as QTLs have been mapped, and long before candidate genes have been identified or cloned, QTLs can be used as "reagents" to probe neuronal development and function.

In our own work on neuron number control 1 (Nnc1)—a QTL that has pronounced effects on numbers of retinal ganglion cell populations—Richelle Strom and I have been able to show that this QTL modulates neurogenesis rather than cell death (Strom & Williams, 1999). We now also suspect that Nnc1 may be the thyroid hormone alpha receptor gene, and in collaboration with Guomin Zhou, Douglas Forrest, and Bjorn Vennström, we are now examining effects of inactivating this gene on the ganglion cell population.

There are several other powerful ways to exploit QTLs prior to cloning. Chromosomal segments containing QTL alleles associated with high or low phenotypes can be transferred to well-characterized inbred strains of mice—producing the congenic strains mentioned above (Darvasi, 1997b). Once a set of congenic mice has been generated, it becomes possible to explore developmental, pathological, and even environmental mechanisms that lead to differences between strains that carry the alleles associated with high and low traits. Furthermore, sets of high and low alleles at genes on different chromosomes can be combined in different combinations to test how QTLs interact to modify CNS architecture.

The near future of QTL mapping. The principal goal of QTL analysis is to identify the polymorphic sequences associated with each QTL. As the genomes of both humans and mice are sequenced in the next decade, the still arduous process of identifying genes will become progressively easier. We can soon expect to have detailed maps and databases of genes expressed in different tissue and cell types at different stages of development. It will then be possible to combine data on well-mapped QTLs with precise data on chromosomal positions of particular types of genes and expressed sequence tags (ESTs) to quickly winnow the list of candidate genes. This future belongs to those who start weighing, counting, measuring, and mapping now.

Acknowledgments

This work was supported in part by grants from the NEI (EY08868 and EY6627) and NINDS (NS35485) to RW. I thank Dr. Benjamin Taylor, James Cheverud, Ty Vaughn, and Robert Hitzeman for corrections and comments. I thank my colleagues Drs. Guomin Zhou, Richelle Strom, David Airey, and Glenn Rosen for their help. My thanks to Kathryn Graehl for editing, Alexander Williams for building internet sites, and to Drs. John Morrison and Patrick Hof for organizing the 1998 Short Course in Quantitative Neuroanatomy.

Literature Cited

Abercrombie M (1946) Estimation of nuclear population from microtome sections. Anat. Rec. 94:239-247.http://www.nervenet.org/papers/Aberfp.html

Airey DC, Lu l, Williams RW (2001) Genetic control of the mouse cerebellum: Identification of quantitative trait loci modulating size and architecture. J Neurosci. in review.http://www.nervenet.org/papers/cerebellum2000.html

Aylsworth, AS (1998) Defining disease phenotypes. In: Approaches to gene mapping in complex human diseases. Haines JL, Pericak-Vance MA, eds. Wiley-Liss, pp 53–76.

Belknap JK, Phillips TJ, O'Toole LA (1992) Quantitative trait loci associated with brain weight in the BXD/Ty recombinant inbred mouse strains. Brain Res Bull 29:337–344.

Buck KJ, Metten P, Belknap JK, Crabbe JC (1997) Quantitative trait loci involved in genetic predisposition to acute alcohol withdrawal in mice. J Neurosci 17:3946–3955.http://www.jneurosci.org/cgi/content/full/17/10/3946

Chorney MJ, Chorney K, Seese N, Owen MJ, Daniels J, McGuffin P, Thompson LA, Detterman DK, Benbow C, Eley T, Plomin R (1998) A quantitative trait locus associated with cognitive ability in children. Psychological Rev 9:159–166.

Churchill GA, Doerge RW (1994) Empirical threshold values for quantitative trait mapping. Genetics 138:963–971.

Crawley JN, Belknap JK, Collins A, Crabbe JC, Frankel W, Henderson N, Hitzemann RJ, Maxson SC, Miner LL, Silva AJ, Wehner JM, Wynshaw-Brois A, Paylor R (1997) Behavioral phenotypes of inbred mouse strains: implications and recommendations for molecular studies. Psychopharmacol 132:107–124.

Dains K, Hitzeman B, Hitzeman R (1996) Genetic, neuroleptic-response and the organization of cholinergic neurons in the mouse striatum. J Pharmacol Exp Ther 279:1430–1438.

Darvasi A, Soller M (1995) Advanced intercross lines, an experimental population for fine genetic mapping. Genetics 141:1199–1207.

Darvasi A (1997a) Behav Genet 27:125–132.

Darvasi A (1997b) Interval-specific congenic strains (ISCS): an experimental design for mapping a QTL into a 1-centimorgan interval. Mamm Gen 8:163–167.

Darvasi A (1997c) The effect of selective genotyping on QTL mapping accuracy. Mamm Gen 8:67–68.

Darvasi A (1998) Experimental strategies for the genetic dissection of complex traits in animal models. Nat Gen 18:19–24.

Dietrich W, Katz H, Lincoln SE, Shin HS, Friedman J, Dracopoli NC, Lander ES (1992) A genetic map of the mouse suitable for typing intraspecific crosses. Genetics 131:423–447.

Dietrich W, Miller JC, Steen RG, Merchant M, Damron D, Nahf R, Gross, A, Joyce DA, Wessle M, Dredge RD, Marquis A, Stein LD, Goodman N, Rage DC, Lander ES (1994) A genetic map of the mouse with 4,006 simple sequence length polymorphisms. Nature Gen 7:220–245.

Elston RC (1998) Methods of linkage analysis—and the assumptions underlying them. Am J Hum Genet 63:931–934. Avaiable at <http://www.journals.uchicago.edu/AJHG/journal/issues/v63n4/980636/980636.html>.

Falconer DS, Mackay TFC (1996) Introduction to quantitative genetics. 4^th ed. Longman Scientific, Harlow, Essex.

Gilissen E, Zilles K (1996) The calcarine sulcus as an estimate of the total volume of the human striate cortex: a morphometric study of reliability and intersubject variability. J Brain Res 37:57–66.

Greenspan RJ (2001) The flexible genome. Nature Reviews 2:383–387.

Gundersen HJG, Bagger P, Bendtsen TF, Evans SM, Korbo L, Marcussen N, Møller A, Nielsen K, Nyengaard JR, Pakkenberrg P, Sørensen FB, Vesterby A, West MJ (1988) The new stereological tools: disector, fractionator, nucleator, and point sampled intercepts and their use in pathological research and diagnosis. Acta Path Microbiol Immunol Scand 96: 857-881.

Hegmann JP, Possidente B (1981) Estimating genetic correlations from inbred strains. Behav Gene 11:103–114.

Kearsey MJ, Pooni HS (1996) The genetical analysis of quantitative traits. Chapman Hall, London.

Laird PW, Zijderbeld A, Linders K, Rudnicki MA, Jaenisch R, Berns A (1991) Simplified mammalian DNA isolation procedure. Nucleic Acid Res 19:4293.

Lander ES, Botstein D (1989) Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185–199.

Lander ES, Schork NJ (1994) Genetic dissection of complex traits. Science 265:2037–2048.

Lu L, Airey DC, Williams RW (2001) Genetic dissection of the mouse hippocampus: Identification of loci with specific effects on hippocampal size. Journal of Neurosicence 21:3503–3514. JN 0625-00.

Lui BH (1998) Statistical genomics. Linkage, mapping and QTL analysis. CRC Press, Boca Raton FL.

Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits. Sinauer, Sunderland MA.

Manly K (1993) A Macintosh program for storage and analysis of experimental genetic mapping data. Mamm Genome, 4: 303–313.

Morel L, Mohan C, Yu Y, Croker BP, Tian N, Deng A, Wakeland EK (1997) Functional dissection of systemic lupus erthematosus using congenic mouse strains. J Immuno 158:6019–6028.

Ott J, Hoh J (2000) Statistical approaches to gene mapping. Am J Hum Genet 67:289–294.

Pennisi E (2000) Mouse sequencers take up the shotgun. Science 287:1179–1181.

Roff DA (1997) Evolutionary quantitative genetics. Chapman & Hall, New York.

Rosen GD, Williams RW (2001) Complex trait analysis of the mouse striatum: Independent QTLs modulate volume and neuron number. BMC Neuroscience 2:5 @ www.biomedcentral.com/1471-2202/2/5

Sandberg R, Yasuda R, Pankratz DG, Carter TA, Del Rio JO, Wodicka L, Mayford M, Lockhart DJ, Barlow C (2000) Regional and strain-specific gene expression mapping in the adult mouse brain. Proc Natl Acad Sci USA 97: 11039–11043.

Silver LM (1995) Mouse genetics: Concepts and applications. Oxford UP, New York.

Sokal RR, Rohlf FJ (1995) Biometry: the principels and practice of statistics in biological research, 3rd ed. Freeman and Co, San Francisco.

Suner I, Rakic P (1996) Numerical relationship between neurons in the lateral geniculate nucleus and primary visual cortex in adult macaque monkeys. Vis Neurosci 13:585–590.

Strom RC, Williams RW (1998) Roles of cell production and cell death in the generation of normal variation in neuron number. J Neurosci 18:9948–9953. http://www.nervenet.org/papers/dev.html.

Takahashi JS, Pinto LH, Vitaterna MH (1994) Forward and reverse genetic approaches to behavior in the mouse. Science 264:1724–1733.

Tanksley SD (1993) Mapping polygenes. Annu Rev Genet 27:205–233.

Taylor BA (1989) Recombinant inbred strains. In: Genetic variants and strains of laboratory mouse. 2^nd ed. pp 773–789. Oxford UP, New York.

Taylor BA (1972) Genetic relationships between inbred strains of mice. J. Heredity 63:83–86.

Taylor BA, Navin A, Phillips SJ (1994) PCR-amplification of simple sequence repeat variants from pooled DNA samples for rapidly mapping new mutations of the mouse. Genomics 21:626–632.

Vadasz C, Sziraki I, Sasvari M, Kabai P, Murthy LR, Saito M, Laszlovszky I (1998) Neurochemical Res 23:1337–1354.

Wakeland E, Morel L, Achey K, Yui M, Longmate J (1997) Speed congenics: a classic technique in the fast lane (relatively speaking). Immunology Today 18:472–477.

Weiss KM, Terwilliger JD (2000) How many diseases does it take to map a gene with SNPs? Nature Genetics 26:151–157.

Williams RW (2000) Mapping genes that modulate mouse brain development: a quantitative genetic approach. In: Mouse Brain Development (Goffinet AF, Rakic P, eds). Springer, New York, pp 21–49. Full text and figures available at http://www.nervenet.org/papers/BrainRev99.html.

Williams RW, Rakic P (1988) Three-dimensional counting: An accurate and direct method to estimate numbers of cells in sectioned material. J Comp Neurol 278:344–353. Full text with corrections and additions at http://www.nervenet.org/papers/3DCounting.html

Williams RW, Strom RC, Rice DS, Goldowitz D (1996) Genetic and environmental control of variation in retinal ganglion cell number in mice. J Neurosci 16:7193–7205.http://www.jneurosci.org/cgi/content/full/16/22/7193

Williams RW, Strom RC, Goldowitz D (1997) Soc Neurosci Abst

Williams RW, Strom RC, Goldowitz D (1998a) Natural variation in neuron number in mice is linked to a major quantitative trait locus on Chr 11. J Neurosci 18:138–146.http://www.jneurosci.org/cgi/content/full/18/1/138

Williams RW, Strom RC, Zhou G, Yan Z (1998b) Genetic dissection of retinal development. Sem Cell Devel Biol: in press. http://www.nervenet.org/papers/Retina_Rev.html

Williams RW, Gu J, Qi S, Lu L (2001). The genetic structure of recombinant inbred mice: high-resolution consensus maps for complex trait analysis. Release 1, January 15, 20001 at nervenet.org/papers/bxn.html. Genome Biology, in review.

Wimer RE, Wimer CC (1989) On the sources of strain and sex differences in granule cell number in the dentate area of house mice. Dev Brain Res 48:167–176.

Zhou G, Williams RW (1997) Mapping genes that control variation in eye weight, retinal area, and retinal cell density. Soc Neurosci Abst 23:864. http://www.nervenet.org/papers/ZhouSN98.html

Copyright © 1998 by R.W. Williams

Neurogenetics at University of Tennessee Health Science Center