HBP Rosen Striatum M430V2 (Nov04) GCRMA

Download datasets and supplementary data files


This November 2004 data freeze provides estimates of mRNA expression in the striatum (caudate nucleus of the forebrain) of BXD recombinant inbred mice measured using Affymetrix Mouse Genome 430 2.0 short oligomer microarrays. Data were generated at Beth Israel Deaconess Medical Center (BIDMC, Boston MA) by Glenn D. Rosen with the support of a Human Brain Project (HBP) grant. Approximately 125 brain samples from 24 strains were used in this initial experiment. Data were processed using the Affymetrix Microarray Suite 5 (MAS 5) transform. To simplify comparison among Nov04 data sets, values of each array have been log2 transformed and adjusted to an average expression of 8 units.

Experiment design

Animals were obtained from The Jackson Laboratory and housed for several weeks at BIDMC until they reached ~2 months of age (range from 55 to 62 days). Mice were killed by cervical dislocation and brains were removed and placed in RNAlater for 5 to 10 minutes prior to dissection. Cerebella and olfactory bulbs were removed; brains were hemisected, and both striata were dissected using a medial approach by Rosen that typically yields 5 to 7 mg of tissue per side. The purity of this dissection has been validated by an analysis of acetylcholinestase activity. A pool of dissected tissue from 3 or 4 adults (approximately 25-30 mg of tissue from 6 striata) of the same strain, sex, and age was collected in one session and used to generate cRNA samples. Rought 90 to 95% of all cells in the striatum are medium spiny neurons (Gerfen, 1992, for a review of the structure and function of the neostriatum).

mRNA processing: We used the Amersham Biosciences cRNA synthesis kit protocol.

About cases

This data set includes estimate of gene expression for 24 genetically uniform lines of mice: C57BL/6J (B6, or simply B), DBA/2J (D2 or D), and 22 BXD recombinant inbred (RI) strains derived by crossing female B6 mice with male D2 mice and then inbreeding progeny for over 21 generations. This set of RI strains is a remarkable resource because these strains have been extensively phenotyped for hundreds of interesting traits over a 25-year period (see the WebQTL BXD Published Phenotypes database). A significant advantage of this RI set is that both parental strains (B6 and D2) have been extensively sequenced and are known to differ at approximately 1.8 million SNPs. Coding variants (mostly single nucleotide polymorphisms and insertion-deletions) that may produce interesting phenotypes can be rapidly identified in this particular RI set.

BXD1 through BXD32 were produced by Benjamin A. Taylor starting in the late 1970s. BXD33 through BXD42 were also produced by Taylor, but from a second set of crosses initiated in the early 1990s. These strains are all available from the Jackson Laboratory, Bar Harbor, Maine.

The table below lists the arrays by strain, sex, and age. Each array was hybridized to a pool of mRNA from 3 to 4 mice.




C57BL/6J (B6) DBA/2J (D2)
B6D2F1 (F1)   BXD1  
BXD2 ♂♀ BXD5  
BXD9 ♂♀ BXD11
BXD14 BXD15 ♂♀
BXD16 BXD18  
BXD19 BXD21 ♂♀
BXD22   BXD23  
BXD24   BXD25  
BXD27 ♂♀ BXD28 ♂♀
BXD29 ♂♀ BXD31
BXD32 BXD33 ♂♀
BXD34 ♂♀ BXD38 ♂♀
BXD39 BXD40 ♂♀
BXD42 ♂♀    

Select the strain name in the table above to review details about the specific cases and to view the array quality control image processed using the PerfectMatch program by Li Zhang.

About platform

Affymetrix Mouse Genome 430 2.0 array: The 430v2 array consists of 992936 useful 25-nucleotide probes that estimate the expression of approximately 39,000 transcripts (many are actually duplicates). The array sequences were selected late in 2002 using Unigene Build 107. The array nominally contains the same probe sequence as the 430A and B series. However, we have found that roughy 75000 probes differ from those on A and B arrays.

About data processing

Affymetrix CEL files obtained from the BIDMC Genomics Core were processed as follows.
  • Step 1: We added an offset of 1 to the CEL expression values for each cell to ensure that all values could be logged without generating negative values.
  • Step 2: We took the log base 2 of each cell.
  • Step 3: We computed the Z score for each cell.
  • Step 4: We multiplied all Z scores by 2.
  • Step 5: We added 8 to the value of all Z scores. The consequence of this simple set of transformations is to produce a set of Z scores that have a mean of 8, a variance of 4, and a standard deviation of 2. The advantage of this modified Z score is that a two-fold difference in expression level corresponds approximately to a 1 unit difference.
  • Step 6: We plotted these modified Z score probe level expression estimates in DataDesk. Male-female scatterplots of the probe signals were compared strain by strain to highlight poor array data sets. A total of 36 arrays passed this stringent quality control step.
  • Step 7: We computed the arithmetic mean of the values for the set of microarrays for each of the individual strains. We have not corrected for background beyond the background correction implemented by Affymetrix in generating the CEL file.
Probe set data from the CHP file: The expression values were generated using the MAS 5. The same simple steps described above were also applied to these values. Every microarray data set therefore has a mean expression of 8 with a standard deviation of 2. A 1 unit difference therefor represents roughly a two-fold difference in expression level. Expression levels below 5 are usually close to background noise levels.


Data were generated with funds to Glenn Rosen from P20 MH62009 (see below for specifics). Samples and arrays were processed by the Genomics Core at Beth Israel Deaconess Medical Center by Towia Libermann and colleagues.


This text file originally generated by GDR, RWW, and YHQ Nov 2004. Updated by RWW Nov 17, 2004; GDR and RWW, Dec 23, 2004.