|
|
|||||||
Articles |
Biological Engineering Division, Massachusetts Institute of Technology, Cambridge, MA 02139
Submitted November 14, 2005; Revised January 9, 2006; Accepted January 17, 2006
Monitoring Editor: Marshall Sundberg
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Advances in RNAi coincide with genome-wide data collection from large-scale sequencing projects and from DNA microarray experiments. Sequence data are available for nearly every model organism, and sequencing technology advances daily, promising ever cheaper and faster options (Zwick, 2005). There is little doubt that these data sets inform every aspect of discovery science: illuminating questions of evolution, cell physiology, and genetic control just to name a few, despite that very little information can be gleaned from an organism's DNA content alone. Experimental tools to perturb and then study cells should be paired with computational tools that compare sequences or outcomes if an investigator is to find meaning in the strings of G's, A's, T's, and C's that are too numerous to examine by hand.
DNA microarray technology is one means of exploiting sequence information to understand the gene expression pattern of a cell as a whole (Bowtell and Sambrook, 2003). In this technique, mRNA pools are isolated from cells that have been treated distinctly. These pools are differentially labeled, usually with fluorescent dyes, and then simultaneously hybridized to a slide with DNA sequences spotted or synthesized at particular addresses on the slide's surface. By comparing the dye's signal intensities at each address, investigators can assess relative expression levels for each gene represented on the slide, more than 18,000 genes for the experiment described here.
It seems RNAi, sequence databases, and microarray analysis could be combined to provide a powerful suite of tools (Baum and Craig, 2004), enabling rational and precise modulation of gene expression and subsequent study of the cell-wide effect of such perturbationat least in theory. In reality, RNAi experiments are hampered by unintended consequences of the RNA treatment (Couzin, 2004; Samuel, 2004); sequence databases are not fully annotated, and microarray data sets are difficult to correlate from lab to lab (Tan et al., 2003; Marshall, 2004; Bammler et al., 2005; Allison et al., 2006). Rather than ignore or simplify these ambiguities, a series of investigational laboratory experiments was developed to allow undergraduate students to accentuate them. Originally, this series was used in a class with 23 engineering students at Massachusetts Institute of Technology (MIT, Cambridge, MA); however, this curriculum could be used in its entirety or partially in a variety of lab contexts. Admittedly, MIT has technically savvy undergraduate students, excellent lab facilities, and enviable course budgets but that combination of good fortune is not required for the adoption of this lab series in other institutions or with other student cohorts. Because the primary goals are to provide a greater understanding of the techniques involved, and the data collected and its limitations, the effectiveness of these exercises would not be diminished if some of the wet lab work is omitted and precollected data sets are used for analysis (e.g., using sample data available at http://hdl.handle.net/1721.1/30603). Fundamentally important to this series, and what sets it apart from other "how to" descriptions for teaching microarrays and bioinformatics (e.g., Altman, 1998; Campbell, 2003; Brewster et al., 2004; Bradford et al., 2005; Shachak et al., 2005), is the sense that science is an ongoing process of discovery requiring student imagination and creativity. Rather than offer a collection of established facts or cookbook directions to follow, these experiments ensure that students avoid a "hands-on but brain-off" lab experience.
| OVERVIEW OF RNAi/MICROARRAY EXPERIMENT |
|---|
|
|
|---|
Similarly, DNA microarrays were described as a remarkable but flawed technology. The number of genes that can be simultaneously queried and the microfabrication techniques involved in array production were stressed as great advances. Some technical frustrations associated with microarrays, such as differences in dye stability and incorporation, were described as were the problems encountered when comparing different array platforms (e.g., Agilent, Affymatrix, and Combimatrix) due to their dissimilar manufacturing and hybridization protocols. Probe design questions arose, requiring a short but productive foray into the world of bioinformatics; exploring the meaning of E-values provided by the BLAST program; and questions such as "How many base pairs define a unique sequence in the human genome?" Finally, data analysis ambiguities were described, dropping the jaws of many in the class who could not believe no consensus has been reached for measurements such as background subtraction or repression ratios (Zhang et al., 2005; Allison et al., 2006). Before ever handling their own arrays, students were already feeling more modest about what a completely successful experiment would look like, realizing that "to a hammer, everything looks like a nail," i.e., that microarrays might not be the best experimental approach all the time.
The immediate experimental goal of the investigation was to design an siRNA that would specifically silence expression of Renilla luciferase when a plasmid expressing that gene and the student-designed siRNA were cotransfected into a human cell line. A positive control for Renilla luciferase silencing and a transfection control of firefly luciferase were provided. An outline for each day's laboratory work is presented in Figure 1.
|
|
The transfection pattern followed by the students is outlined in Figure 2. Two six-well plates of 5070% confluent cells were provided for each pair. Students treated the cells with the transfection agent alone or with the reporter plasmid with and without siRNAs. The reporter plasmid (psiCHECK-2; Promega, Madison, WI) constitutively expresses both firefly and Renilla luciferase, with the former serving as control for transfection efficiency. As a positive control for RNAi, students were given a validated siRNA, previously established to decrease Renilla luciferase activity by
90%. Students were also given a "nontargeting" siRNA. This commercially available reagent has no effect on Renilla luciferase expression, and, equally important, has at least four mismatches to every known human gene, giving "minimal, reproducible nonspecific target effects" according to the vendor. The validated and the nontargeting siRNAs were used as controls that would bracket the luciferase activity measurements students would make in the following lab period, giving the high and low extremes for silencing to which they could compare their siRNAs.
|
|
Day 4: Microarray Hybridization
Microarray analysis allowed the students to examine changes in HeLa cell gene expression due to off-target effects and nonspecific responses to the siRNA treatment. Students isolated total RNA from cells transfected with the reporter plasmid and their experimental siRNA. They compared the expression pattern in this sample to another sample, most often choosing a sample that had been transfected with one of the control siRNAs, reasoning that off-target effects might be identified by comparing the profile of affected genes. This comparison, however, could not reveal gene expression altered in the same way by both treatments, and some students feared they were making the "wrong" choice. Everyone eventually came to realize that microarrays are limited to pairwise comparisons and that more than two samples would be needed to answer every question of interest. For example, cells transfected with and without siRNA were required to find nonspecific effects of siRNA treatment, but from these data it would not be evident how the expression profile differed for two siRNAs.
Students isolated total RNA from their luciferase assay extracts (diluting the lysate 1:2 with the RTL-BME lysis reagent in the QIAGEN RNA purification kit [QIAGEN, Valencia, CA] and then proceeding as directed) and performed cDNA synthesis reactions according to the instructions in the Genisphere (Hatfield, PA) 900-HS array kit (http://www.bio.davidson.edu/projects/gcat/GCATprotocols.html#general; Figure 4). With this method, polyA-RNA is amplified with reverse transcriptase priming from oligo(dT) primers provided by the vendor. These primers contain one of two "capture sequences," each specific for a fluorescent probe. Students hybridized their cDNA pair to a single microarray, Human v1A (Agilent Technologies, Palo Alto, CA), containing 22,000 60-mer oligonucleotides (oligos) specific for 18,000 human genes. Another source for microarrays is the Genome Consortium for Active Teaching (http://www.bio.davidson.edu/projects/gcat/gcat.html), a nonprofit organization whose mission is to make this technology widely available to undergraduates. The next day, the arrays were washed, reprobed with the Cy3 (green) and Cy5 (red) fluorescent labels, and then scanned. The teaching faculty performed the washing, but students could be made responsible for this step if time permits.
|
To everyone, the data collected from the microarray seemed vast and difficult to navigate. What signal measurements should they compare? Is examining the mean or median more appropriate? What gave rise to the scratches, bubbles, and bright spots on the image? Can those artifacts be identified on the spreadsheet and corrected? Where was the oligo for luciferase? Undoubtedly, the homework and lecture information leading up to the day of data analysis was helpful, but equally clear is the observation that the students were most fully engaged when confronted by the data and aware of the intellectual challenge required to manage it. The importance of analyzing their own data cannot be overlooked as a factor in their willingness to immerse themselves wholeheartedly in the next level of intellectual challenge, but sample data could be used.
Each group began by trimming their data sheets to include only probe identity information and their associated Cy3 and Cy5 signal and background intensity measurements, comparing the mean and the medians of each to decide which gave a more reliable estimate for spot intensity. With some guidance by the teaching staff, students identified and deleted unused negative and positive controls from their spreadsheets and then considered whether the Cy3 and Cy5 signals needed to be normalized. Most concluded the red and green channels were well matched, a consequence of Genisphere's indirect labeling technique, and that the array represented enough genes to expect parity in the two samples analyzed. Most groups subtracted background signal though one pair (both Electrical Engineering and Computer Science majors) first used variations in background signal to explore ideas about machine signal noise.
Students enjoyed wrestling the data into a usable form but were even more delighted as they dug into their "cleaned up" measurements to make discoveries about gene expression. They followed the convention for converting intensity measurements to log2 values (Campbell and Heyer, 2003) and then thoughtfully considered which log2 values were meaningful. Students found different ways to parse their data further in an effort to discover trends and biological responses. With the only requirement being a justifiable and thoughtful approach to present to the class at the next session, some groups considered only highly expressed genes, using an M versus A plot to identify the border between low and high expression (Bowtell and Sambrook, 2003). Others considered only dramatically affected genes, eliminating any that differed <16-fold in the two treatments. Still other students restricted their analysis to reproducible signal measurements: those genes with duplicate spots on the array. Regardless of their approach, the students were surprised then delighted by their autonomy to analyze their data.
Day 6: Student Presentations
Student understanding of the material in this series of experiments is amenable to many kinds of assessment. A formal lab report is a good choice if development of scientific writing facility is a goal. An alternative form of assessment stressing oral presentation skills was chosen for this investigation. Students were asked to present their findings as a 10-min talk. There were specific written guidelines to help them structure the talk, including the approximate time and number of PowerPoint slides they should dedicate to the introduction, data presentation, and summary. A list of "dos and donts" was also included, with specific guidance about format (e.g., do make every element of your slide visible to the entire room. This means 20-point font or greater), content (do say what your study contributed to the field), and presentation style (dont read lists from slides). In addition to written guidelines, ideas about the assignment were exchanged though a lively discussion enjoyed by the students and teaching faculty during class time, with students offering their impressions, some very funny, of different lecturing styles encountered at MIT. In future offerings of this class, more time will be set aside for this discussion.
Students submitted an electronic file with their presentation as well as a print copy of each slide listing the associated "talking points." These outlined the content and transitions for each slide the student intended to describe, ensuring that the talks were fully planned in advance and also providing some of the more nervous public speakers with a text if they got lost or flustered during the presentation itself. Despite the large variation in the detail and length of the talking points, they were an invaluable aid in documenting and assessing the talks and they were particularly helpful to those students less comfortable with public speaking or for whom English was not a first language.
| LEARNING OUTCOMES |
|---|
|
|
|---|
Knowledge Outcomes
Throughout this experiment, students gain first-hand experience with powerful and new technologies for expression engineering. Some of the facts, terms, concepts, and theories it emphasizes are loss of function analysis, siRNA nomenclature/structure/processing, RNAi off-target effects and how to experimentally control for them, bioluminescent reactions and their use as a readout for gene expression, microarray technology, and data analysis. Students' self-identified knowledge outcomes were recorded on an anonymous survey and included specifics of RNAi (e.g., RNA-induced silencing complex, dicer, antisense, and target), some greater understanding of data analysis (e.g., luciferase and microarray data analysis), or both.
Student PowerPoint presentations were evaluated with a grading rubric (Figure 5A), giving some relative measure for student understanding of the material. Although a scale of 15 was established, assigned scores fell only on the upper end of the scale, indicating a complete or near complete understanding of the experiment. Nearly all student presentations included a complete statement of the experimental goal, offering some version of "siRNA design, efficacy, and specificity" to answer the question "What did we do and why?" Most students also gave a complete and accurate description of the experimental steps, and if they omitted any steps, it was invariably those guiding their siRNA design. Most presentations included completely accurate descriptions of the experimental technology, although almost one-half of the students had small misunderstandings, for example, citing 22,000 as the number of human genes on the microarray when in fact there were approximately 18,000 gene represented, with other spots either replicates or controls. Finally, student facility with data interpretation was measured, looking particularly at their description of signals on the microarray. All students correctly described the green, red, and yellow spots as indicative of different signal ratios, but more than one-half the class did not explicitly state how the green or red signals were a consequence of up- or down-regulated gene expression. This is an important aspect of microarray analysis, an aspect that every student should have described but only one-half did describe. Other important aspects of their data interpretation were uniformly correct, including their conclusions from the luciferase assays and transfection controls.
|
Skills
This investigation prepares students to capably and skeptically approach any gene expression experiment that includes RNAi. They obtain research-related experience that includes sensitive, hands-on work with RNA, mammalian tissue culture, enzymatic assays, and microarrays. Students gain analytical skills through their critical evaluation of experimental design and careful data analysis. Their communication skills also improve because they are obliged to keep careful records of their experimental data and to present their findings to the group. Finally, this experiment requires that students collaborate and share data, reagents, and credit for ideas, thereby building their teamwork skills. Thus, students are "learning by doing," actively engaging with the material and its challenges.
Attitude
Perhaps the most long-lasting outcome of this series of investigations is to achieve some positive change in student attitude toward themselves as investigators. Student feedback was collected to measure such change, by using a class-specific survey and the standard MIT course evaluation form, both of which the students submitted anonymously (Figure 5C). The class-specific survey asked students open-ended questions seeking their opinions about the class, about their confidence with the material, and about their recommendations to others who might want to take the class. Although responses were submitted by only 10 of the 23 enrolled students, the feedback was overwhelmingly positive. The comment that occurred most often was some version of "this lab was cool because we did real experiments instead of ones to illustrate a concept." From this sort of remark, it can be gathered that as students, they felt capable and entitled to discovery-driven learning. Another frequently expressed sentiment was some version of "I liked using new technologies," showing appreciation and excitement to be doing techniques that were the "envy of their graduate school friends."
Of the questionnaires received, nine of the 10 respondents would recommend the class to a friend, with the 10th offering no comment. The small class size and "a lab that was fun" were frequently mentioned in this regard. The experience also generated further interest in the discipline, because eight of the 10 respondents planned to take more classes in biological engineering, and the remaining respondents offered no comment or indicated some uncertainty. One student, who signed her name to the survey, wrote how the lab restored her confidence in the "drama" of biological research. Another wrote, "I feel much more interest in the future of bioengineering after the microarray experiment not really because it was so neat but more because it reveals how much more there is to be known."
| CONCLUSIONS |
|---|
|
|
|---|
As an educator, it is an interesting challenge to teach at the leading edge of a new field, a place where the science is still incompletely understood and the norms are not yet established. This series of experiments is an opportunity to embrace the limitations of what is known rather than be frustrated by them. Students were asked to apply their best creative and thoughtful efforts to a real problem, and, freed from the worry of being wrong, the class achieved remarkably diverse and thoughtful endpoints. This series also reinforced my belief in teaching "authentically," asking students to learn the way scientists do, in the context of an interesting question (Campbell, 2004). The experience confirmed my notion that, as a teacher, it is better to ask the right questions with the students rather than to provide all the answers.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
Altman, R. B. (1998). A curriculum for bioinformatics: the time is ripe. Bioinformatics 14, 549550.
Bammler, T., et al. (2005). Standardizing global gene expression analysis between laboratories and across platforms [correction published in Nat. Methods (2005). 2, 477]. Nat. Methods 2, 351356.[CrossRef][Medline]
Baum, B., and Craig, G. (2004). RNAi in a postmodern, postgenomic era. Oncogene 23, 83368339.[CrossRef][Medline]
Bowtell, D., and Sambrook, J. (2003). DNA Microarrays: A Molecular Cloning Manual, Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
Bradford, W. D., Cahoon, L., Freel, S. R., Hoopes, L.L.M., and Eckdahl, T. T. (2005). An inexpensive gel electrophoresis-based polymerase chain reaction method for quantifying mRNA levels. Cell Biol. Educ 4, 157168.[Medline]
Brewster, J. L., Beason, K. B., Eckdahl, T. T., and Evans, I. M. (2004). The Microarray Revolution: perspectives from educators. Biochem. Mol. Biol. Educ 32, 217227.
Bridge, A. J., Pebernard, S., Ducraux, A., Nicoulaz, A.-L., and Iggo, R. (2003). Induction of an interferon response by RNAi vectors in mammalian cells. Nat. Genet 34, 263264.[CrossRef][Medline]
Campbell, A. M., and Heyer, L. J. (2003). Discovering Genomics, Proteomics, and Bioinformatics, San Francisco: Benjamin Cummings.
Campbell, A. M. (2003). Public access for teaching genomics, proteomics, and bioinformatics. Cell Biol. Educ 2, 98111.[Medline]
Campbell, A. M. (2004). Open access: a PLoS for education. PLoS Biol 2, 560563.
Couzin, J. (2002). Breakthrough of the year. Small RNAs make big splash. Science 298, 22962297.
Couzin, J. (2004). RNAi shows cracks in its armor. Science 306, 11241125.
Elbashir, S. M., Martinez, J., Patkaniowska, A., Lendeckel, W., and Tuschl, T. (2001). Functional anatomy of siRNAs for mediating efficient RNAi in Drosophila melanogaster embryo lysate. EMBO J 20, 68776888.[CrossRef][Medline]
Fire, A., Xu, S., Montgomery, M. K., Kostas, S. A., Driver, S. E., and Mello, C. C. (1998). Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature 391, 806811.[CrossRef][Medline]
Hannon, G. J. (2003). RNAi: A Guide to Gene Silencing, Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
Heyer, L. J., Moskowitz, D. Z., Abele, J. A., Karnik, P., Choi, D., Campbell, A. M., Oldham, E. E., and Akin, B. K. (2005). MAGIC Tool: integrated microarray data analysis. Bioinformatics 21, 21142115.
Huppi, K., Martin, S. E., and Caplen, N. J. (2005). Defining and assaying RNAi in mammalian cells. Mol. Cell 17, 110.[CrossRef][Medline]
Jackson, A. L., Bartz, S. R., Schelter, J., Kobayashi, S. V., Burchard, J., Mao, M., Li, B., Cavet, G., and Linsley, P. S. (2003). Expression profiling reveals off-target gene regulation by RNAi. Nat. Biotechnol 21, 635637.[CrossRef][Medline]
Judge, A. D., Sood, V., Shaw, J. R., Fang, D., McClintock, K., and MacLachlan, I. (2005). Sequence-dependent stimulation of the mammalian innate immune response by synthetic siRNA. Nat. Biotechnol 23, 457462.[CrossRef][Medline]
Marshall, E. (2004). Getting the noise out of gene arrays. Science 306, 630631.
Marshall, W. S., and Kaiser, R. J. (2004). Recent advances in the high-speed solid phase synthesis of RNA. Curr. Opin. Chem. Biol 8, 222229.[CrossRef][Medline]
Miyagishi, M., and Taira, K. (2005). siRNA becomes smart and intelligent. Nat. Biotechnol 23, 946947.[CrossRef][Medline]
Reynolds, A., Leake, D., Boese, Q., Scaringe, S., Marshall, W. S., and Khvorova, A. (2004). Rational siRNA design for RNA interference. Nat. Biotechnol 22, 326330.[CrossRef][Medline]
Samuel, C. E. (2004). Knockdown by RNAiproceed with caution. Nat. Biotechnol 22, 280282.[CrossRef][Medline]
Scacheri, P. C., et al. (2004). Short interfering RNAs can induce unexpected and divergent changes in the levels of untargeted proteins in mammalian cells. Proc. Natl. Acad. Sci. USA 101, 18921897.
Shachak, A., Ophir, R., and Rubin, E. (2005). Applying instructional design theories to bioinformatics education in microarray analysis and primer design workshops. Cell Biol. Educ 4, 199206.[Medline]
Tan, P. K., Downey, T. J., Spitznagel, E. L., Jr, Xu, P., Fu, D., Dimitrov, D. S., Lempicki, R. A., Raaka, B. M., and Cam, M. C. (2003). Evaluation of gene expression measurements from commercial microarray platforms. Nucleic Acids Res 31, 56765684.
Que, Q., and Jorgensen, R. A. (1998). Homology-based control of gene expression patterns in transgenic petunia flowers. Dev. Genet 22, 100109.[CrossRef][Medline]
Zhang, D., Wells, M. T., Smart, C. D., and Fry, W. E. (2005). Bayesian normalization and identification for differential gene expression data. J. Comput. Biol 12, 391406.[CrossRef][Medline]
Zwick, M. E. (2005). Technology: a genome sequencing center in every lab. Eur. J. Hum. Genet 13, 11671168.[CrossRef][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | ARCHIVE | SEARCH | TABLE OF CONTENTS |