|
|
|||||||
ARTICLES |
Department of Biology, Morgan State University, 1700 East Coldspring Lane, Baltimore, Maryland 21251
Submitted September 5, 2002; Revised November 5, 2002; Accepted November 19, 2002
| ABSTRACT |
|---|
|
|
|---|
Key Words: functional genomics bioinformatics helixloophelix proteins undergraduate student laboratory Caenorhabditis elegans
| INTRODUCTION |
|---|
|
|
|---|
By the end of the investigations described here, students were expected to meet the following objectives: 1) Students were to demonstrate a rudimentary knowledge of bioinformatics, including the ability to differentiate between "having" a gene and "expressing" a gene; and 2) students were to understand basic approaches to functional genomics, including one specific technique for assaying for gene expression. It was also anticipated that students would increase their skills at effectively communicating their research activities through written and/or oral presentation. Student assessment was based on their laboratory notebooks, research articles, and one in-class examination. To measure how strongly these investigations influenced student knowledge and understanding of bioinformatics, students completed pre- and post-activity surveys.
In addition to providing an in-class demonstration of functional genomics, this activity allowed students to participate actively in an ongoing research project aimed at identifying new HLH proteins that regulate C. elegans development. Student motivation and enthusiasm for this project increased once they realized that this activity was not a "canned" laboratory exercise and that their results would be used to analyze actual genes of interest. Since the implementation of this activity, professors in the biology department have seen an increase in the number of students who express an interest in gaining research experience. It seems likely, then, that the implementation of this activity into our curriculum will increase the number of undergraduate students who pursue professional careers as has been suggested previously (Boyer Commission on Educating Undergraduates in the Research University, 1998).
With traditional approaches to studying genetics, a scientist would associate a particular phenotype with a disease or disorder and work to identify the abnormal gene product thought to cause the disease. One classical example of such a traditional approach is the identification of the gene responsible for sickle cell anemia. J. Herrick officially described the phenotype for sickle cell anemia in 1910. This phenotype includes the characteristic sickle shape that deoxygenated red blood cells maintain in patients with the disease (Herrick, 1910). Once the phenotype was described, E.A. Beet and J.V. Noel each proposed in 1949 that sickle cell anemia is a recessive disorder, that an individual must inherit one "bad" or mutant copy of the unknown gene from each parent to display the sickle phenotype (Scott, 1983). Hemoglobin was suggested to be that "bad" protein after Pauling and others used electrophoresis to show that hemoglobin from sickle cell patients did not have the same electrical charge as hemoglobin from patients without the disease. Finally, in 1956, V. Ingram showed that the amino acid sequence of the hemoglobin protein is different in patients with sickle cell anemia. Scientists have since sequenced the gene for hemoglobin and have identified exactly which nucleotide is changed in people with sickle cell disease (for reviews see Schroeder, 1981; Scott, 1983).
In the case of sickle cell anemia, an animal model was not required. The role of hemoglobin in red blood cells was known before the mutation for sickle cell anemia was identified. Sometimes, however, scientists will create animal models of a particular disease and use those models to understand the function of the gene.
Functional genomics is defined as the study of gene expression to describe the functions of all genes in a genome (Griffiths, 1999). It is based entirely on the premise of the central dogma for molecular genetics, that DNA sequences are used as the template for RNA synthesis and that the RNA is subsequently used as a template for protein synthesis. A gene is expressed when RNA and protein are produced because of the sequence information provided within a specific region of DNA (Crick, 1970).
Functional genomics may be considered the reverse of traditional genetic approaches. For functional genomics, the genome of a model organism, such as the fruit fly, Drosophila melanogaster, or the flowering plant, Arabidposis thaliana, is sequenced (Reinke and White, 2002), and the sequencing data are stored in large databases. Powerful computer programs are then used to predict which sequences represent genes and which gene regions code for proteins, as well as to identify motifs characteristic for specific protein functions. The science of identifying genes and predicting protein products and functions through various computer algorithms is called bioinformatics. Molecular geneticists use bioinformatic approaches to select a gene for study, perhaps by first identifying a human gene of interest and looking for predicted similar genes in the model organism. Once predicted genes are identified, RNA analysis is used to determine if the predicted gene is expressed. Various techniques are then used to interfere with normal expression of the gene. The model organism is observed closely, and the resulting changes in the animal's phenotype indicate possible functions of the gene product. Molecular geneticists then design experiments to characterize the gene product further at the protein level (Baxevanis, 2001).
Caenorhabditis elegans is a nonparasitic soil nematode and is ideal for these types of functional genomics studies. First, C. elegans has been used extensively as a model of eukaryotic development and is ideal for classroom use because it is so inexpensively maintained. Second, the C. elegans genome has been completely sequenced and geneticists have already characterized hundreds of morphological, behavioral, and neurological phenotypes using traditional approaches (Kim, 2001). The availability of gene sequences makes it feasible to perform database searches and rudimentary sequence analysis in the classroom. Third, techniques for selectively eliminating gene expression have been fully developed in this organism. These techniques are easily modified for use in the classroom. Furthermore, DNA and RNA purification from mutant and/or normal strains of C. elegans can be completed within a single 2-h class period. For a brief introduction to C. elegans, visit the Caenorhabditis elegans Server at http://elegans.swmed.edu.
The C. elegans life cycle depends on the temperature at which it is grown. When maintained at 25°C, these animals complete their life cycle within 3 to 4 days. At 16°C, their life cycle is approximately 7 to 9 days. C. elegans are either hermaphrodites, which are capable of self-fertilization but cannot fertilize each other, or males, which can fertilize the hermaphrodites. When unfertilized, a normal hermaphrodite produces 300 progeny, while a hermaphrodite fertilized by a male will produce approximately 1000 progeny. Because the hermaphrodites are capable of self-fertilization, a single culture of C. elegans contains animals at different stages of their life cycles (Epstein and Shakes, 1995). These stages include fertilized eggs, four larval stages, and adult animals (see Figure 1).
|
| SEQUENCE ANALYSIS OF THE Caenorhabditis elegans GENOME |
|---|
|
|
|---|
As part of the laboratory exercise, students used three databases to perform a rudimentary sequence analysis of the selected genes. Sequence information is available in its simplest and most direct form at the C. elegans web site known as WormBase (http://www.wormbase.org). Students performed a sequence search in WormBase using the cosmid ID for their predicted gene, C17C3.8, F31A3.2, or F31A3.4. In WormBase, students were able to select links to display the spliced or unspliced DNA sequence or the deduced amino acid sequence for their gene. WormBase also provided students with information about the number and boundaries of the exons for their gene and the gene's chromosomal location.
To identify functional domains within the protein, students copied the deduced amino acid sequences into the SMART database (http://smart.embl-heidelberg.de/). In SMART, symbols are used to represent protein domains. Students used SMART to determine which amino acids within their protein make up the HLH domain. Finally, they also used the BLAST engine (http://www.ncbi.nlm.nih.gov) to identify non-C. elegans homologues of the predicted proteins. BLAST was also useful for performing alignments of their predicted proteins with other HLH proteins. After the 2-h laboratory session, students worked with their laboratory partners to address twelve questions pertaining to the sequence of their genes (see Table 1). They used their responses to write the results section of their research articles, which included a schematic of their assigned gene. Figure 2 is a sample of one student's schematic for the sequence C17C3.8.
|
|
| RNA ANALYSIS OF GENE EXPRESSION |
|---|
|
|
|---|
To determine if their specific gene is expressed, students isolated total RNA from mixed-stage populations of wild-type Caenorhabditis elegans hermaphrodites. Samples of the student protocols are available from the authors upon request. Students visualized their RNA by agarose gel electrophoresis and determined the RNA concentration by UV spectroscopy. Students then used approximately 1 µg of the total RNA in RT reactions. They subsequently used one-fourth of the RT reaction as the template in the PCR. As a negative control, total RNA that was not treated with reverse transcriptase was used as the template in the PCR. The positive control for the PCR contained genomic DNA as a template. After agarose gel electrophoresis, students correlated their sequence analysis data with the RT-PCR data. In the results section of their research articles, they were required to discuss these data and to suggest methods for confirming their preliminary RT-PCR data. Figure 3 shows sample RT-PCR results and analysis for one student group.
|
| ASSESSMENT OF STUDENT LEARNING |
|---|
|
|
|---|
|
When interpreting the results, however, approximately one student per group (25% of the class) did not associate the size differences between the genomic DNA and the cDNA fragments with splicing. The remaining 75% of the class clearly used the sequence analysis data to interpret the RT-PCR data. Namely, they were able to predict the sizes of the fragments to be expected from RT-PCR and they were able to suggest methods for confirming their preliminary data.
Further assessment included a weekly inspection of their laboratory notebooks. Students who kept detailed and accurate notes were more likely to understand the objectives of these exercises. Their notes often included questions about the theoretical aspects of the lab and indicated topics they needed to study further. For example, one question during the sequence analysis was to determine the expected size of the PCR products for their gene. One student made notes reminding herself to review how to determine where a given primer would bind. Likewise, students who simply copied the protocols were not able to provide well-written research articles at the end of the laboratory activity.
| ASSESSMENT OF LABORATORY EFFECTIVENESS |
|---|
|
|
|---|
|
Part B of the pre-/postactivity survey (Figure 5B) consists of 10 short-answer questions that deal with students' knowledge of gene expression, transcription, and the central dogma of genetics. The rubric used to assess the survey is included in Figure 5B. Many of the questions in this section of the survey were discussed during the lecture component of the class. As such, approximately 30% of the students were able to describe accurately the central dogma (question 1), gene expression (question 3), PCR (question 5), transcription factors (question 7), splicing (question 8), and cDNA (question 10) in the preassessment survey. During the postsurvey, 87% of the class gave at least an average description in response to each of those questions. Thirty percent of the students gave an excellent description in response to each of those questions. Prior to the activity, students were not able to name molecular biology or genetics databases (question 2) and were not able to describe nucleic acid quantitation accurately (question 6). Both of these questions were answered by 100% of the students after the activity was completed.
Part C of the survey (Figure 5C) was included only with the postactivity survey. This section consists of five questions that solicit comments and suggestions from the students about ways to improve the activity for future classes. Most students made comments that were extremely useful for improving the laboratory exercise. When asked, "Which activity in the sequence analysis segment best demonstrated how DNA sequence and protein sequence are related?" one student gave the following response.
It was really difficult to figure out which exons of my gene contained the helixloophelix domain. I was forced to go back and forth between SMART and Worm-Base because my answers didn't make sense. Then I realized I had to convert the amino acid numbers that SMART gave me to nucleotide number before I could use WormBase to find the exon. I hated doing that part but I will never forget how to do it.
When asked to describe the least helpful portion of the sequence analysis lab one student wrote,
Using WormBase to determine the sizes of the unspliced and spliced DNA could be done differently. Although the site clearly said spliced and unspliced, I did not connect that to introns and exons and splicing that we talked about in class until it was too late. You should give the students the unspliced sequence and make them pick out the spliced sequence themselves.
Students were asked for ways to improve this laboratory activity. They suggested adding another week to give them more time. The strongest suggestion was that another week be added where students are able to include assays to confirm their RT-PCR data. Another good suggestion from the students was that future classes look for genes that are more obviously related to human development.
| CONCLUSION AND FUTURE PLANS |
|---|
|
|
|---|
Many of our students choose to major in biology because they are interested in medical or dental school. Very few of them have been exposed to basic biomedical research, and many of the students believe that research is very much like the packaged, classroom laboratory exercise. One of the major goals in adapting this laboratory activity was to expose students to the actual research environment. This 2-week activity served as a reasonable "long-term" project that required students to keep accurate and detailed notes of their laboratory activities. Additionally, this was an activity where the answer was not predetermined, and in fact, there was no "correct" answer. This activity also stressed the essentials of experimental design and encouraged students to consider the next logical step in completing the goals of the research project. Students seemed more excited about the laboratory activity after they realized that their results provided preliminary data for an ongoing research project. At the end of the activity, more students (approximately 20% of the class) expressed an interest in summer internships and graduate programs.
Because student awareness of function genomics and bioinformatics did increase, and because more students were interested in information about research careers, this activity will remain a part of our course. The number of students who take this course during one semester varies between 16 and 25. As luck would have it, 24 students were enrolled in this course when this activity was first implemented. Students worked in groups of four to complete the laboratory exercises. Four was chosen as the maximum number of students in a group so that each student could be an active participant. Prior to each lab, students split the protocols so that each member of the group was responsible for a given segment. The best example of this is during the RNA isolation procedure. All four students were given cultures of nematodes to wash and pellet. These pellets were frozen and combined in one container and ground to a fine powder. While two students were working to grind the worms, the other two students were gathering the materials needed for the subsequent RNA isolation, labeling the microcentrifuge tubes, and preparing any of the necessary dilutions. Likewise, during the analysis of the RNA, one pair of students worked to make serial dilutions for spectrophotometric readings while the other pair prepared the agarose gels needed for electrophoresis. While the large number of students in this course was certainly not an ideal situation, it had the advantage of providing two sets of data for each gene sequence assayed. The undergraduate research assistants were given the job of resolving any conflicts between the two sets of data.
Clearly, one advantage of implementing this laboratory activity is that every biology major at Morgan State University will have some experience in laboratory research. It should be noted, however, that the size of the class will drastically affect the amount of time and effort that goes into preparing for the laboratory. We were able to implement this activity for the first time, despite the large class size, because of the efforts of two undergraduate students. These students helped with medium preparation and strain maintenance and optimized the PCR conditions for this activity. These students also prepared aliquots of buffers and other reagents provided in the commercial kit. Finally, the students did follow-up experiments that included repeating the RNA purifications and RT-PCR to confirm the results obtained by the class. Recent funding from the National Science Foundation will provide a laboratory technician to help with these activities in future classes.
Although this laboratory activity was considered a success, it is still a work in progress. There are several modifications planned for future classes. First, the laboratory activity will be extended another 4 weeks, so that it covers the entire first half of the semester. With this modification, students will have more time to isolate and quantitate the total RNA prior to RT-PCR. Second, students will clone their products from RT-PCR into a vector containing T7 and T3 promoter sequences. These vectors allow in vitro transcription using either T7 or T3 polymerase. Third, the students will synthesize double-stranded RNA in vitro and use that RNA for RNA interference assays. With the addition of by these activities, students will actually be able to see if the functions of their gene products are required for normal development. Adding these sections will also allow students to link the molecular aspects of genetics to the more classical aspects of gene transmission.
In this activity, students used sequences predicted to encode HLH transcription factors because these proteins regulate several critical processes in eukaryotic development and because that is the current research focus for our laboratory. Future implementations of this lab will probably use genes that produce more dramatic affects during RNA interference or analogues of human genes that are of interest to the students. This activity can easily be modified to assay the activity of any gene or gene family and to use RNA isolated from other eukaryotic systems. As a whole, it provides a wonderful mechanism for demonstrating the relevance and importance of functional genomics to understanding eukaryotic gene expression.
| APPENDIX |
|---|
|
|
|---|
Oligonucleotide Design for RT-PCR Oligonucleotide primers were ordered from Integrated DNA Technologies (www.idtdna.com) 6 weeks prior to the class activity. Primers corresponded to the termini of the coding region so that the entire gene or cDNA would be amplified in PCR. Primers were typically 15 to 21 nucleotides that were complementary to either strand of DNA and were designed so that their melting temperatures (Tm) were within 4°C of one another. Undergraduate research assistants optimized the PCR amplification conditions 4 weeks prior to the class activity. Primer sequences used for this study and PCR conditions are listed in Table A1.
|
Growth and Maintenance of Caenorhabditis Elegans Cultures The wild-type strain of C. elegans (N2) and the bacterial strain OP-50 (used for feeding C. elegans) can be obtained from the Caenorhabditis Genetics Center (http://biosci.umn.edu/CGC) by submitting a request by E-mail to stier{at}biosci.cbs.umn.edu. This request should be submitted at least 10 weeks ahead of time and should include a one-sentence description of what you will do with the strain. Strains are usually shipped by U.S. mail within 10 days of the request. Educational and nonprofit organizations receive the strains gratis.
Preparation for these laboratory activities will involve growing large cultures of Caenorhabditis elegans for purification of DNA and total RNA. Techniques for preparing the media, bacteria, and nematodes for use have been described previously in this journal (Guziewicz et al., 2002) and by Epstein and Shakes (1995).
Nematode Preparation for DNA Extractions. These nematodes were prepared (minimally 10 days before genomic DNA preparation. Twenty seeded 100 x 15-mm NGM agar plates were inoculated with 10 to 15 adult hermaphrodites 6 days prior to collection. Plates were incubated at 22°C, with the lid facing down, for the 6-day period. During this time, approximately two generations of hermaphrodites were produced. Since each hermaphrodite laid approximately 300 eggs, a total of approximately 18,000,000 animals was produced. At the end of 6 days, the animals were rinsed off the surface of the agar into a 50-ml conical centrifuge tube using sterile water. All 6 plates could be combined into a single tube. The animals were then centrifuged for 3 min at 3000g. The water was siphoned off using a sterile pipette and discarded. The worms were then resuspended in 50 ml of fresh sterile water and centrifuged again. After the water was removed, worm pellets were resuspended in 6 to 8 ml of EN buffer (100 mM NaCl, 10 mM EDTA, pH 7.6). Aliquots of 50 µl were transferred to 1.5-ml microcentrifuge tubes and frozen using a dry iceethanol bath. These frozen stocks could be stored indefinitely until used for DNA isolations.
Nematode Growth for RNA Isolations. Six days prior to the laboratory activity, four to eight seeded 100 x 15-mm NGM agar plates were inoculated with 10 to 20 wild-type hermaphrodites (adult animals) for each laboratory group. Animals were allowed to reproduce for 6 days at 22°C.
Isolation of Genomic DNA For the class of 24 students, six tubes of 50-µl worm aliquots were allowed to thaw on ice. After thawing, 450 µl of worm lysis buffer (0.1 M Tris, pH 8.5, 0.1 M NaCl, 50 mM EDTA, 1% sodium dodecyl sulfate [SDS]) and 20 µl of 20 mg/ml proteinase K (in Tris-EDTA [TE], pH 7.4) was added to each tube. The tubes were vortexed vigorously and then incubated at 62°C for 45 min. The tubes were vortexed at 5-min intervals throughout the incubation period. Clearing of the solution was used as an indication of worm lysis. After the incubation, each tube was first extracted with 500 µl of phenol, followed by a phenol:chloroform extraction and a subsequent chloroform extraction. The DNA was precipitated for 10 min at room temperature by adding 40 µl of 10 M NH4Oac and 750 µl of ethanol. Following centrifugation at 18,000g for 5 min, the DNA pellet was washed with 70% ethanol and centrifuged again. All of the ethanol was removed, and the pellet was air-dried prior to resuspension in 50 µl of TE, pH 7.4. After precipitation, genomic DNA could be stored at 20°C indefinitely.
Total RNA Purification Total RNA was purified using the RNeasy Total RNA Purification Kit (catalog No. 74104) from Qiagen, Inc. (Valencia, CA; www.qiagen.com). Each group of students was given four 100 x 15-mm NGM agar plates containing mixed-staged populations of C. elegans hermaphrodites. The animals were rinsed from the plates with approximately 12 mL of sterile distilled water per plate. The worms were pelleted by centrifugation at 3000g for 15 min and rinsed with an additional 50 ml of sterile water. The animal pellets were quick-frozen on dry ice, ground to a fine powder using a mortar and pestle, and homogenized using a Qiashredder column from Qiagen, Inc. (catalog No. 79654). The total RNA was treated with DNAfree (catalog No. 1906) from Ambion, Inc. (Austin, TX; www.ambion.com) to remove contaminating genomic DNA. The yield of RNA was determined spectrophotometrically, and the quality of the RNA was checked by gel electrophoresis. In the classroom, the purification process took 1.5 h. The most time involved collecting the animals and grinding them for subsequent purification. To streamline this process, instructors might consider collecting and freezing the animals prior to the class activity, but this may decrease the levels of RNA obtained. RNA quantitation and gel electrophoresis were not performed in the same class period as the purification step.
RT-PCR Analysis of Gene Expression RT and PCR were performed in separate tubes. Generally, students used 1.0 µg of total RNA in RT reactions with a 1.0 µM concentration of each oligonucleotide primer. Prior to the addition of buffers and enzyme, students incubated the primers with the RNA at 70°C for 10 min. The RNAprimer mixture was placed immediately on ice. Reaction components were then added and RT was carried out using the OmniScript Reverse Transcriptase Kit (catalog No. 205110; Qiagen, Inc.), following the manufacturer's directions. As a negative control, each group handled a second 1.0-µg aliquot of total RNA in the same manner but did not add the enzyme reverse transcriptase.
PCR was performed using the Taq PCR Core Kit (catalog No. 201223) from Qiagen, Inc. Control tubes as the PCR contained 100 ng of genomic DNA as template. For RT-PCR, one-fourth of the RT reaction (or the RT control) was added to the tube for PCR. PCR was performed in a PerkinElmer thermocycler. The program consisted of an initial denaturation step at 95°C for 5 min, followed by 30 cycles at 94°C for 1 min, 58°C for 1 min, and 72°C for 1 min. The program ended with a final elongation step at 72°C for 10 min.
Students completed the RT and PCR setup in one class period. The reactions were removed from the thermocycler by the instructors and frozen at 20°C until the next class period. At that time, PCR products were analyzed by gel electrophoresis.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
* Corresponding author. E-mail address: cjohnso9{at}morgan.edu.
| REFERENCES |
|---|
|
|
|---|
Boyer Commission On Educating Undergraduates in the Research University (1998). Reinventing Uudergraduate Education: A Blueprint for America's Research Universities. http://naples.cc.sunysb.edu/Pres/boyer.nsf.
Crick, F. (1970). Central dogma of molecular biology.Nature . 227,561 563.[CrossRef][Medline]
Epstein, H.F., and Shakes, C. (1995).Caenorhabditis elegans, Modern Biological Analysis . San Diego: Academic Press.
Gilbert, S.F. (2000). Developmental Biology, 6th ed. Sunderland, MA: Sinauer Associates.
Griffiths, A.J.F. (1999). Introduction to Genetic Analysis. New York: W.H. Freeman.
Guziewicz, M., Vitullo, T., Simmons, B., and Kohn, R.E.
(2002). Analyzing defects in the Caenorhabditis elegans nervous
system using organismal and cell biological approaches. Cell Biol.
Educ. 1,18
25.
Herrick, J.B. (1910). Peculiar elongated and sickle-shaped red blood corpuscles in a case of severe anemia. Arch. Intern. Med. 6,517 521.
Kim, S.K. (2001). Functional genomics: The worm scores a knockout. Curr. Biol. 11,R85 R87.[CrossRef][Medline]
Ledent, V., and Vervoort, M. (2001). The basic
helix-loop-helix protein family: Comparative genomics and phylogenetic
analysis. Genome Res 11,754
770.
Massari, M.E., and Murre, C. (2000). Helix-loop-helix
proteins: Regulators of transcription in eukaryotic organisms. Mol.
Cell. Biol. 20,429
440.
Reinke, V., and White, K.P. (2002). Developmental genomic approaches in model organisms. Annu. Rev. Genomics Hum. Genet. June 6.
Schroeder, W.A. (1981). The background of our knowledge of the mutant hemoglobins. Tex. Rep. Biol. Med. 40,137 42.
Scott, R.B. (1983). Historical review of legislative and national initiatives for sickle cell disease. Am. J. Pediatr. Hematol. Oncol. 5,346 351.[Medline]
van Ommen, G.J. (2002). The Human Genome Project and the future of diagnostics, treatment and prevention. J. Inherit. Metab. Dis. 25,183 188.[CrossRef][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | ARCHIVE | SEARCH | TABLE OF CONTENTS |