Letter to the EditorFree Access

Set Theory, Logic, and Probability: The Integration of Qualitative Reasoning into Teaching Statistics for Quantitative Biology

Chenshu Liu
Ruijun Zhu

Department of Pathology and Cell Biology, Columbia University College of Physicians and Surgeons, New York, NY 10032

Integrated Program in Cellular, Molecular and Biomedical Studies, Graduate School of Arts and Sciences, Columbia University Medical Center, Columbia University College of Physicians and Surgeons, New York, NY 10032

Summer Medical and Dental Education Program (SMDEP) and MedPrep, Columbia University College of Physicians and Surgeons, New York, NY 10032

Search for more papers by this author

, and

Ruijun Zhu

Department of Pathology and Cell Biology, Columbia University College of Physicians and Surgeons, New York, NY 10032

Search for more papers by this author

Published Online:13 Oct 2017https://doi.org/10.1187/cbe.16-06-0184

DEAR EDITOR:

As the readers of CBE—Life Sciences Education know, modern biology has come a long way from its beginnings as a qualitative and descriptive science to its current status as a quantitative science, increasingly exploiting mathematical and computational tools to achieve mechanistic understanding of living systems (Howard, 2014; Liu and Mao, 2016). With the exponential increase in the amount of publications involving “quantitative biology” (Figure 1A; Corlan, 2004), it is important to remind ourselves and our students of the central role that qualitative and deductive reasoning continues to play in modern biology. The idea of probability is fundamental to qualitative reasoning and to learning biostatistics at the undergraduate level, as pointed out by Masel and colleagues in a recent article (Masel et al., 2015). We agree with the authors that probability not only provides the foundation for statistics course work but also is pivotal to implementing a logical and scientific way of thinking in the real world. Here we would like to echo the authors’ points and bring to the attention of readers a novel approach we have used in teaching probability in an undergraduate course. Specifically, at the beginning of the course, we have incorporated set theory, Venn diagrams, and basic propositional logic (Klement, 2004; Henle, 2007), which we believe were quite helpful to students in learning challenging concepts like tail probability and hypothesis testing.

The Venn diagram has become increasingly popular for representing data in a way that facilitates reasoning by propositional logic (Figure 1A, inset; Venn, 1888). We used Venn diagrams to visually represent basic operations (conjunction, disjunction, and negation) when first teaching frequentist probability. For example, we asked students to make an inference using data from Liu et al. (2015) regarding a given conditional statement (“if gene A is depleted then there will be errors in mitosis”) to decide which of the following is correct: 1) if gene A is not depleted mitosis will have no errors; 2) error-free mitosis requires the presence of gene A; or 3) if there are errors in mitosis, gene A must be depleted. Students were encouraged to draw Venn diagrams to depict the relationship between “depleting gene A” and “erroneous mitosis” (Figure 1B). Using the Venn diagram, students came to recognize that any event falling in the set of “depleting gene A” must also be within the set of “erroneous mitosis” (i.e., depleting gene A in a cell is sufficient to cause erroneous mitosis), but not vice versa. Meanwhile, any event outside the set of “depleting gene A” could still be within the realm of “erroneous mitosis.” Therefore, inferences 1 and 3 are deductively invalid, while inference 2—the contrapositive of the original conditional statement—is deductively valid and logically equivalent to the original (Figure 1B). With the aid of Venn diagrams, it became easier to understand the mechanisms of common fallacy, such as affirming the consequent (e.g., inference 3 in the example; Hempel, 1966).

We believe that the combination of Venn diagrams and basic propositional logic—in particular, the notion that a conditional statement is logically equivalent to its contrapositive—lays a foundation for introducing more complex topics of tail probability and hypothesis testing (Figure 1, C and D). Valid deductions can be performed based purely on the structure of propositions. If a certain observation (e.g., sample mean) is under an assumed distribution, then the probability of making this observation (or more extreme observations) should be fairly big; this “tail probability” is empirically considered “big” when greater than 5%. If, however, this probability is fairly small (lower than 5%), then it can be reliably inferred that the sample is under a different distribution (Figure 1C). A natural next step from the concept of tail probability is hypothesis testing, in particular why the test conclusion has to go with “unable to reject the null hypothesis” (i.e., the assumed distribution) rather than “accept the null hypothesis,” given a p value greater than 5%. Otherwise, it would be a fallacy of affirming the consequent. Finally, propositional logic can also be used to understand how one-sided versus two-sided hypothesis tests differ in their stringencies (Figure 1D).

We believe that engaging students in qualitative reasoning through the use of set theory and Venn diagrams (e.g., visualization of tail probability and “rejection region”) and the use of propositional logic (e.g., law of contraposition) holds unique potential to support students in learning basic statistics with quantitative data. We hope Masel and colleagues will continue to study how to effectively support students in qualitative reasoning that promotes their statistical understanding. Perhaps they or others will measure how informal experiences such as ours could contribute to developing the quantitative biologists of the future.

ACKNOWLEDGMENTS

This work was supported by a fund from the Robert Wood Johnson Foundation to C.L. in conjunction with the Statistics for Quantitative Biology summer course at Columbia University Medical Center from 2014 to 2015. The authors thank D. Mowshowitz (Columbia University) for insight on teaching.

REFERENCES

Corlan AD (2004). Medline trend: automated yearly statistics of PubMed results for any query In: http://dan.corlan.net/medline-trend.html (accessed 4 June 2016). Google Scholar

Hempel CG (1966). Philosophy of Natural Science, vol. 18, Englewood Cliffs, NJ: Prentice-Hall. Google Scholar

Henle JM (2007). An Outline of Set Theory, New York: Dover. Google Scholar

Howard J (2014). Quantitative cell biology: the essential role of theory. Mol Biol Cell 25, 3438-3440. Medline, Google Scholar

Klement KC (2004, Ed. J FieserB Dowden, Propositional logic In: Internet Encyclopedia of Philosophy, www.iep.utm.edu/prop-log (accessed 8 June 2016). Google Scholar

Liu C, Chuang J-Z, Sung C-H, Mao Y (2015). A dynein independent role of Tctex-1 at the kinetochore. Cell Cycle 14, 1379-1388. Medline, Google Scholar

Liu C, Mao Y (2016). Diaphanous formin mDia2 regulates CENP-A levels at centromeres. J Cell Biol 213, 415-424. Medline, Google Scholar

Masel J, Humphrey PT, Blackburn B, Levine JA (2015). Evidence-based medicine as a tool for undergraduate probability and statistics education. CBE Life Sci Educ 14, ar42. Link, Google Scholar

Venn J (1888). The Logic of Chance: An Essay on the Foundations and Province of the Theory of Probability, with Especial Reference to Its Logical Bearings and Its Application to Moral and Social Science, and to Statistics, 3d ed., rewritten and enlarged, London: Macmillan. Google Scholar

Vol. 15, No. 4

December 01, 2016

Metrics

Downloads & Citations

Downloads: 169

History

Information

© 2016 C. Liu and R. Zhu. CBE—Life Sciences Education © 2016 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

PDF download