|
|
|||||||
Letters to the Editor |

*Department of Geology and Center for Teaching and Learning, Idaho State University, Pocatello, ID 83209; and
Department of Physics, United States Air Force Academy, CO 80840
Reliability is a fundamental quality of the internal consistence of any measuring instrument. Researchers sometimes use "reliable" and its derivative terms, yet fail to address reliability. Bowers et al. (2005) claimed that the knowledge survey (KS) "does not reliably measure student learning as measured by final grades or exam questions." They addressed their purpose (p. 311) "to evaluate how closely students' performance track with their confidence in their knowledge of the course material," through correlating "plotted pre- and post-KS scores against final grades (p. 314)." This approach assumes that tests/grades of unknown reliability are appropriate standards for judging other measures. Their article offers a case study in drawing conclusions without considering reliability.
The split-halves Spearman-Brown reliability (R) measure (Jacobs and Chase, 1992) derives from the r (r) obtained from individuals' scores on two halves of a single test. It is a routine method for quantifying reliability and is applicable to both tests and knowledge surveys. When Bowers et al. attributed specific claims to us: "They report that KS results represent changes in students' learning (p. 311)," and "... are a good representation of student knowledge (p. 316)," they omitted mention of the surprising reliability that characterizes knowledge surveys' pre- and postcourse measures (Figure 1).
|
|
|
Unrealistic expectations for high numerical correlation coefficients between grades and knowledge surveys persist until one understands limits imposed by reliability. Thereafter, understanding permits better interpretations. The positive numerical correlations Bowers et al. (2005) reported as "low" between their postcourse knowledge surveys and grades then seem surprisingly high, given limits imposed by reliability of tests and grades.
We emphasized assessment of learning in classes through use of aggregate data (Nuhfer and Knipp, 2003), which differs from Bowers et al.'s evaluative efforts to predict an individual's grades from her/his knowledge surveys. At the class level, we were most impressed by Bowers et al.'s Figure 1. It revealed the pattern change from no correlation between grades and the precourse KS to a persistent positive correlation between grades and the postcourse KS. Their tools (tests and the knowledge survey) remained constant through their pre- and postcorrelations, so the profound pattern change seen in every class seems most simply explained by students' increased understanding of specific content. We suggest correlating the high-reliability pre- and postcourse KS measures as an additional change indicator.
Although Bowers et al.'s article derived from our work, neither the authors from Nuhfer's own Idaho State University campus nor the Journal's editors engaged us in review. The result is an article including several attributions to knowledge surveys and our thoughts/intentions that we disclaim. Readers who compare our ideas about knowledge surveys as presented by Bowers et al. with those published in our words (Nuhfer and Knipp, 2003; Theall et al., 2005; Wirth et al. 2005) should anticipate discrepancies.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
Jacobs, L. C., and Chase, C. I. (1992). Developing and Using Tests Effectively: A Guide for Faculty, San Francisco: Jossey-Bass.
Nuhfer, E. B., and Knipp, D. (2003). The knowledge survey: a tool for all reasons. To Improve the Academy. 21, 5078. http://www.isu.edu/ctl/facultydev/resources1.html (accessed 16 October 2006).
Theall, M., Abrami, P. C., Arreola, R., Franklin, J., Nuhfer, E., and Scriven, M. (2005). Valid Faculty Evaluation Data: Are There Any? AERA Annual Meetings Program Interactive Panel Presentation, American Educational Research Association Symposium, Montreal, 4 14, 240. Summaries available at http://www.cedanet.com/meta/AERA2005valid.pdf and http://www.isu.edu/ctl/facultydev/extras/MeaningEvalsfract_files/MeaningEvalsfract.htm (accessed 16 October 2006).
Wirth, K. R., Perkins, D., and Nuhfer, E. B. (2005). Knowledge surveys: a tool for assessing learning, courses, and programs. Geological Society of America Annual Meetings Program with Abstracts. 37(7), 119. http://gsa.confex.com/gsa/2005AM/finalprogram/abstract_97119.htm (accessed 16 October 2006).
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | ARCHIVE | SEARCH | TABLE OF CONTENTS |