ASCB logo LSE Logo

Successful Integration of Data Science in Undergraduate Biostatistics Courses Using Cognitive Load Theory

    Published Online:https://doi.org/10.1187/cbe.19-02-0041

    Biostatistics courses are integral to many undergraduate biology programs. Such courses have often been taught using point-and-click software, but these programs are now seldom used by researchers or professional biologists. Instead, biology professionals typically use programming languages, such as R, which are better suited to analyzing complex data sets. However, teaching biostatistics and programming simultaneously has the potential to overload the students and hinder their learning. We sought to mitigate this overload by using cognitive load theory (CLT) to develop assignments for two biostatistics courses. We evaluated the effectiveness of these assignments by comparing student cohorts who were taught R using these assignments (n = 146) with those who were taught R through example scripts or were instructed on a point-and-click software program (control, n = 181). We surveyed all cohorts and analyzed statistical and programming ability through students’ lab reports or final exams. Students who learned R through our assignments rated their programming ability higher and were more likely to put the usage of R as a skill in their curricula vitae. We also found that the treatment students were more motivated, less frustrated, and less stressed when using R. These results suggest that we can use CLT to teach challenging material.