How do first-year writing students cite research?
A large-scale citation analysis
Research-based writing is a foundational component of freshman and advanced writing programs across the nation. Regardless of scope or prompt, research-based writing springboards students into conversations about information literacy, academic integrity, and source evaluation. Just as freshman composition serves as the near-universal introduction to college-level writing and analysis, the research-based writing assignment introduces students to the concepts of academic research and scholarly conversation. My presentation will showcase citation data analyzed from over 2000 freshman composition citations in order to help define weaknesses in freshman information literacy abilities and source discrimination techniques.
literature review + research questions
Many researchers have already begun to probe student citations for what they can uncover about student research and writing patterns. The citation project, for example, collects and analyzes student citations in order to discuss plagiarism in first-year writing (Moore; Sandra; Serviss). These projects dig into student writing and examine the context around a student’s in-text citations. Citation analysis has also been used to evaluate the effectiveness of one-shot library instruction (Mohler; Silfen and Zgoda; Ursin and Lindsay; Howard), the impact of course-mandated citation guidelines (Carlson; Davis; Robinson and Schlegl), and the frequency of different source-types in bibliographies (Cooke and Rosenthal; Gadd, Baldwin and Norris; Jenkins; Krause; Leiding; Mill). Previous studies have used rubrics that measure sources on scales such as “outstanding,” “acceptable,” or “unacceptable” (Lantz). These rubrics assign value to sources without categorizing what types of sources are appearing with relative frequency. While those citation analysis studies help to improve research-based writing by examining student writing bordering in-text citations, so far very few of these studies attempt to define the shape of student citation patterns in the works cited page itself–specifically within the first-year writing classroom. This presentation will do just that by attempting to answer the following questions:
To uncover the shape of research in this corpus, we conducted a citation analysis on 200 papers and 2048 citations from students enrolled in a 2019 semester of first-year writing. Because we had many questions, we broke data analysis into 4 passes. The purpose and methodology for each pass is outlined below:
Pass 1: Organize and clean data
Pass 2: Perform citation analysis
Pass 3: Assess accuracy of sources (superficial “correctness”)
Pass 4: Label publisher for each source
In this pass, all citations were moved from de-identified papers stored as .pdf files to a single spreadsheet. Each citation was given a unique identifier in combination with its parent research paper and source number within that paper. In this process, 4 papers had no bibliography or 0 sources and were removed from subsequent passes. The year of each citation was also collected in this pass.
results + initial discussion
What types of sources are students citing?
This graph on the right shows that students are overwhelmingly citing peer-reviewed, academic sources. Their next go-to source types are popular and self-published sources.
This is just another way to look at what types of sources students are using. Maybe it’s easier for you contextualize this data when it is seen as a percentage of a whole instead of just as a bar.
How accurately are students citing sources?
While these categories are rather broad, they do help inform us that students are generally able to mimic the forms of MLA convention.
How old are student sources?
Not old. In fact, very, very young. I broke this data down into discrete sections to see how old popular sources were, how old books were, how old academic articles were, but there is not much difference.
How many sources are students using?
Students are using ~10 sources per paper on average. I’m pretty sure that the template assignment page that the new GSIs get every August says 8-10, so that is about as expected. But now we know.
What sources are students citing the most?
The most popular sources are almost all popular news outlets. In fact, the first non-popular source that appears “PLoS ONE” is a database similar to EEBSCO rather than an individual magazine or publisher.
There is a huge spread. The most popular source is only 1.4% of the total citations. Of the 1037 citations coded in pass 4, there 837 sources used only once.
For a full list of the most popular sources, click the tab below labeled “Open full list.”