The NCTE transcripts: a dataset of elementary math classroom transcripts


Journal article


Dorottya Demszky, Heather Hill
arXiv preprint arXiv:2211.11772, arXiv, 2023 May


Link <<
Cite

Cite

APA   Click to copy
Demszky, D., & Hill, H. (2023). The NCTE transcripts: a dataset of elementary math classroom transcripts. ArXiv Preprint ArXiv:2211.11772. https://doi.org/10.48550/arXiv.2211.11772


Chicago/Turabian   Click to copy
Demszky, Dorottya, and Heather Hill. “The NCTE Transcripts: a Dataset of Elementary Math Classroom Transcripts.” arXiv preprint arXiv:2211.11772 (May 2023).


MLA   Click to copy
Demszky, Dorottya, and Heather Hill. “The NCTE Transcripts: a Dataset of Elementary Math Classroom Transcripts.” ArXiv Preprint ArXiv:2211.11772, arXiv, May 2023, doi:10.48550/arXiv.2211.11772.


BibTeX   Click to copy

@article{demszky2023a,
  title = {The NCTE transcripts: a dataset of elementary math classroom transcripts},
  year = {2023},
  month = may,
  journal = {arXiv preprint arXiv:2211.11772},
  publisher = {arXiv},
  doi = {10.48550/arXiv.2211.11772},
  author = {Demszky, Dorottya and Hill, Heather},
  howpublished = {},
  month_numeric = {5}
}

Abstract

Classroom discourse is a core medium of instruction - analyzing it can provide a window into teaching and learning as well as driving the development of new tools for improving instruction. We introduce the largest dataset of mathematics classroom transcripts available to researchers, and demonstrate how this data can help improve instruction. The dataset consists of 1,660 45-60 minute long 4th and 5th grade elementary mathematics observations collected by the National Center for Teacher Effectiveness (NCTE) between 2010-2013. The anonymized transcripts represent data from 317 teachers across 4 school districts that serve largely historically marginalized students. The transcripts come with rich metadata, including turn-level annotations for dialogic discourse moves, classroom observation scores, demographic information, survey responses and student test scores. We demonstrate that our natural language processing model, trained on our turn-level annotations, can learn to identify dialogic discourse moves and these moves are correlated with better classroom observation scores and learning outcomes. This dataset opens up several possibilities for researchers, educators and policymakers to learn about and improve K-12 instruction. 

Share



Follow this website


You need to create an Owlstown account to follow this website.


Sign up

Already an Owlstown member?

Log in