I’m an Assistant Professor of Information Science in the Luddy School of Informatics, Computing, and Engineering at Indiana University Bloomington. I’m also an Adjunct Assistant Professor of Linguistics in the College of Arts & Sciences. Before coming to Indiana, I was a Neukom Fellow at the Neukom Institute for Computational Science and the Leslie Center for the Humanities at Dartmouth College.
My research explores the uses of statistics and machine learning in the study of very large text collections (of novels, academic journals, book reviews, newspapers, etc.). Much of my recent work falls under the heading of sociology of literature.
- 2022-06-22 New paper at JCDL 2022: “Reliable editions from unreliable components: estimating ebooks from print editions using profile hidden markov models” https://dl.acm.org/doi/10.1145/3529372.3533292. An extended version is available on arXiv, https://arxiv.org/abs/2204.01638.
- 2022-01-10 Open access version of Humanities Data Analysis: Case Studies with Python now available at https://www.humanitiesdataanalysis.org/.
- 2021-04-19 New paper with Haining Wang and Patrick Juola at EACL 2021: “Mode Effects’ Challenge to Authorship Attribution.” https://aclanthology.org/2021.eacl-main.97.pdf
- 2021-03-17 New paper with Michael Betancourt: “Reassembling the English Novel, 1789-1919.” Cultural Analytics, 2021. https://doi.org/10.22148/001c.19102.
- 2021-01-10 Humanities Data Analysis: Case Studies with Python available. Open access version will be released in January 2022 at https://www.humanitiesdataanalysis.org/.
Luddy Hall 2124
700 N. Woodlawn Avenue
Bloomington, IN 47408
ORCID ID: 0000-0002-4967-0879