Professor Heather Haveman and doctoral candidate Jaren Haber are analyzing about 70,000 research articles gathered from JSTOR, the leading online repository of journal articles for the social sciences. We are developing a flexible and reproducible method for reviewing academic literature that takes advantage of massive online collections containing nearly all articles published in academic journals. The goal is to harness computers to review the entire corpus of published literature by charting engagement with specific theories or topics over time and across subfields. Specifically, we are developing a method to construct and validate dictionaries: lists of terms (unigrams, bigrams, and trigrams) related to a specific theory or topic. We are looking for research apprentices to run models that predict hand-coded scores on a subsample of documents using n-grams and various word embedding models, and to visualize and convey the results.
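
To make the dictionary idea concrete, here is a minimal sketch in Python of how a document's engagement with a topic might be counted from a list of unigrams, bigrams, and trigrams. It illustrates the general technique only, not the project's actual pipeline: the function names, the example dictionary, and the example text are all hypothetical, and the tokenizer is deliberately simple.

```python
import re
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams in a token list, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def dictionary_counts(text, dictionary):
    """Count how often each dictionary term (1- to 3-grams) occurs in text.

    Assumption: lowercase alphabetic tokenization is good enough for
    illustration; punctuation and sentence boundaries are ignored.
    """
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter()
    for n in (1, 2, 3):
        for gram in ngrams(tokens, n):
            if gram in dictionary:
                counts[gram] += 1
    return counts


# Hypothetical dictionary and text, invented for this example.
example_dictionary = {
    "legitimacy",
    "institutional logic",
    "resource dependence theory",
}
example_text = (
    "Institutional logic shapes legitimacy. "
    "Legitimacy also follows from resource dependence theory."
)

counts = dictionary_counts(example_text, example_dictionary)
for term, count in sorted(counts.items()):
    print(term, count)
```

Per-document counts like these could then serve as features in a model that predicts hand-coded scores, alongside or in comparison with word embedding features.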

Spring 2020
Social Sciences