About

Hi! I'm Jake, a PhD student in the Department of Economics at Harvard University. I graduated from Yale College in 2020, where I pursued a double major in Statistics and Data Science (S&DS) and Ethics, Politics, and Economics (EP&E).

I'm interested in topics at the intersection of economics, computer science, and statistics.

Curriculum Vitae | Google Scholar | Semantic Scholar

Publications

Efficient OCR for Building a Diverse Digital History

Carlson, Jacob, Tom Bryan, and Melissa Dell. "Efficient OCR for Building a Diverse Digital History." ACL (2024). Oral presentation.

Paper Codebase

American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers

Dell, Melissa, Jacob Carlson, Tom Bryan, Emily Silcock, Abhishek Arora, Zejiang Shen, Luca D'Amico-Wong, Quan Le, Pablo Querubin, and Leander Heldring. "American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers." NeurIPS D&B (2023).

Paper Dataset

EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge

Bryan, Tom, Jacob Carlson, Abhishek Arora, and Melissa Dell. "EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge." EMNLP SD (2023).

Paper | Python Package

Dyadic Clustering in International Relations

Carlson, Jacob, Trevor Incerti, and P. M. Aronow. "Dyadic Clustering in International Relations." Political Analysis (2024).

Paper | Replication | R Package | Stata Command

LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis

Shen, Zejiang, Ruochen Zhang, Melissa Dell, Benjamin Lee, Jacob Carlson, and Weining Li. "LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis." ICDAR (2021). Oral presentation.

Paper Website

Working Papers

A Unifying Framework for Robust and Efficient Inference with Unstructured Data

Carlson, Jacob, and Melissa Dell. "A Unifying Framework for Robust and Efficient Inference with Unstructured Data." (2025).

Paper

True and Pseudo-True Parameters

Andrews, Isaiah, Harvey Barnhard, and Jacob Carlson. "True and Pseudo-True Parameters." (2024).

Paper

Contact

Email: See Curriculum Vitae

Twitter/X: @J_S_Carlson