Publications
-
The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces
- ArXiv preprint, 2023
-
The semantic scholar open data platform
- ArXiv preprint, 2023
-
MSˆ2: A Dataset for Multi-Document Summarization of Medical Studies
- EMNLP, 2021
-
Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
- NAACL, Human Language Technologies, 2021
-
Improving the accessibility of scientific documents: Current state, user needs, and a system solution to enhance scientific PDF accessibility for blind and low vision users
- ArXiv, 2021
-
SciA11y: Converting Scientific Papers to Accessible HTML
- ASSETS 2021 Posters and Demonstrations, Artifact Award 1st Place
-
MedICaT: A Dataset of Medical Images, Captions, and Textual References
- EMNLP, 2020
-
SCIREX: A Challenge Dataset for Document-Level Information Extraction
- ACL, 2020
-
Fact or Fiction: Verifying Scientific Claims
- EMNLP, 2020
-
Quantifying Sex Bias in Clinical Studies at Scale With Automated Data Extraction
- JAMA Network Open, 2019
-
Structural Scaffolds for Citation Intent Classification in Scientific Publications
- NAACL, Human Language Technologies, 2019
-
Construction of the Literature Graph in Semantic Scholar
- NAACL, Human Language Technologies, 2018
-
A Dataset of Peer Reviews (PeerReaD): Collection, Insights and NLP Applications
- NAACL, Human Language Technologies, 2018
-
Apoptosis-related Genes Control Autophagy and Influence DENV-2 Infection in the Mosquito Vector, Aedes Aegypti
- Insect Biochemistry & Molecular Biology, September 2016