SciSight: Combining faceted navigation and research group detection for COVID-19 exploratory scientific search

Tom Hope, Jason Portenoy*, Kishore Vasan*, Jonathan Borchardt*, Eric Horvitz, Daniel S. Weld, Marti A. Hearst, and Jevin D. West
preprint  2020

Tl;DR: SciSight is a novel framework for exploratory search of COVID-19 research that integrates two key capabilities: first, exploring interactions between biomedical facets (e.g., proteins, genes, drugs, diseases, patient characteristics); and second, discovering groups of researchers and how they are connected.

Gender trends in CS authorship

Lucy Lu Wang, Gabriel Stanovsky, Luca Weihs, and Oren Etzioni
CACM  2020

Tl;DR: An analysis of 2.87 million computer science papers reveals that, if current trends continue, parity between the number of male and female authors will not be reached in this century. With optimistic projection models, gender parity is forecast to be reached by 2100 in CS, but projected to be reached within two to three decades in the biomedical literature.

China catching up to US in AI research

Field Cady and Oren Etzioni
blog  2019

Tl;DR: We analyzed over two million academic papers, and found that China has already surpassed the US in published AI papers. If current trends continue, China is poised to overtake the US in the most-cited 50% of papers this year, in the most-cited 10% of papers next year, and in the 1% of most-cited papers by 2025.

Quantifying Sex Bias in Clinical Studies at Scale With Automated Data Extraction

Sergey Feldman, Waleed Ammar, Kyle Lo, Elly Trepman, Madeleine van Zuylen, and Oren Etzioni
JAMA  2019

Tl;DR: We extracted counts of women and men from over 40k published clinical trial articles and found substantial underrepresentation of female participants in 7 of 11 disease categories, especially HIV/AIDS, chronic kidney diseases, and cardiovascular diseases.

Does ArXiv help increase citation counts?

Sergey Feldman, Kyle Lo, and Waleed Ammar
preprint  2018

Tl;DR: We explore the degree to which papers prepublished on arXiv garner more citations, in an attempt to paint a sharper picture of fairness issues related to prepublishing. We observe that papers submitted to arXiv before acceptance have, on average, 65% more citations in the following year compared to papers submitted after, even after accounting for variables such as venue and author influentialness.

A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications

Dongyeop Kang, Waleed Ammar, Bhavana Dalvi, Madeleine van Zuylen, Sebastian Kohlmeier, Eduard Hovy, and Roy Schwartz
NAACL  2018

Tl;DR: We present the first public dataset of scientific peer reviews available for research purposes, containing 14.7K paper drafts and the corresponding accept/reject decisions in top-tier venues.