Skip to main content
Cornell university
Cornell Statistics and Data Science Cornell Statistics and Data Science
  • About Us

    About Us
    Cornell's Department of Statistics and Data Science offers four programs at the undergraduate and graduate levels. Undergraduates can earn a BA in statistical science, social...

    Welcome to the Department of Statistics and Data Science
    History
    Facilities
    Statistics Graduate Society
    Recently Published Papers
  • Academics

    Academics

    Undergraduate
    PhD
    MPS
    PhD Minor in Data Science
    Courses & Course Enrollment
  • People

    People

    Faculty
    Field Faculty
    PhDs
    Emeritus Faculty
    Academic Staff
    Staff
    Research Areas of Expertise
    Statistical Consultants
  • News and Events

    News and Events

    Events
    News
  • Resources

    Resources

    Professional Societies and Meetings
    Affiliated Groups
    Career Services
    Cornell Statistical Consulting Unit
  • Alumni

    Alumni
    Cornell's Statistics and Data Science degrees prepare students for a wide variety of careers, from academia to industry.  See the After Graduation page for a general overview of...

    Alumni Profiles

Search form

You are here

  1. Home

Researchers prefer same-gender co-authors, study confirms

A black and white graphic showing silhouettes of men and women with the genders separated by a dotted line
Tuesday, August 15, 2023

Researchers are more likely to pen scientific papers with co-authors of the same gender, a pattern that cannot be simply explained by the varying gender representation across scientific disciplines and time, according to joint research from Cornell and the University of Washington.

Mining a digital corpus of 560,000 published research articles over a 50-year period, the research team observed consistent gender homophily – the tendency of authors to collaborate with others who share their gender. While this observation by itself is not new, the researchers went further, using novel methods to rule out seemingly logical explanations – like a field’s gender balance, say, or a field’s authorship norms for writing research papers.

The team’s findings suggest a behavioral component is in play when scientists seek out collaborators.

“One might posit that the gender homophily observed when considering a broad scientific landscape may be due to varying levels of gender representation,” said Y. Samuel Wang, assistant professor of statistics and data science and author of “Gender-Based Homophily in Collaborations Across a Heterogeneous Scholarly Landscape,” which was published in PLOS One in April. “However, our findings suggest that something beyond that is going on.”

Comprising scholars in statistics, information science, biology and philosophy, the team mined a massive corpus of articles published between 1960 and 2011 from the online repository JSTOR. To help link genders to more than 800,000 author names, the team relied on social security records and crowdsourced data. Because of the limitations in the data set, this research was limited to men and women authors and didn’t factor in nonbinary identities, researchers said.

From there, the team grouped authors from the same fields and eras, creating 50,000 hypothetical reconfigurations of authors.

“We re-simulated hypothetical datasets. Our thinking was: How different is what we actually observed versus these hypothetical scenarios that we constructed?” said Wang, also an assistant professor of social statistics in the ILR School. “Very different, it turns out. This suggests that some other source of homophily is occurring in the data we observed.”

As to why researchers tend to collaborate with those of the same gender, Wang and his team can’t say definitively. Data science methods can’t measure intent – why collaborators choose who they choose, but the team’s findings suggest that consideration of gender may be a factor, Wang said.

Wang’s University of Washington co-authors are: Carole J. Lee, associate professor of philosophy; Jevin D. West, associate professor in the Information School; Carl T. Bergstrom, professor of biology, and Elena A. Erosheva, professor of statistics and social work.

This research was supported by the National Science Foundation and the University of Washington’s Royalty Research Fund.

Louis DiPietro is a writer for the Cornell Ann S. Bowers College of Computing and Information Science.

News Categories

  • News
  • Student Profiles
  • Home
  • About Us
  • Contact Us
  • Careers
© Cornell University Department of Statistics and Data Science

1198 Comstock Hall, 129 Garden Ave., Ithaca, NY 14853

Social Menu

  • Facebook
  • Twitter
  • YouTube
Cornell Bowers CIS College of Computing and Information Science Cornell CALS ILR School

If you have a disability and are having trouble accessing information on this website or need materials in an alternate format, contact web-accessibility@cornell.edu for assistance.