medRxiv

Inferring Gender from First Names: Comparing the Accuracy of Genderize, Gender API, and the gender R Package on Authors of Diverse Nationality

Alexander D. VanHelene, Ishaani Khatri, C. Beau Hilton, Sanjay Mishra, Ece D. Gamsiz Uzun, Jeremy L. Warner
doi: https://doi.org/10.1101/2024.01.30.24302027
Alexander D. VanHelene
1Lifespan Cancer Institute, Rhode Island Hospital, Providence, Rhode Island
2Center for Clinical Cancer Informatics and Data Science, Legorreta Cancer Center, Brown University, Providence, Rhode Island
BS
Ishaani Khatri
3Warren Alpert Medical School, Brown University, Providence, Rhode Island
BS
C. Beau Hilton
4Department of Internal Medicine, Vanderbilt University, Nashville, Tennessee
MD
Sanjay Mishra
1Lifespan Cancer Institute, Rhode Island Hospital, Providence, Rhode Island
2Center for Clinical Cancer Informatics and Data Science, Legorreta Cancer Center, Brown University, Providence, Rhode Island
3Warren Alpert Medical School, Brown University, Providence, Rhode Island
MS, PhD
Ece D. Gamsiz Uzun
2Center for Clinical Cancer Informatics and Data Science, Legorreta Cancer Center, Brown University, Providence, Rhode Island
3Warren Alpert Medical School, Brown University, Providence, Rhode Island
5Center for Computational Molecular Biology, Brown University, Providence, Rhode Island
6Department of Pathology and Laboratory Medicine, Brown University, Providence, Rhode Island
MS, PhD, FAMIA
Jeremy L. Warner
1Lifespan Cancer Institute, Rhode Island Hospital, Providence, Rhode Island
2Center for Clinical Cancer Informatics and Data Science, Legorreta Cancer Center, Brown University, Providence, Rhode Island
3Warren Alpert Medical School, Brown University, Providence, Rhode Island
MD, MS, FAMIA, FASCO
  • For correspondence: jeremy_warner@brown.edu

Abstract

Meta-researchers commonly leverage tools that infer gender from first names, especially when studying gender disparities. However, tools vary in their accuracy, ease of use, and cost. The objective of this study was to compare the accuracy and cost of the commercial software Genderize and Gender API, and the open-source gender R package. Differences in binary gender prediction accuracy between the three services were evaluated. Gender prediction accuracy was tested on a multi-national dataset of 32,968 gender-labeled clinical trial authors. Additionally, two datasets from previous studies with 5,779 and 6,131 names, respectively, were re-evaluated with modern implementations of Genderize and Gender API. The gender inference accuracies of Genderize and Gender API were compared, both with and without supplying trialists’ country of origin in the API call. The accuracy of the gender R package was only evaluated without supplying countries of origin, since the package’s reference datasets do not provide that functionality. The accuracy of Genderize, Gender API, and the gender R package was defined as the percentage of correct gender predictions. Accuracy differences between methods were evaluated using McNemar’s test. Genderize and Gender API demonstrated overall 96.6% and 96.1% accuracy, respectively, when countries of origin were not supplied in the API calls. Genderize and Gender API achieved the highest accuracy when predicting the gender of German authors, with accuracies greater than 98%. Genderize and Gender API were least accurate with South Korean, Chinese, Singaporean, and Taiwanese authors, demonstrating below 82% accuracy. The gender R package achieved below 86% accuracy on the full dataset. In the replication studies, Genderize and Gender API demonstrated better performance than in the original publications. Our results indicate that Genderize and Gender API are highly accurate, except when evaluating South Korean, Chinese, Singaporean, and Taiwanese names. We also demonstrated that Genderize can provide similar accuracy to Gender API while being 4.85x less expensive.

Author Summary

Gender disparities in academia have prompted researchers to investigate gender gaps in professorship roles and publication authorship. Of particular concern are the gender gaps in cancer clinical trial authorship. Methodologies that evaluate gender disparities in academia, such as those that determine the gender ratios of academic departments or of publishing authors in a discipline, often rely on tools that infer gender from first names. However, researchers must choose between different gender prediction tools that vary in their accuracy, ease of use, and cost. We evaluated the binary gender prediction accuracy of Genderize, Gender API, and the gender R package on a gold-standard dataset of 32,968 clinical trialists from around the world. Genderize and Gender API cost money to use, while the gender R package is free and open source. We found that Genderize and Gender API were more accurate than the gender R package. In addition, Genderize is cheaper than Gender API, but is more sensitive to inconsistencies in name formatting and the presence of diacritical marks. Both Genderize and Gender API were most accurate with Western names.

Introduction

One of the most well-documented disparities in STEM is gender disparity [1,2]. This issue is especially notable in the cancer clinical trial domain, with underrepresentation of women in the leadership of pivotal trials documented as recently as within the last decade [3]. The study of gender disparity in scientific authorship often requires the determination of gender from very limited data, e.g., author forenames. Software [4–9] that infers gender from forenames could potentially enable researchers to automate gender prediction in large datasets. Commercial gender prediction services [10,11] such as Genderize and Gender API programmatically predict gender from first names. The gender R package [12] is an open-source alternative to these proprietary gender prediction tools.

Gender prediction software has demonstrated high accuracy when evaluating Western first names, but often falters when evaluating names from Asian cultures [13]. Further, the presence of diacritical marks and hyphens reportedly affects the accuracy of gender prediction in some tools [14]. Few studies [15] to date have evaluated differences in the accuracy of gender prediction software between Western and non-Western names. To our knowledge, no studies have evaluated how different ways of delimiting two-part first names (e.g., Jean-Pierre vs. Jean Pierre vs. Jeanpierre) affect gender prediction accuracy.

We compared the gender prediction accuracy of Genderize, Gender API, and the gender R package using a large manually curated registry of cancer clinical trialists with labeled genders and diverse nationalities. In addition, we quantified the accuracy of these tools by author nationality and compared different strategies for delimiting two-part forenames, which are common in the English language spelling of Korean, Chinese, Singaporean, and Taiwanese names.

Materials and Methods

Three gender prediction tools: 1) Genderize; 2) Gender API; and 3) the gender R package, were tested on a gold-standard registry of cancer clinical trialists with manually determined binary gender. Trialists’ names and affiliations were sourced from the HemOnc knowledge base, [16] a continually growing resource created to capture the standard-of-care treatments in the fields of hematology and oncology. The binary gender classifications used in our study refer to socially constructed gender categories, not biological sex [17,18]. Names in HemOnc are primarily sourced from the MEDLINE records of published clinical trials and undergo extensive normalization to account for the presence of diacritics, middle initials, misspellings, multipart last names represented as middle names, and other variations. When first names are not available through MEDLINE, the original manuscripts are examined for this information. Binary gender is determined by a combination of automated mappings of typically masculine or feminine forenames (e.g., John; Rebecca), web searches of publicly available information such as biographies on academic web pages, and consensus determinations including consultation with native speakers. If gender cannot be determined after these efforts, the author is labeled as “unknown gender”. A subset of journals does not provide forenames; in these cases, the gender is labeled as “could not be determined.” Country affiliations sourced from MEDLINE also undergo extensive normalization.

Gender prediction accuracy was defined as the percent of individuals whose gender was correctly predicted, as compared to the gold standard dataset. The percent of incorrect gender predictions and the percent of names with no predicted gender were also calculated. For binary statistical tests, gender predictions were categorized as successes or failures – correct gender predictions were defined as successes, while names with incorrect or absent predictions were failures.
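As an illustration of the three percentages defined above, the following minimal Python sketch tallies correct, incorrect, and missing predictions against gold-standard labels. It is not the study's code (the analysis was run in R); the function name `summarize_predictions` and the use of `None` to represent an absent prediction are assumptions made for this example.

```python
from collections import Counter


def summarize_predictions(gold, predicted):
    """Return the percent of correct, incorrect, and missing predictions.

    gold: gold-standard labels, e.g. "female" or "male".
    predicted: predicted labels; None marks a name with no prediction
    (an assumption of this sketch).
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("at least one name is required")

    counts = Counter()
    for gold_label, predicted_label in zip(gold, predicted):
        if predicted_label is None:
            counts["missing"] += 1
        elif predicted_label == gold_label:
            counts["correct"] += 1
        else:
            counts["incorrect"] += 1

    total = len(gold)
    return {
        outcome: 100.0 * counts[outcome] / total
        for outcome in ("correct", "incorrect", "missing")
    }
```

For the binary statistical tests, a name would count as a success only when it falls in the "correct" category; both "incorrect" and "missing" would count as failures.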

All trialists with a gender determination were evaluated with Genderize and Gender API on 2023-11-21 using the R package httr (version 1.4.7). Both US Social Security Administration (SSA) and US Census Integrated Public Use Microdata Series (IPUMS) name datasets were used as a reference when predicting names with the gender R package [12] (version 0.6.0).

Genderize and Gender API were used to predict names with and without supplying a country of origin for the subset of authors with a singular country of affiliation. The gender R package was only tested without supplying country names because the SSA and IPUMS methods do not provide that functionality. Two-part names were concatenated without any delimiter, e.g., Jean-Pierre was converted to jeanpierre. Middle names were removed, unless an author had a first initial/middle name, in which case their middle name was used. Gender bias in name prediction was descriptively evaluated by calculating the percent of names that were misgendered, compared to the gold standard labeled dataset. In an additional analysis, accuracy differences resulting from delimiting two-part first names with different characters were evaluated. Two-part first name prediction accuracy was also evaluated using the first half of two-part names only. For example, the name Jean-Pierre was tested four ways: 1) jean-pierre; 2) jean pierre; 3) jeanpierre; and 4) jean.
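The four representations of a two-part first name can be sketched as follows. This is a hypothetical Python helper written for illustration, not the normalization code used in the study; the function name `two_part_variants`, the dictionary keys, and the assumption that the two parts are separated by a hyphen or whitespace are all introduced here.

```python
import re


def two_part_variants(first_name):
    """Return the four tested spellings of a two-part first name.

    Assumes the two parts are separated by a hyphen or whitespace,
    e.g. "Jean-Pierre" or "Jean Pierre".
    """
    parts = [p for p in re.split(r"[-\s]+", first_name.strip().lower()) if p]
    if len(parts) != 2:
        raise ValueError("expected a two-part first name")
    first, second = parts
    return {
        "hyphen": f"{first}-{second}",   # 1) jean-pierre
        "space": f"{first} {second}",    # 2) jean pierre
        "none": f"{first}{second}",      # 3) jeanpierre
        "first_only": first,             # 4) jean
    }
```

Each variant would then be submitted to the prediction service separately so that accuracy can be compared across delimiter categories.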

In addition to predicting the gender of a first name, Genderize and Gender API also report an estimated probability that a gender prediction is correct. We evaluated the correlation between these API-reported probability estimates and the gold standard labeled dataset with linear regressions and Brier scores. Names with a reported probability less than or equal to 50% were excluded from the regression and Brier scores.
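For readers unfamiliar with the Brier score, it is the mean squared difference between a reported probability and the observed outcome (1 for a correct prediction, 0 for an incorrect one), so lower values indicate better-calibrated confidence. The Python sketch below shows one way to compute it with the exclusion rule described above; the exact computation used in the study is not given in the text, and the function name and arguments are assumptions of this example.

```python
def brier_score(probabilities, correct, min_probability=0.5):
    """Mean squared error between reported probability and outcome.

    probabilities: API-reported probability that each prediction is correct.
    correct: True if the prediction matched the gold standard, else False.
    Names with probability <= min_probability are excluded, mirroring the
    exclusion rule described in the text.
    """
    pairs = [
        (p, 1.0 if is_correct else 0.0)
        for p, is_correct in zip(probabilities, correct)
        if p > min_probability
    ]
    if not pairs:
        raise ValueError("no predictions above the probability threshold")
    return sum((p - outcome) ** 2 for p, outcome in pairs) / len(pairs)
```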

The gender prediction accuracies of Genderize and Gender API were also separately evaluated using publicly available datasets from two studies [15,19] that tested gender prediction in 2018 and 2021, respectively. The dataset [20] provided by Santamaria 2018 consisted of 5,779 names sourced from various other datasets. The dataset [21] sourced from Sebo 2021 consisted of 6,131 Swiss physicians. The names from these public datasets were not modified prior to our evaluation on 2023-11-07. Nor were nationalities supplied to Genderize and Gender API when evaluating these public datasets, following the original experimental design.

All software accuracy comparisons were computed in R version 4.3.1. Differences in accuracy between methods were evaluated using the default R stats package implementation of McNemar’s test [22]. Data analysis was facilitated with tidyverse [23] (version 2.0.0), haven [24] (version 2.5.3), readxl [25] (version 1.4.3), testthat [26] (version 3.1.10), ggpmisc [27] (version 0.5.5), and patchwork [28] (version 1.1.3) R libraries.
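McNemar's test compares two methods on the same set of names by looking only at the discordant pairs, i.e., names that one method predicted correctly and the other did not. The study used R's `mcnemar.test`; the Python sketch below is a standard-library illustration of the same continuity-corrected statistic, not the authors' code. The function name is an assumption, and returning a p-value of 1.0 when there are no discordant pairs is a choice made for this sketch.

```python
import math


def mcnemar_test(method_a_correct, method_b_correct):
    """Continuity-corrected McNemar's test on paired success/failure data.

    method_a_correct, method_b_correct: booleans for the same names in the
    same order (True means the prediction was a success).
    Returns (chi-squared statistic, p-value) with 1 degree of freedom.
    """
    pairs = list(zip(method_a_correct, method_b_correct))
    only_a = sum(1 for a, b in pairs if a and not b)  # A right, B wrong
    only_b = sum(1 for a, b in pairs if b and not a)  # B right, A wrong
    discordant = only_a + only_b
    if discordant == 0:
        # No disagreement between methods; treated as no evidence of a
        # difference in this sketch.
        return 0.0, 1.0
    statistic = (abs(only_a - only_b) - 1) ** 2 / discordant
    # Survival function of a chi-squared variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value
```

Names on which both methods succeed or both fail do not enter the statistic, which is why two tools with similar overall accuracy on a large dataset can still differ significantly.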

Results

Out of 40,273 unique clinical trialists present in the HemOnc KB as of 2023-11-21, 37,420 (92.9%) had a resolvable first name and were thus eligible for gender determination. This group was sourced from 7,473 clinical trial manuscripts published between 1947 and 2023. After excluding trialists with gender not yet determined (n=4,360, 11.7%), those with a determined unknown gender (n=78, 0.2%), and those with a determined gender but initial-only first names (n=14, <0.1%), the final analysis set included 32,968 trialists with predetermined binary gender. Of the 32,968 trialists, 11,398 (34.6%) were designated as women. There were 7,849 unique names after normalizing first initial/middle name combinations to only include a middle name. The remainder of names were shared by more than one individual. Michael was the most common name, with 473 (1.4%) occurrences. Only 1,899 (24.2%) of names occurred more than twice.

Of 25,240 trialists with a known site affiliation, 24,930 (98.8%) were affiliated with sites in a single country and were assigned to the country of their affiliated institution when querying Genderize and Gender API with nationalities. When excluding clinical trialists without a recorded country of origin, the number of trialists and unique names was 24,930 and 6,756, respectively. The final analysis set included trialists from 87 countries, the most abundant being the US with 9,485 (38%) affiliated trialists. There were 7,569 first name-country combinations that occurred only once. The most common first-name-country combination was David-US with 201 (0.8%) instances. Only 1,760 (7.1%) of first name-country combinations appeared more than twice. The 100 most common trialist name-country combinations are presented in Supplementary Table 1.

Gender prediction accuracy when country of origin was not supplied (baseline case)

The overall accuracy of Genderize when predicting gender for the full dataset without supplying country was 96.6% with 2.3% incorrect gender predictions and 1.1% of names yielding no prediction (Table 1). Similarly, the overall accuracy of Gender API was 96.1% with 2.7% incorrect gender predictions and 1.1% of names resulting in no prediction. The accuracy of the gender R package’s predictions was lower, with 79.8% and 85.7% accuracy with the IPUMS and SSA methods, respectively. Names of men were misgendered as women less than 3% of the time for all gender prediction tools (Table 1). Names of women were misgendered over 3% of the time for all services except the gender R package when using SSA data as a reference. The difference in the percent of correct gender predictions between Genderize and Gender API was significant in favor of Genderize (p<0.001). Likewise, the accuracy difference between the gender R package methods was also significant (p<0.001), in favor of the SSA method. Gender API demonstrated higher gender prediction accuracy when two-part names were delimited with a space: the percent of correctly inferred genders rose from 96.1% to 96.3%.

Table 1: Accuracy of Gender Predictions on 32,968 Included Trialists

After restricting Genderize’s predictions to trialists affiliated with a single country, the percentage of correct, incorrect, and missing predictions were 96.2%, 2.6%, and 1.2% respectively (Fig 1A). Genderize achieved the highest accuracy when evaluating first names from German authors, and the lowest accuracy when evaluating names from South Korean, Chinese, Singaporean, and Taiwanese authors. When evaluating the same 24,929 clinical trialists with Gender API, the percentage of correct, incorrect, and missing predictions were 95.8%, 3%, and 1.3% respectively. Gender API also had high accuracy when predicting the gender of German authors, and the lowest accuracy when evaluating names from South Korean, Chinese, Singaporean, and Taiwanese authors. The difference in accuracy between Genderize and Gender API is significant (p<0.001), in favor of Genderize.

Fig 1: Accuracy of Gender Predictions.

Panel A shows the gender prediction accuracies when countries are not included in the API call. Panel B shows the results when countries are included in the API call. The top 4 countries with the most trialists and all Asian countries are plotted. Method a is Genderize and method b is Gender API. Each bar is labeled with the fraction and percentage of correct gender predictions. Two-part first names were appended together without a delimiting character.

Gender prediction accuracy when country of origin was supplied to the API

The gender prediction accuracies when countries of origin were supplied to Genderize and Gender API are visualized in Fig 1B. Supplying the countries of origin alongside first names in the API call decreased the percentage of correct gender predictions when using Genderize from 96.2% to 95.4%, while also reducing the percentage of incorrect predictions from 2.6% to 2.1%. Conversely, including countries of origin increased the ratio of correct gender predictions of Gender API from 95.8% to 96% and decreased incorrect predictions from 3% to 2.7%. Supplying countries also increased the percentage of names with no gender prediction for Genderize from 1.2% to 2.5%, while Gender API remained constant at 1.3%. The difference in accuracy between Genderize and Gender API was significant in favor of Gender API (p<0.001).

Gender prediction accuracy when using different characters to delimit two-part forenames

Gender prediction accuracy when evaluating two-part names was higher when countries were not included in the API call in all contexts except when calling Genderize with the first half of a two-part name, e.g., Jean-Pierre as jean. Genderize was most accurate (76.4%) when no character was used to delimit two-part names, e.g., Jean-Pierre represented as jeanpierre (Fig 2). Genderize provided zero predictions for two-part first names delimited with a space. In contrast, Gender API achieved the highest gender prediction accuracy when delimiting two-part names with a space (83.5%). Gender prediction accuracy for two-part names was worse than for one-part names when countries were not included in the API call and two-part names were separated without a delimiter: OR 0.07 (95% CI 0.06-0.08) for Genderize and OR 0.08 (95% CI 0.07-0.09) for Gender API, respectively.
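To make the odds ratios above concrete: an odds ratio below 1 means that the odds of a correct prediction are lower for two-part names than for one-part names. The text does not state how the confidence intervals were computed, so the Python sketch below uses the common Wald (log-scale normal approximation) interval purely as an assumption; the function name and the four count arguments are hypothetical and no study data are reproduced here.

```python
import math


def odds_ratio_wald_ci(correct_two, wrong_two, correct_one, wrong_one, z=1.96):
    """Odds ratio of a correct prediction, two-part vs. one-part names.

    correct_two / wrong_two: counts of successes / failures for two-part names.
    correct_one / wrong_one: counts of successes / failures for one-part names.
    Returns (odds_ratio, lower, upper) using a Wald interval on the log
    scale; z=1.96 gives an approximate 95% interval.
    """
    counts = (correct_two, wrong_two, correct_one, wrong_one)
    if any(count <= 0 for count in counts):
        raise ValueError("all four counts must be positive")
    odds_ratio = (correct_two * wrong_one) / (wrong_two * correct_one)
    standard_error = math.sqrt(sum(1.0 / count for count in counts))
    log_or = math.log(odds_ratio)
    lower = math.exp(log_or - z * standard_error)
    upper = math.exp(log_or + z * standard_error)
    return odds_ratio, lower, upper
```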

Fig 2: Accuracy of Gender Predictions Based on Delimiter between Two-Part Names.

Panel A is Genderize and Panel B is Gender API. Plot facets correspond to the type of delimiter separating two-part names. Stacked bars correspond to correct, incorrect, and no predictions respectively. Bars are labeled with the count and percent of correct gender predictions.

Differences in accuracy between Genderize and Gender API were evaluated for statistical significance by comparing the percent of correct gender predictions within each delimiter category. The difference in gender prediction accuracy between Genderize and Gender API when evaluating two-part names without a delimiting character and including countries in the API call was not significant. All other comparisons between Genderize and Gender API were significant in favor of Gender API (p<0.001).

Gender prediction accuracy by API-reported confidence thresholds

There was high agreement overall between gender prediction services and the gold standard labeled dataset (Fig 3). Genderize reported over 50% confidence in gender predictions for 32,573 (98.8%) trialists. Similarly, Gender API reported over 50% confidence for 32,587 (98.8%) trialists. Gender API demonstrated a correlation of 0.91 between its reported confidence and actual accuracy, compared to Genderize’s correlation of 0.82. The Brier scores for Gender API and Genderize were 0.0077 and 0.0048 respectively.

Fig 3: Experimental Name Prediction Accuracy At Different API Probability Cutoffs.

Names with gender predictions were aggregated into the following API reported probability bins: 50%-55%, 55%-60%, 60%-65%, 65%-70%, 70%-75%, 75%-80%, 80%-85%, 85%-90%, 90%-95%, 95%-100%. The API reported probabilities within each bin were averaged and plotted on the x-axis. The experimentally determined gender prediction accuracies for the names in each bin are visualized on the y-axis.
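The binning described for Fig 3 can be sketched in Python as follows. This is an illustrative reconstruction under stated assumptions, not the plotting code used in the study: the function name, the output layout, the exclusion of probabilities at or below 50%, and the convention that a value exactly on a boundary falls into the higher bin are all choices made for this example.

```python
from bisect import bisect_right

# Upper edges of the first nine bins: 0.55, 0.60, ..., 0.95.
BIN_EDGES = [percent / 100 for percent in range(55, 100, 5)]


def calibration_bins(probabilities, correct):
    """Group predictions into 5-point probability bins from 50% to 100%.

    Returns one dict per non-empty bin with the mean reported probability
    (x-axis) and the observed accuracy (y-axis).
    """
    bins = [[] for _ in range(len(BIN_EDGES) + 1)]
    for probability, is_correct in zip(probabilities, correct):
        if probability <= 0.5:
            continue  # excluded, as for the regression and Brier scores
        bins[bisect_right(BIN_EDGES, probability)].append(
            (probability, is_correct)
        )

    summary = []
    for index, members in enumerate(bins):
        if not members:
            continue
        summary.append({
            "bin_start": 0.5 + 0.05 * index,
            "mean_probability": sum(p for p, _ in members) / len(members),
            "accuracy": sum(1 for _, ok in members if ok) / len(members),
        })
    return summary
```

A well-calibrated service would produce points close to the diagonal, where mean reported probability equals observed accuracy.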

Replication of analyses by Santamaria 2018 and Sebo 2021

The original dataset used by Santamaria consisted of 5779 first names with known genders, 34% of whom were women. Only 0.4% of the 5779 had diacritical marks. In addition, 1.1% and 2% of names contain spaces or hyphens, respectively. The original paper reported 80% accuracy using Genderize and 87% using Gender API. In our re-analysis, Genderize predicted the correct gender 92.5% of the time. Similarly, Gender API achieved 92.8% accuracy. The difference in accuracy between Genderize and Gender API was not statistically significant.

The dataset originally analyzed by Sebo 2021 included 6131 names, of whom 50.3% were women. Diacritical marks were present in 6.6% of names. Spaces were present in 10.2% of names, and hyphens in 6.6%. The original paper reported 81% accuracy using Genderize and 97% with Gender API. In our re-analysis, the accuracies of Genderize and Gender API on these 6131 names were 86.2% and 98%, respectively. McNemar’s test indicated that the difference in accuracy was statistically significant (p<0.001), in favor of Gender API. Gender API was 99.5% accurate when evaluating names with diacritical marks, while Genderize was 71.7% accurate.

Cost and Accessibility

Genderize and Gender API provide a graphical user interface, while the gender R package requires programming. Genderize [10] provides 1,000 free predictions per day, whereas Gender API [11] only allows 100 free predictions per month. Gender API currently costs 4.85x more than Genderize for a monthly subscription that provides 100,000 predictions.

Discussion

Genderize and Gender API both demonstrated over 95% overall accuracy on our gold-standard dataset of cancer clinical trialists. Genderize was slightly more accurate than Gender API when countries were not included in the API call. Conversely, Gender API performed slightly better than Genderize when countries were included. For both services, including countries reduced the number of incorrect gender assignments at the cost of increasing the number of names with no predicted gender (Fig 1, Supplementary Table 2, Supplementary Table 3). The gender R package performed worse than Genderize or Gender API (Table 1).

Genderize and Gender API differed in how their accuracy was affected by the delimiter separating two-part first names. Genderize was most accurate when two-part first names were appended together without a delimiter (Fig 2). In fact, Genderize appeared to be incompatible with two-part first names that were delimited by a space as the service yielded zero correct predictions when evaluating such names. Conversely, Gender API performed best when two-part first names were delimited with a space. The slightly higher overall gender prediction accuracy attained by Genderize compared to Gender API is partially an artifact of our decision to append two-part names without a delimiter in the baseline comparison, since Gender API performed best when two-part names were delimited with a space.

A commonality between this analysis and several previous studies was the lower prediction accuracy of Genderize and Gender API when evaluating Asian names, with the exception of Japanese names [15,19]. The higher accuracy achieved for both services in our re-analysis of Santamaria’s dataset indicates that both services have improved since 2015, although Genderize improved by a larger margin. Gender API outperformed Genderize when re-analyzing Sebo’s dataset largely because Gender API handled two-part names that were delimited by spaces as well as names with diacritical marks. In fact, a follow-up study [14] by the same author recommended removing diacritical marks and modifying two-part names to improve the accuracy of Genderize.

This study’s results should be interpreted with certain caveats in mind. We did not filter out recurring first names during this analysis because the count of names in real-world datasets like ours tends to follow a long-tail distribution [29]. The process for determining the “gold-standard” gender of each of the trialists relied on inference from available information. Affiliation data were missing for a substantial subset of authors, mostly due to the older practice of MEDLINE including only the affiliation of the first author. In addition, a substantial number of high-profile oncology journals (e.g., the Journal of Clinical Oncology and Blood) did not include clear 1:1 author-to-affiliation mappings for a period of time; this issue affects at least 320 (4%) of manuscripts in the HemOnc KB. A substantial subset of recently added authors to the HemOnc KB have not had their gender determined yet (13.7%), and this subset has some important differences from the set of determined genders. Most notably, the undetermined subset has many more Asian hyphenated names (43.8% vs 3.8%) and authors with a country of affiliation including South Korea, China, Singapore, and/or Taiwan (45.6% vs 3.42%). It is thus likely that our results represent a “best-case scenario” and that automated gender mapping will become increasingly difficult as cancer clinical trials are increasingly conducted in the Asia-Pacific region [30,31]. Additionally, a researcher’s nationality in our data set does not always reflect the cultural origin of their first name, as some researchers immigrated to the country of their academic affiliation.

It is important to note that Genderize, Gender API, and the gender R package assume a gender binary. However, a recent survey [32] found that 1.6% of U.S. adults identify as transgender or nonbinary. With new algorithmic advancements such as Genderize and Gender API, it is imperative that inclusivity is incorporated. Going forward, tools that infer gender based on name should be trained on data that include transgender and nonbinary people, and they should include the option to predict an individual as nonbinary or transgender. Gender prediction is not simply a binary classification problem. Transgender individuals will likely make up a small percentage of the dataset of names, which could make obtaining a correct prediction for these individuals very challenging. Yet, by not incorporating them, we are excluding countless people from this algorithm and ensuring that the prediction of their gender has 0% accuracy.

Both Genderize and Gender API demonstrated high gender prediction accuracy with Western names that were highly normalized without middle or last names or diacritical marks. The cost per name evaluated with Genderize is also several times cheaper than Gender API. However, Genderize loses accuracy compared to Gender API when name formatting becomes less consistent. The SSA and IPUMS methods of the gender R package were less accurate but are open-source alternatives. The results from this study provide a new benchmark for gender inference tools.

Data Availability

The data that support the findings of this study are publicly available from the Harvard DataVerse: https://dataverse.harvard.edu/privateurl.xhtml?token=6d620f82-5ef2-4ea6-90a5-19fb8ca4fe80

Alexander D. VanHelene: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data Curation, Writing - Original Draft, Writing - Review & Editing, Visualization. Ishaani Khatri: Conceptualization, Writing - Original Draft, Writing - Review & Editing. C. Beau Hilton: Formal analysis, Writing - Review & Editing. Sanjay Mishra: Methodology, Formal analysis, Investigation, Data Curation, Visualization, Writing - Review & Editing, Funding acquisition. Ece D. Gamsiz Uzun: Visualization, Writing - Review & Editing. Jeremy L. Warner: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Resources, Data Curation, Writing - Original Draft, Writing - Review & Editing, Visualization, Supervision, Funding acquisition.

Acknowledgements

We would like to acknowledge the efforts of the editorial board of HemOnc.org.

References

  1. 1.↵
    Chatterjee P, Werner RM. Gender Disparity in Citations in High-Impact Journal Articles. JAMA Netw Open. 2021;4: e2114509. doi:10.1001/jamanetworkopen.2021.14509
    OpenUrlCrossRef
  2. 2.↵
    Murphy M, Callander JK, Dohan D, Grandis JR. Women’s Experiences of Promotion and Tenure in Academic Medicine and Potential Implications for Gender Disparities in Career Advancement: A Qualitative Analysis. JAMA Netw Open. 2021;4: e2125843. doi:10.1001/jamanetworkopen.2021.25843
    OpenUrlCrossRefPubMed
  3. 3.↵
    Dymanus KA, Butaney M, Magee DE, Hird AE, Luckenbaugh AN, Ma MW, et al. Assessment of gender representation in clinical trials leading to FDA approval for oncology therapeutics between 2014 and 2019: A systematic review-based cohort study. Cancer. 2021;127: 3156–3162. doi:10.1002/cncr.33533
    OpenUrlCrossRefPubMed
  4. 4.↵
    Wais K. Gender Prediction Methods Based on First Names with genderizeR. R J. 2016;8/1: 17–37.
  5. 5.
    Cevik M, Haque SA, Manne-Goehler J, Kuppalli K, Sax PE, Majumder MS, et al. Gender disparities in coronavirus disease 2019 clinical trial leadership. Clin Microbiol Infect. 2021;27: 1007–1010. doi:10.1016/j.cmi.2020.12.025
    OpenUrlCrossRefPubMed
  6. 6.
    Topaz CM, Sen S. Gender Representation on Journal Editorial Boards in the Mathematical Sciences. Danforth CM, editor. PLOS ONE. 2016;11: e0161357. doi:10.1371/journal.pone.0161357
    OpenUrlCrossRefPubMed
  7. 7.
    Nielsen MW, Andersen JP, Schiebinger L, Schneider JW. One and a half million medical papers reveal a link between author gender and attention to gender and sex analysis. Nat Hum Behav. 2017;1: 791–796. doi:10.1038/s41562-017-0235-x
    OpenUrlCrossRefPubMed
  8. 8.
    Sebo P, Clair C. Are female authors under-represented in primary healthcare and general internal medicine journals? Br J Gen Pract. 2021;71: 302. 1–302. doi:10.3399/bjgp21X716249
    OpenUrlCrossRef
  9. 9.↵
    Szymkowiak M. Genderizing fisheries: Assessing over thirty years of women’s participation in Alaska fisheries. Mar Policy. 2020;115: 103846. doi:10.1016/j.marpol.2020.103846
    OpenUrlCrossRef
  10. 10.↵
    Genderize Documentation. In: Genderize [Internet]. [cited 2 Jan 2024]. Available: https://genderize.io/
  11. 11.↵
    Gender API - Determines the gender of a first name. [cited 2 Jan 2024]. Available: https://gender-api.com/?utm_source=adwords&utm_medium=cpc&utm_campaign=ga3&price-set=OTG&gad_source=1&gclid=Cj0KCQiAhc-sBhCEARIsAOVwHuQF7HdhUmWlWZfU851GAvRl4ziBWaEc6tDDR_XKmG6I904GgzaqYr4aAoCfEALw_wcB
  12. 12.↵
    Mullen L. gender: Predict Gender from Names Using Historical Data. 2021. Available: https://github.com/lmullen/gender
  13.
    Sebo P. How accurate are gender detection tools in predicting the gender for Chinese names? A study with 20,000 given names in Pinyin format. J Med Libr Assoc. 2022;110. doi:10.5195/jmla.2022.1289
  14.
    Sebo P. Using genderize.io to infer the gender of first names: how to improve the accuracy of the inference. J Med Libr Assoc. 2021;109. doi:10.5195/jmla.2021.1252
  15.
    Santamaría L, Mihaljevic H. Comparison and benchmark of name-to-gender inference services. PeerJ Comput Sci. 2018;4: e156. doi:10.7717/peerj-cs.156
  16.
    Warner JL, Cowan AJ, Hall AC, Yang PC. HemOnc.org: A Collaborative Online Knowledge Platform for Oncology Professionals. J Oncol Pract. 2015;11: e336–e350. doi:10.1200/JOP.2014.001511
  17.
    Heidari S, Babor TF, De Castro P, Tort S, Curno M. Sex and Gender Equity in Research: rationale for the SAGER guidelines and recommended use. Res Integr Peer Rev. 2016;1: 2. doi:10.1186/s41073-016-0007-6
  18.
    CIHR Institute of Gender and Health. What a difference sex and gender make: a gender, sex and health research casebook. 2012 [cited 18 Jan 2024]. doi:10.14288/1.0132684
  19.
    Sebo P. Performance of gender detection tools: a comparative study of name-to-gender inference services. J Med Libr Assoc. 2021;109. doi:10.5195/jmla.2021.1185
  20.
    Mihaljevic H, Santamaria L. Evaluation of name-based gender inference methods. GenderGapSTEM-PublicationAnalysis; 2023. Available: https://github.com/GenderGapSTEM-PublicationAnalysis/name_gender_inference
  21.
    Sebo P. Performance of gender detection tools: a comparative study of name-to-gender inference services. 2021 [cited 2 Jan 2024]. doi:10.17605/OSF.IO/KR2MX
  22.
    R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Available: https://www.R-project.org/
  23.
    Wickham H, Averick M, Bryan J, Chang W, McGowan L, François R, et al. Welcome to the Tidyverse. J Open Source Softw. 2019;4: 1686. doi:10.21105/joss.01686
  24.
    Wickham H, Miller E, Smith D. haven: Import and Export “SPSS”, “Stata” and “SAS” Files. 2023. Available: https://CRAN.R-project.org/package=haven
  25.
    Wickham H, Bryan J. readxl: Read Excel Files. 2023. Available: https://CRAN.R-project.org/package=readxl
  26.
    Wickham H. testthat: Get Started with Testing. 2011. Available: https://journal.r-project.org/archive/2011-1/RJournal_2011-1_Wickham.pdf
  27.
    Aphalo P. ggpmisc: Miscellaneous Extensions to “ggplot2.” 2023. Available: https://CRAN.R-project.org/package=ggpmisc
  28.
    Pedersen T. patchwork: The Composer of Plots. 2023. Available: https://CRAN.R-project.org/package=patchwork
  29.
    Clauset A, Shalizi CR, Newman MEJ. Power-Law Distributions in Empirical Data. SIAM Rev. 2009;51: 661–703. doi:10.1137/070710111
  30.
    Akiki V, Troussard X, Metges J, Devos P. Global trends in oncology research: A mixed-methods study of publications and clinical trials from 2010 to 2019. Cancer Rep. 2023;6: e1650. doi:10.1002/cnr2.1650
  31.
    Terada M, Nakamura K, Matsuda T, Okuma HS, Sudo K, Yusof A, et al. A new era of the Asian clinical research network: a report from the ATLAS international symposium. Jpn J Clin Oncol. 2023;53: 619–628. doi:10.1093/jjco/hyad033
  32.
    Brown A, Horowitz JM, Parker K, Minkin R. The Experiences, Challenges and Hopes of Transgender and Nonbinary U.S. Adults. In: Pew Research Center’s Social & Demographic Trends Project [Internet]. 7 Jun 2022 [cited 2 Jan 2024]. Available: https://www.pewresearch.org/social-trends/2022/06/07/the-experiences-challenges-and-hopes-of-transgender-and-nonbinary-u-s-adults/
Posted January 31, 2024.
Inferring Gender from First Names: Comparing the Accuracy of Genderize, Gender API, and the gender R Package on Authors of Diverse Nationality
Alexander D. VanHelene, Ishaani Khatri, C. Beau Hilton, Sanjay Mishra, Ece D. Gamsiz Uzun, Jeremy L. Warner
medRxiv 2024.01.30.24302027; doi: https://doi.org/10.1101/2024.01.30.24302027

Subject Area

  • Health Informatics