“Too Soon” to Count? How Gender and Race Cloud Notability Considerations on Wikipedia

Not everyone deserves their own Wikipedia page; that’s why Wikipedia’s notability guidelines exist. But definitions of notability are unevenly applied across race and gender lines:

Wikipedia’s editors are less likely to consider you “notable” if you’re not a white man.

Using a combination of qualitative and statistical analysis of Wikipedia pages nominated for deletion, Mackenzie Emily Lemieux and Rebecca Zhang join with Francesca Tripodi to explore how Wikipedia’s notability considerations are applied for female and BIPOC academics. They examined two key metrics used in the process of establishing notability on Wikipedia: the Search Engine Test and the “Too Soon” metric. The search engine test determines if a person’s online presence is well covered by reputable, independent sources. In their analysis, Lemieux, Zhang, and Tripodi found that this test predicts whether or not white male academics’ pages will be kept or deleted. But academics who are women and people of color are more likely to have their Wikipedia page deleted—even if they have equivalent or greater online presence than their white male peers.

The second metric, “too soon,” is a label applied to Wikipedia pages when a Wikipedian thinks there aren’t enough independent, high quality news sources about the page’s subject. Women of all races are more likely than men to be considered not yet notable (i.e., “too soon” to be on Wikipedia). The online encyclopedia’s editors were more likely to justify this label applied to women based on their career stages (e.g., “she’s an assistant professor” and therefore not yet notable). But this tag was applied to women on average further in their careers than men who received the tag. Individual bias continues to disadvantage women and people of color on Wikipedia; and Wikipedia continues to allow these hidden biases to influence processes of determining notability.

Who are you citing? In communication studies, the most cited authors are skewed even more white and male than previously thought.

Principal researcher Deen Freelon, and coauthors Meredith Pruden, Kirsten Eddy, and Rachel Kuo published a study detailing the degree to which race, gender, and location affect who gets cited in the top journals in the communication field. “Inequities of race, place, and gender among the communication citation elite, 2000–2019,” published in Journal of Communication, identifies a group of 1,675 highly cited communication scholars. As people cited the most often in the top communication journals, these scholars are the discipline’s “power elite;” their work disproportionately shapes what theories and research questions are considered valuable significant to the discipline. Building on previous work documenting serious inequalities in the field, Freelon and his collaborators show that these disparities are even more pronounced among top citations.

These “Elite” are 91.5% white, 74.3 male, and 78.6% located in the United States. These percentages are even more skewed white and male than general citation statistics found in previous work. And it gets worse when you apply an intersectional lens: among the 23 elite communication scholars who are Hispanic or Latine, only five are women. Of the 14 Black scholars in this elite group, only one is a woman (and she is employed by a department outside of communication studies).

Every single citation can “reify or resist” these inequities. For scholars seeking to resist these trends, the authors offer a series of recommendations:

Review the reference lists for your recent publications, presentations, course syllabi, and teaching pedagogy. Take note of how many citations claim to represent “universal” or “generalizable” theory and ask whether they apply only to Western, Educated, Industrialized, Rich, and Democratic (WEIRD) countries and people.
Prioritize diversity in your reading lists. When searching for new scholarship, always begin with those written by scholars from underrepresented groups. Follow other work by these scholars, take note of whom they are citing, and read those publications too.
Seek out responses to and critiques of longstanding or foundational work in the field, particularly those approaching this work through the lens of equity or diversity, and include these perspectives in your work.
Diversify the locations you cite. What are the origins of the literature you most frequently cite? Are they mostly WEIRD? Consider the broader global applicability (or lack thereof) of your work. Cite examples of similar issues occurring in other countries beyond your geographic region, or reconsider how your work can be more globally applicable and engage with scholarship that supports that endeavor.
Draw from existing resources aimed at equitable citation practices, such as AEJMC’s Inclusive Citation (iCite) Project, Women Also Know Stuff, People of Color Also Know Stuff, #CiteASista, Rockefeller Inclusive Science Initiative, Community of Online Research Assignments (Project CORA), Communication Scholars for Transformation, and The University of British Columbia’s Decolonization and Anti-Racism guide.
If you are active on social media, diversify your academic following to be exposed to new arguments and research.
Structural steps to increase citational justice include adding citation diversity statements to journal “About” pages; for journals to include the race and gender proportions of cited authors (aided by software that automatically detects these quantities [e.g., Alcantara Castillo et al., 2020]); diversifying journal editorial boards, associate editor teams, and referee invitations; and providing journal authors the option of submitting and publicizing their own demographic information in their articles.

As Stewart Coles added in sharing the study, “The striking thing about this study is not that whites, men, & USians are overrepresented among the communication citation elites—we been knew that. Rather, it's the startling degree to which this overrepresentation exists and persists. May this move the conversation forward.”

Moral outrage and shared moral norms energize networked, coordinated harassment online

Networked, coordinated harassment is done by all kinds of communities, from partisan political groups to fandoms. Though the origins of harassment are not necessarily identity-based, the resulting attacks use race, gender, sexuality, religion, and other attributes as vectors, making it more likely that people with marginalized identities will be harassed in ways that are intersectional/more harmful for individuals with multiple marginalized identities.

The key point is that while harrassers draw from identity-based stereotypes in their attacks, they understand their actions as morally justified and based in the target’s actions, rather than their identity. Marwick offers two examples, “I’m not against Anita Sarkeesian because I’m a misogynist/anti-feminist, but because she’s a scammer/liar,” and “I’m not against the 1619 project/Nikole Hannah Jones b/c I’m racist/my white ID is threatened but because she’s a liar who hates white people and white children.” In these cases, the speaker justifies their harassment of women by defining the woman as immoral and themselves therefore as moral actors for policing their immoral behavior.