Researchers looking into online toxicity found a way to connect supposedly anonymous posts made over the past dozen years on the site Economics Job Market Rumors (EJMR) to IP addresses, according to a draft paper leaked early online.
While EJMR is an academic jobs forum, it “also includes much content that is abusive, defamatory, racist, misogynistic or otherwise ‘toxic,’” the paper says.
“EJMR is sometimes dismissed as not being representative of the economics profession, including claims that the most frequent users on the platform are not actually economists,” the paper says. “However, our analysis reveals that the users who post on EJMR are predominantly economists, including those working in the upper echelons of academia, government and the private sector. In this paper, we identify the scheme used to assign usernames for each post written by an anonymous user on EJMR.
“We show how the statistical properties of that algorithm do not anonymize posts, but instead allows the IP address from which each post was made to be determined with high probability.”
Florian Ederer, one of the authors and Questrom professor of economics and management at Boston University, said an updated version of the paper was presented at the National Bureau of Economic Research last week.
Paul Goldsmith-Pinkham, a co-author and an assistant professor of finance at the Yale School of Management, said, “We have no intention of releasing personally identifiable information.”
EJMR posters are aware of the leaked paper. One top thread there is titled, “How is what Ederer did not illegal?”
“EJMR is currently melting down with people convinced their careers are in danger, presumably because they’ve said some very nasty and/or stupid things in locations that will easily identify them,” tweeted Ben Harrell, assistant professor of economics at Trinity University, in Texas. “In the end, nothing of value will be lost.”
Asked for comment, EJMR sent an email on 19 July saying, “you may wish to consider what a neutral actor (ChatGPT) thinks about the study.”
EJMR’s email then includes a question to that artificial intelligence programme: “Would reverse engineering partial hash codes of thousands of website users to get their IPs with brute force be considered hacking?” ChatGPT, according to the email, replied “Yes, that activity would certainly be considered hacking, and more specifically, it would be illegal and unethical.”
Later that day, the website sent this email: “It is essential to maintain an anonymous forum in the economics profession. EJMR has been used to expose multiple counts of plagiarism, corruption and serious professional misconduct that would not likely have been shared for fear of retaliation by their higher ups or colleagues. Indeed one of the co-authors of the paper had their  exposed by anonymous EJMR users, calling into question the motivation for the study.
“This paper’s attempt to expose the identities of the vast majority of good natured users, using the excuse of there being a very small number of toxic posts on the site, is something that many people find troubling, deeply unethical, and may well be illegal.”
Professor Ederer said: “What we’re doing is not hacking.”
Dr Goldsmith-Pinkham said the draft paper was placed in a private cache online. It ended up on GitHub, and at least one professor who was not an author said they downloaded it and shared it on Twitter.
Kyle Jensen, the third author, is also at the Yale School of Management.
The draft paper includes a chart showing, for each of the top 25 U.S. News-ranked economics departments, the percentage of total posts labelled toxic.
The University of California, Los Angeles, at nearly 15 per cent toxic, ranked first, followed by Yale and the University of California, San Diego, both above 10 per cent.
“Although posting on EJMR is generally frowned upon in the economics profession, 10.2 per cent of all posts to which we assign IP addresses originate directly from IP addresses associated with universities or research institutions,” they write. “Although some universities also are the internet service provider for some of their faculty and students (e.g., through university-provided faculty or student housing), this means that a substantial number of posts on EJMR occur while users are at their workplace. Perhaps even more surprisingly, there are EJMR posts from identified IP addresses located at literally all the leading universities in the United States.”
They write that “among the top 10 IP addresses with the highest number of toxic posts, there is not a single one from a university IP address. However, among the top 10 toxic university IP addresses there are several from leading U.S. universities including the University of Rochester and the University of Chicago.”
The researchers write that they “recover[ed] 47,630 distinct IP addresses of EJMR posters and match[ed] these to 66.1 per cent of the roughly 7 million posts made over the past 12 years. We geolocate posts and describe aggregated cross-sectional variation, particularly regarding toxic speech, across sub-forums, geographies, institutions and contributors.”
The study included developing software and a dark dictionary to catch this “toxic” speech.
“These posts are obfuscated to such an extent that we found most machine learning models failed to accurately classify them as toxic,” the researchers write.
“To address this, we developed software to deobfuscate such speech. First, we classified posts into commonly occurring natural languages on EJMR (Stahl, 2023): English, German, Chinese, Korean and a few others. Then we collected high-frequency non-English words in the English posts, which we used to develop a dictionary mapping text like ‘f**k,’ ‘secks’ and ‘GTFO’ to canonical forms. We used this dictionary to deobfuscate some of the most commonly obfuscated terms.
“Then, we checked each word in each post for common symbol-based obfuscations like ‘fa//g//g//ot,’ removing symbols where doing so resulted in an English word or well-known profanity. Finally, we transformed so-called leetspeak, such as ‘d4mn j3ws’, to its canonical form. We did this by attempting common leetspeak substitutions and checking if those substitutions resulted in an English word or a well-known profanity. Our goal in this effort was not perfection, but rather some improvement in the performance of machine learning models for this content.”
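The three steps the researchers describe (dictionary lookup, symbol stripping and leetspeak substitution) can be sketched roughly as follows. This is an illustrative Python sketch, not the authors' software: the word list, the slang dictionary, the substitution table and the function names are all hypothetical stand-ins, and the examples are deliberately mild.

```python
import re

# Hypothetical stand-in for a real English word list plus profanity list.
KNOWN_WORDS = {"damn", "hello", "stupid", "sex"}

# Hypothetical dictionary mapping obfuscated tokens to canonical forms.
SLANG = {"secks": "sex"}

# An illustrative subset of common leetspeak substitutions (assumed, not from the paper).
LEET = str.maketrans({"4": "a", "3": "e", "1": "l", "0": "o",
                      "5": "s", "7": "t", "@": "a", "$": "s"})


def deobfuscate_word(word: str) -> str:
    """Return a canonical form if one of the three steps finds one, else the word unchanged."""
    lower = word.lower()

    # Step 1: direct dictionary lookup of known obfuscated terms.
    if lower in SLANG:
        return SLANG[lower]

    # Step 2: strip symbols, keeping the result only if it is a known word.
    stripped = re.sub(r"[^a-z0-9]", "", lower)
    if stripped != lower and stripped in KNOWN_WORDS:
        return stripped

    # Step 3: try leetspeak substitutions, again keeping only known words.
    candidate = lower.translate(LEET)
    if candidate != lower and candidate in KNOWN_WORDS:
        return candidate

    return word


def deobfuscate(text: str) -> str:
    # Naive whitespace tokenisation; real text would need punctuation handling.
    return " ".join(deobfuscate_word(w) for w in text.split())


print(deobfuscate("h3ll0 st//up//id secks world"))
```

Accepting a rewrite only when it lands on a known word mirrors the check the researchers describe, and it keeps ordinary tokens that happen to contain digits or symbols from being altered.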
Dr Goldsmith-Pinkham said the study had not yet been peer-reviewed, but the authors plan to publish it in a peer-reviewed journal.
“EJMR is very popular,” the paper says. “SimilarWeb estimates that EJMR receives 2.5 million visits per month with an average of 6.45 pages viewed per visit. In comparison, the same figures for the NBER [National Bureau of Economic Research] and AEA [American Economic Association] competitors are 1.1 million and 991,000 visits and 2.09 and 2.76 pages per visit, respectively.”
The researchers found that “the vast majority of EJMR posts comes from residential IP addresses located in the United States and in particular in cities with elite universities … there is also a significant share of other countries including Canada, the United Kingdom, Hong Kong, Australia, Germany, Italy and France.”
This is an edited version of a story that first appeared on .