wordfreq sunset message
2024-09-18 19:11:18.07712+02 by Dan Lyke 0 comments
Why wordfreq will not be updated
I don't think anyone has reliable information about post-2021 language usage by humans.
The open Web (via OSCAR) was one of wordfreq's data sources. Now the Web at large is full of slop generated by large language models, written by no one to communicate nothing. Including this slop in the data skews the word frequencies.