
- The Moby Lexicon Project
- BNC Baby
- Full BNC
- Project Gutenberg (Download full database)
- CMU Pronouncing Dictionary
- GNU Collaborative International Dictionary of English
- The Internet Dictionary Project
- English Wiktionary Dump
- Simple English Wiktionary Dump
- JACET 8000
- Minimal pairs in English RP
- List of homographs
- Homophones in English RP
- Google’s Official List of Bad Words
- Yasumasa Someya’s Lemmas List
- MRC Psycholinguistic Database
- Million Song Dataset
- Penn Treebank P.O.S. Tags
- Princeton University’s WordNet
- The Sentence Corpus of Remedial English
- Summer Institute of Linguistics (SIL) Word List
- The Tanaka Corpus
- The General Service List
- The New General Service List
- The Academic Word List
- The New Academic Word List
- The TOEIC Word List
- The Business Service List
- Apache OpenOffice MyThes
- Global WordNet