The list of real sounding “fake” words used for the new Apps 4 EFL activity “Fight the Fakes” is now available for download.
The list was generated by looping through each of the words from the SIL list and splitting them into three-letter chunks. A Markov chain process was then used to determine which of the three letter chunks were most likely to precede or follow each other. The three-letter chunks were then recombined according to these likelihoods in order to create realistic sounding neologisms of various lengths, e.g.
The words were doubled checked against the SIL list to ensure no real words were accidentally generated.
Ask your students to try and invent “definitions” for the fake words based on what they sound like, e.g. “hispanelist (n.), chat show panelist from Latin America”, “mandibilious (adj.), used to describe an animal with extraordinarily strong jaws”, “rattlesnatcher (n.), a person who goes around stealing toys from small children”
Use them as in Yes/No vocabulary knowledge tests to ensure students don’t cheat by clicking “Yes, I know this word” for every item
The final list consists of 3,773 high frequency TOEFL words, and can be downloaded here.
Step 1: Assemble a corpus of TOEFL materials
For my corpus, I used material from both the older CBT (Computer Based Test) and the current iBT (Internet Based Test). I found most of the materials online for free. Some were already in plain text format, but most were PDFs and required Optical Character Recognition (OCR) to convert to plain text. I used ABBYY’s FineReader Pro for Mac, but there are plenty of other options out there too. Some files were Microsoft Word format (.doc/.docx), and MacOS X’s batch conversion utility came in hand for these. I included model answers, listening transcripts, reading passages and multiple choice questions (prompts, distractors and answers). I tried to exclude explanations, advice and instructions from the authors and/or publishers.
Ultimately, I ended up with corpus just shy of a million words (959,124 to be precise). In general, bigger is better when it comes to corpus research. The TOEIC Service List (TSL) utilizes a corpus of about 1.5 million words, so my TOEFL corpus seems roughly comparable to this.
Step 2: Count the number of occurrences of each word
For my final list I removed any words which also appear on the NGSL, any contractions (e.g. “Don’t”,”I’m”,”that’s”), any numbers written in word form (e.g. “two”,”million”), any vocalizations (e.g. “uh”,”oh”), any ordinals (e.g. “first”,”second”,”third”), any proper nouns (“James”, “Elizabeth”, “America”, “San Francisco”, “New York”), and any words with fewer than 5 occurrences in the corpus. Next, I ran the list through a spell checker, and excluded any unrecognized words. I also excluded any non-lexical words, to leave a list consisting only of nouns, verbs, adjectives and adverbs.
This might not be a problem for individual users, but it becomes a major issue when leading a group of students in lock-step through a structured learning process. The fact that the “user experience” is inconsistent means that there is no single set of instructions that all students will be able to follow. The fact that developing for every possible OS/handset combination is a challenge means that many apps only run on the latest OS versions of the most popular handsets.
So, although every student may possess a smartphone, not every smartphone will be able to run the cool CALL app you have in mind. Even if they can, you will either have to give individual support to every student in helping them set up the activity, or create multiple iterations of the instructions to cover every OS/device eventuality.
Unlike institutionally owned devices, which can be easily wiped after the user logs out or finishes the class, student owned devices contain a trove of personal data: photos, messages, appointments, contact information, and more.
Most students would probably feel uncomfortable sharing at least some of this information with their teachers. So when we walk around the room monitoring students to make sure they are on-task, or helping them set up the mobile-based CALL activities, we have to be careful not to inadvertently peek into the personal lives behind the tiny glowing screens in their hands.
Ever since Apple overhauled the iOS notification system, it seems that every app and its dog wants to send me updates, offers, news and status reports. While I endeavor to disable notifications for any app that doesn’t absolutely need them, my students tend to be less discerning. There’s nothing worse than setting up a class activity on mobile devices, only to have students navigate away from the app or site the moment a giant emoji-laden message drops down from the top of the screen. Even the students who diligently dismiss annoying messages from friends must find them a distraction from the learning process.
And I haven’t even begun to mention the students who will double click the home button and go back to Candy Crush the minute you’re not hovering over their shoulders and spying on their screens.
The modified version of Maslow’s hierarchy of needs now puts battery life right at the bottom of the pyramid, directly below “Wi-Fi”. Yes, this a sarcastic dig at millennials’ seeming inability to pull themselves away from their devices and do something healthy like.. climb a tree. However, in the CALL-based EFL classroom, it is a very pertinent observation.
This means that students, who are already heavy mobile users, may simple not have enough juice to utilize their devices during study time as well as break time. Where this is the case, you’d better hope that you have enough power outlets and charging cables to get them hooked back up to the mainline.
Capped data plans on mobile are generally the norm these days. There may be actual technological reasons behind this, but the cynical side of me suspects it’s just the carriers trying to milk heavy users for more money.
In any event, if you don’t have an easily accessible Wi-Fi network in your classroom (which isn’t restricted to just teachers) and you’re asking students to use their own data connections to engage with your chosen app or website, you have to be careful not to inadvertently incur additional charges for your students. Usually they will be quick to let you know when this is the case, but it can be yet another barrier to the successful exploitation of BYOD.
If you can overcome the difficulties presented by various models of various handsets running various versions of various operating systems, and all students have a fully juiced up device with plenty of bandwidth, and they are able to pull themselves away from Candy Crush, and ignore messages from their friends in other classes, then BYOD can be a good way to gain access to mobile technology in the classroom.
However, we must be careful not to appropriate students personal (and often private) devices as our own teaching tools, despite how cool that new ELT app may be.
I was honored to receive a Best of JALT award for my presentation on Apps 4 EFL at the Nakasendo English Conference 2015. I’d like to thank the organizers of Saitama JALT for inviting me to give the plenary presentation, and for nominating me for this award. In particular I’d like to thank Matt Shannon, Tyson Rode, and Rob Rowland – you guys are awesome! Thanks!