How many of the 170k English words do you know?

https://vocabowl-870366514258.us-west1.run.app/

72•abnry•1h ago

Comments

dtagames•1h ago

This was fun! The progression seems logical.

I scored 71,000.

slices•38m ago

75k here but a few of the later ones were lucky guesses.

cubano•31m ago

Yes...exactly the same here although the guesses often had some grounding in the root of the word.

itsamario•1h ago

I know maybe 20-30. I'm aware of maybe a few thousand.

I use the language to understand not get an effect

goldenarm•56m ago

It's hilarious that most of these words are French

the_lonely_phon•48m ago

Depends is bratwurst a German word or an English one? You will hard pressed to find an American that doesn’t know thr word and what it means. You can buy them at just about any grocery store and they are a staple of many restaurants.

At some point the word becomes both. Sourced from its mother language and maybe even still meaning the same thing in both, but no less an English word than any other at this point.

mordechai9000•32m ago

It also had "weltschmerz" in the list, but I think I have only ever heard "ennui" used in English. They are both foreign words, but I would not have thought of weltschmerz as a loan word. Then again, maybe I am not reading the right texts.

rhdunn•46m ago

Norman French due to the Norman invasion of 1066 resulting in Old English evolving into Middle English. You can see that in the words for animals vs meats (cow and boef/beef, sheep and mutton, etc.) where the Germanic people raised the sheep and the Norman aristocracy ate them.

A lot of the more common and simpler words are Germanic, as is the grammar (e.g. compound words like cupboard).

classified•46m ago

English also has a ridiculously high fraction of Latin too.

pessimizer•16m ago

Not from Latin but through French - the direct use of Latin in English is generally restricted to technical jargon and legal terms (that English often also share with the French.)

Latin isn't really any sort of parent to Old English afaik, even though the Romans ran Britain for a while.

yorwba•53m ago

There is a typo in "Hippopotomonstrosesquippedaliophobia," it should be "Hippopotomonstrosesquipedaliophobia" instead. (Also, it breaks the layout.)

classified•47m ago

I bet that "p" just bounced out of pure spite.

summarybot•44m ago

Let the ironic screaming at the sight of this word commence!

bobson381•33m ago

also interrobang is rendered as bang-interro (!?) when it should be interro (?) then bang (!) -> (?!)

spelufo•8m ago

do you really think so?!

I think bang-interro just didn't sound as nice and that's probably why it is called an interrobang.

fritzo•52m ago

Feature request: fewer clicks. It should be one click per question

TheJoeMan•31m ago

I'd suggest a "toast" would suffice for the correct answers. Proceed to the next question when correct, with a "next" button when incorrect.

ortusdux•26m ago

Keyboard shortcuts would be nice as well. When I saw it was 100 questions I bailed.

pastel8739•50m ago

I wish the option was just “yes I know this word” or “no I don’t”. Reading the definitions takes too long for so many words

yorwba•30m ago

A different interaction design is used by https://testyourvocab.com : just a list of words with a checkbox for each. But it might encourage overconfidence. Before their acquisition by Preply, they also had an interesting blog with statistical analysis: https://web.archive.org/web/20210724115604/http://testyourvo...

The two tests give me widely different results, probably because the sampled words aren't perfectly representative and so the results should have huge error bars to account for this sampling error.

jstanley•50m ago

Cool idea, am working through.

It's annoying that you need to click 3 times per question, and the buttons are in 2 different places.

Maybe would be better to just let me click the answer I want and then instantly show me the next question?

Also who is Sandi?

rhdunn•11m ago

Sandi Toksvig, the current host of the BBC program QI (Quite Interesting), previously hosted by Stephen Fry. She's also been on a number of other BBC TV and radio shows.

gilleain•7m ago

I suspect Sandi Toksvig, one of the hosts of QI. One of the 'success' messages is "quite interestng!".

No offence mean to anyone, but the whole exercise feels very QI : superficial 'understanding' of a large range of things (for example words) without much of a connection between these words.

cm2012•49m ago

Fun fact: there's a test you can do called wordsum which correlates extremely highly, like .71, to IQ. It's just asking you 10 vocabulary questions. It turns out knowing advanced vocabulary correlates really well to IQ.

summarybot•45m ago

I don't know if I can get behind .71 implying "correlates really well" ... that's the issue I had recently with talking with GPT, it was evaluating my logical reasoning ability based on the vocabulary I was employing. You don't need fancy words to be intelligent.

Laurel1234•49m ago

Pretty fun.

I suggest skipping the submit button and just showing it's correct when pressing and moving on after a sec or so. Having to click on submit twice really breaks the flow.

Also in all the words I tried I noticed out of the 4 options one is the correct one, another is the opposite of the correct one, and the other 2 are random stuff. You can basically skip any option whose antonym isn't present as well.

RicoElectrico•35m ago

It estimated 74k words for me, but I feel this might be inflated; much of the time when I didn't know the answer - I could vibe guess it just as you did it. The distractor answers weren't convincing enough. For starters, when an answer was based on deconstructing the word into common English words, that ruled it out. After all, if it was, then it wouldn't have been obscure.

A tangent: writing distractors for multiple choice questions is hard. From the exams I know (excluding those whose nature precludes it, such as based on calculation or rote memorization) the only that does this brutally well is LEK (Polish medical graduate exam). It's nigh impossible to vibe guess it at more than random chance for someone outside the field.

_diyar•32m ago

in casual use you might also be able to guess it from context, so i think it’s a wash

datsci_est_2015•28m ago

Yeah I also got exactly 74k. Stuff like “xylologist” I guessed had to do with vegetation because of “xylem”, whereas xylophone player was too on the nose. Then again, maybe knowing xylem in the first place makes 74k reasonable.

mpeg

trevwebdev•48m ago

Interesting, I don't have the time to go through 100 though and having to click on answer and then mouse down to continue is a slog.

archildress•48m ago

Nice tool - would love it if I could press a number on the keyboard to select and rapidly move through them.

kiaofz•48m ago

These should maybe be checked through. Many are the second or third definitions, and some even reference the word in the definition e.g Lethargic: exhibiting lethargy

notsylver•44m ago

It seems like the right answer is usually the longest of the choices, I managed to get a few just by picking the longest. It would also be nice if there was a "I don't know" instead of guessing and skewing the results by getting it right, though maybe thats accounted for

orrito•32m ago

These were likely all AI generated, or at least the alternatives were. I made an app a while ago as well, and afterwards realized AI often wanted to make a very covering answer for the correct one, making it often longer than the others, thus defeating the idea of the quiz in the process.

EstanislaoStan•13m ago

Yeah this is AI slop I don't like..

jrrv•43m ago

Presumably it's a random batch of words since you can run the test again. I wonder how much the word selection affects the outcome. I got 66,750 with 20/20/15/17/14.

I'm curious how the difficult is chosen because "obfuscate" was included in the hardest difficulty but I would not consider that to me a difficult word.

Also I found that some of the definitions were not completely correct.

rhdunn•19m ago

It could be based on things like word frequency. I'd expect obfuscate/obfuscation to be less common outside of programming and RPGs (Vampire the Masquerade).

fl4regun•42m ago

apparently 54,000. Seems like it is including even fictional words though in this test (like from fiction novels). Ironically I scored higher on the expert words (18/20) than the "advanced" words (11/20)

apical_dendrite•29m ago

plenty of words and phrases originate from fiction

quixotic, scrooge, shangri-la, Uncle Tom, gargantuan, kafkaesque, blurb, milquetoast

and words like cyberspace were first used in fiction

once real people use them, they stop being fictional words

krustyburger•15m ago

Kafkaesque doesn’t originate directly from fiction like your other examples any more than a word like Dickensian does.

croisillon•41m ago

i remember of such a link in July 2011 but i could only find that one which is a bit different

https://news.ycombinator.com/item?id=2806377

yorwba•18m ago

How did you manage to remember the exact month? https://news.ycombinator.com/from?site=testyourvocab.com

WesleyJohnson•39m ago

59,400 - It said I'm a person of few words. It also recommended I read a dictionary. I feel some kind of way about that. :D

Fun!

hmokiguess•37m ago

why use many word when few word do trick

sd9•36m ago

Interesting concept, but 100 words is really quite a lot to get through... It's tiresome trudging through the easy words at the start, and I never got to see the interesting words before getting bored.

I've seen other systems like this calibrate far more quickly by assigning a sort of score and confidence behind the scenes. Confidence starts out low and increases over time - correct/incorrect answers rapidly adjust score at the beginning, then things settle down.

In practice this means you get a sequence of increasingly uncommon words initially, until you get one wrong, then you drop back to something easier until you start getting things right again, and eventually circle around words at your level.

Also - too many clicks per word. It's low stakes, just let me click the definition once and I'll live if I misclick (or add an undo button).

datsci_est_2015•31m ago

> Also - too many clicks per word. It's low stakes, just let me click the definition once and I'll live if I misclick.

This, and accept that people will have incorrect input and build it into the confidence. Even the smartest person in the world sometimes makes clerical errors, or has the wrong neuron fire at the wrong moment.

thenthenthen•2m ago

Moly holy the clicking is too much 3 clicks that could be one :O

latexr•17m ago

> Also - too many clicks per word.

They’re also too far away. I’m on a laptop and I have to keep moving the cursor up and down just to confirm. Give each option a letter or number and let me press it to choose the answer¹.

¹ There is (was?) some service for forms which does that and it works quite well. I think it was Typeform, but I just opened the website to check and—of course—it’s now just plastered with mentions of AI so I lost interest in verifying.

mcbetz•36m ago

This reminds me of a learning resource that I can't find again: you start with an assessment of how many words you know and then you get new words in context with every session (and maybe some spaces repetition). It was mostly from newspaper articles and catered for every level of English. It was a website (ca 2013), not an app. Any ideas?

Johnny_Bonk•35m ago

I like this but it should be all operable with keyboard to be faster ie up down and 1234 for options and if its righht you just move on, maybe show synonyms in the success ui.

bluecalm•33m ago

67900

English is not my native language. I get my vocabulary from browsing the Internet. There is no way I know that many words.

kortex•30m ago

Super fun, got 70,250. Friends have always lightly ribbed me for having to go home and look up words i've used. Those remaining 100k words must be really obscure.

One suggestion would be more convincing decoy choices, some were pretty silly. But I have no idea how they come up with them.

ak_111•17m ago

Open any technical textbook in an area slightly outside your domain and you will quickly disabuse yourself of the notion that majority of words are obscure. Most complex words are just technical/jargon not archaic or forgotten.

analog8374•30m ago

this is a test for willingness to put up with the whole 100. It says something.

3 clicks per is what gives it away. and the little compliments. and that it's 100 questions

dakolli•30m ago

Cool concept. but...

Vibe coders need to be forced to spend one day learning basic CSS before they're allowed to use an LLM to make a website and the internet would be a lot more pleasant as we move forward with slopification.. It doesn't have to be sloppy, and doesn't take all that much studying to at least be able to steer an llm in the right direction to make something look nice. At this point everything is just the same 3 colors and a centered flex column with weird spacing.

sim04ful•30m ago

I notice that the concept related to the right answer sometimes has an opposite counterpart.

HaloZero•30m ago

I wish it had keyboard shortcuts, it's a bit of a sludge to click through twice.

Got 64,650: 20/19/17/18/12 (the intermediate one was a dumb mistake)

ItsBob•30m ago

Apparently I know 70,000 words... I got 90 out of 100 and it thinks I'm Stephen Fry!

popey•29m ago

That was a nice diversion. I got 76,750.

2bird3•29m ago

All the 3 incorrect answers are just indirect opposites of the correct one.Quite easy to determine which is correct, even without knowing the word

fcatalan•28m ago

71050, not bad for a non native speaker I guess. I missed 9/100.

But to be honest many that might catch out a native speaker are just the Spanish/French/Latin word, so it was too easy in a way.

femto•27m ago

I got 97/100 (80.5k) by picking the answer that has no relation to the word. Most of the incorrect answers bore some relation to the word, whether that be phonetic or a similarity to a root word.

mpeg•16m ago

Yeah I got 75k~ and did something similar ... most of the expert and grandmaster ones had at least 1 or 2 obvious incorrect answers, then it was a 50/50 so I usually went for the thing that sounded either closer to the root of the word or completely left-field

Anything up to expert was obvious

WithinReason•15m ago

Also, just pick the longest answer :)

blatherard•26m ago

It might be nice if you could unlock a "hard mode" or ability to the first 1-3 levels after a first run. I scored a little over 81K and considered playing again because I like quizzes, but doing another batch of (to me) easy words seemed like a waste.

asdfasgasdgasdg•26m ago

Not a very good test. Too easy to guess many of the words, and the words seem to follow a theme. For example my list had five or six that had to do with speaking too much or too little (verbose, lugubrious, and a few others in that vein). And many easy words were placed late in the test (e.g. zeitgeist, facetious being in the expert and grand master categories?).

And it didn't even tell me at the end how many words I know!

There is a similar variant of such a test where you just go down a list of words of increasing obscurity, ticking the ones you are familiar with. If you do this once or twice, you can get a fairly good estimate of the actual number of words you know.

zaik•25m ago

That sounds like a good application of Item Response Theory (IRT).

jdiff•25m ago

78,250 is way more than I expected. I sure don't feel like I know 78,000 words.

NickHoff•25m ago

I enjoyed some of the incorrect options. For "Debilitate" one of the options was "Remove a bill from the tab".

JauntyHatAngle•24m ago

That was fun. Bit confused by the result because it says I was "wow are you stephen fry?" Which I assume meant I did decent. (72K).

But then below it said "you are a man of few words".

I take it the latter is just because I've only done the test once? But it's mixed messaging on first attempt I think.

Joe_Cool•4m ago

Combined with the factoid it features under "how is this calculated":

    However, most native speakers have an active vocabulary between 15,000 and 35,000 words.

We must be geniuses, lol.

amarant•22m ago

Fun game! I did worse than many others here, only 69.9k estimated words. But then English is my second language, so I'm pretty pleased with the result!

moron4hire•22m ago

Lethargic had an option "having the quality of lethargy".

tennfown•21m ago

Gaikwar - which I was able to guess was a former Indian state seems irrelevant as an “English” word especially given it seems to derive from a name that I have to assume is native to the region.

alistaira•20m ago

For those interested in the nature of the later, harder words but not willing to work through the earlier sets, here are the ones from my run:

Level 0: Core Basics Abundant, Baffle, Candid, Dwell, Emerge, Frugal, Generic, Hinder, Impartial, Jovial, Knack, Lucid, Meager, Naive, Obsolete, Peculiar, Quench, Refute, Seldom, Tedious, Unique, Valid, Wary, Yearn, Zeal, Adequate, Barren, Coarse, Diligent, Esteem, Fickle, Gloom, Hoax, Ignite, Jolt, Keen, Linger, Mend, Numb, Omit, Pledge, Quota, Rural, Soothe, Toxic, Urge, Vow, Witty, Yield.

Level 1: Intermediate Acumen, Benevolent, Complacent, Dilapidated, Eloquent, Fabricate, Gregarious, Hypothetical, Imminent, Juxtapose, Lethargic, Meticulous, Nostalgia, Oblivious, Pragmatic, Reiterate, Scrutinize, Tentative, Ubiquitous, Verbose, Wane, Aesthetic, Bolster, Candor, Defer, Elicit, Furtive, Glut, Heed, Impeccable, Lament, Modicum, Notorious, Opulent, Plausible, Resilient, Stagnant, Trivial, Viable, Zenith.

Level 2: Advanced Alleviate, Breviary, Cacophony, Deferential, Ephemeral, Fastidious, Garrulous, Harangue, Iconoclast, Juggernaut, Laconic, Magnanimous, Nefarious, Obsequious, Paradigm, Recalcitrant, Sanguine, Taciturn, Ubiquity, Vacillate, Winsome, Zephyr, Abase, Banal, Capricious, Debilitate, Ebullient, Facetious, Gaikwar, Hackneyed, Idiosyncrasy, Jargon, Kindle, Labyrinth, Maverick, Narcissism, Ostracize, Palliate, Quagmire, Rancorous, Sagacity, Tantamount.

Level 3: Expert Abstemious, Bellicose, Chicanery, Deleterious, Enervate, Fatuous, Gauche, Hegemony, Inculcate, Jejune, Kowtow, Lugubrious, Mawkish, Nonsectarian, Obdurate, Pernicious, Quotidian, Recapitulate, Supercilious, Tempestuous, Unctuous, Vehement, Winnow, Xenophobe, Ziggurat, Acquiesce, Bombastic, Circumlocution, Desultory, Equinox, Fiduciary, Gerrymandering, Hubris, Incognito, Kinetic, Loquacious, Metamorphosis, Nihilism, Orthography, Precipitous, Quasar, Reparation, Soliloquy.

Level 4: Grandmaster (The Obscure) Accoutrement, Brobdingnagian, Crepuscular, Defenestrate, Equanimity, Flibbertigibbet, Grandiloquent, Hippopotomonstrosesquippedaliophobia, Ineffable, Jingoism, Kerfuffle, Logorrhea, Mellifluous, Obfuscate, Panacea, Quixotic, Rococo, Sesquipedalian, Tergiversate, Ultracrepidarian, Vicissitude, Weltschmerz, Xeric, Yclept, Zeitgeist, Absquatulate, Bumbershoot, Callipygian, Dord, Ergophobia, Fartlek, Gobbledygook, Houghmagandy, Interrobang, Kakistocracy, Lollygag, Mumpsimus, Nudiustertian, Omphaloskepsis, Pogonotrophy, Quire, Ratoon, Snollygoster, Tittynope, Ucalegon, Vagitus, Widdershins, Xylopolist, Yarborough, Zenzizenzizenzic.

grey-area•20m ago

Got a bit boring then suddenly very hard with some really esoteric words at the end in the ‘grandmaster’ level. It’d be nice if it got progressively harder without levels.

Some definitions were not great and alternatives a little silly at times but on the whole seemed pretty accurate.

Also probably needs calibrated as 96/100 was projected to 77k words, what would the estimate be for 100/100?

yousif_123123•20m ago

This was fun! And it told me I know 55k words which made me a little happy.

I'm not sure exactly how you did this, but I think you asked an LLM to come up with the wrong options. Two things to consider:

1. While the LLM can go r good options, they won't be always hard to guess. I wonder if instead you can have the LLM generate very close words (or skip using an LLM entirely) and put those as the options. 2. If you will generate options with an LLM, make sure you are mindful of its inability to shuffle things around. The correct answer was overwhelmingly the first or second option in the list. You should ask the model to give the options in a uniform order (say from true meaning then decreasing amount of replayability), then manually shuffle them so that the probability of which option (A, B, C or D) is always 25%.

ronbenton•19m ago

Some felt too easily guessable. Too many joke answers maybe?

ekjhgkejhgk•18m ago

I was doing well until I got to grandmaster.

Then I was doing poorly in grandmaster, until I realize you can ace grandmaster by just picking the longest explanation every time.

pgraf•18m ago

Really interesting, but I would love to be able to express honestly when I just guessed. This way the result would be much more scientifically sound. Four answers have a 25% chance of random correctness, which is a bit high in my opinion. I think either adding a "I don't know" or a confidence level (Known/educated guess/wild guess) would help.

naishoya•18m ago

"77,250words "Unbelievable. Are you actually Stephen Fry in disguise?"

kubb•13m ago

Same here (72 750) but it doesn't feel right. I'm not a native speaker and I was able to guess some of them via elimination or cognates.

I'd say I know 10 000 words tops.

collabs•18m ago

I got 70,750 which is much higher than I expected. The early words were obvious. However, a lot of the later questions I could only answer because they were multiple choice. If I had to actually come up with a definition, I suspect my score would be much lower.

ortusdux•16m ago

YouTuber John Green has been working on finding a way to quantify the number of names the average person knows. IIRC, his estimate is around 5k, which seems both crazy and reasonable at the same time.

WithinReason•16m ago

81,250 97/100 without being a native speaker. Although truth be told only because I figured out how to guess well.

itvision•16m ago

Scientific Estimate: 36,250. Nah, I'm far worse.

Probably not too bad for a person whose native language is not English.

fp64•15m ago

When there are two options that describe exactly the opposite of each other, it will be one of them. Reduced a bit the fun - but then again, for some words I understood what they are dealing with, but not whether positively or negatively.

franciscop•14m ago

Only got 63,150 words. Considering English is the 3rd language I learned, I think I did pretty well.

EstanislaoStan•14m ago

Literally when I got to advanced and beyond just picking the longer and more complicated looking answer was the right one. I think this test is extremely flawed.

nickcw•13m ago

I have a copy of the shorter Oxford English Dictionary from 1970 which I inherited. It is two massive volumes and is only shorter in comparison to the full dictionary which is 12 volumes (more in more modern editions).

My shorter OED contains 163,000 words (compared to the 600,000 words of the longer).

According to this site I know 71,000 words... Let's test that against the OED. I should have about 43% chance if knowing a word picked at random.

In my totally scientific test (ha) I chose 50 words at random from the OED and discovered I knew 29 of them for a score of 58% which is more than two sigma from 43%, this disproving the hypothesis.

I forgot what that was now, but it was a fun experiment.

cake-rusk•11m ago

Apparently I am Stephen Fry in disguise :D

My score: 78,000 words, 20/20/19/18/18.

waterpowder•11m ago

69,250 (91/100) - I think being French helped a lot for the most complex words, as they're basically the same!

walthamstow•7m ago

76250, or 93/100. Native English speaker from London. Some of the last 10 words were seriously obscure.

Are accoutrement and ziggurat really English words? Accoutrement is even pronounced as French!

jcattle•7m ago

there's also https://www.myvocab.info/en

From what I can tell they actually have a bit more robust science behind their algorithm (and a lot less questions to answer)

spelufo•6m ago

Nice. I want one in Spanish so I can compare results.

srean•6m ago

In addition to how much fun it was, it has potential pedagogic value.

It would have paired well with an exposition of vanilla Monte Carlo and the benefits of stratified sampling.

Although stratified sampling is good, one can do better in this case by using adaptive sampling, where one uses a runtime (Bayesian) estimate of vocabulary to maximize information gain per question.

Joe_Cool•5m ago

Getting "Obfuscate" as #99 and "Quixotic" as #100 made me feel exorbitantly smart.

alentred•2m ago

Good fun! At first I was scared of having to answer 100 questions, but when the words got more sophisticated it turned to be more engaging. Also, the result is good for self-esteem! :) Many thanks to the author!

I wonder if the test is calibrated to the fact that some answers are just well-guessed? I am a non-native English speaker, but I speak 3 languages overall and have basic notions in Latin, and I have to admit it helped a lot in "deciphering" a few words that I didn't really knew. And in at least 2 cases I just guessed well.

Findecanor•1m ago

I got an estimate of 70,550, from a score of 87/100 (20/18/16/17/16).

I suppose the words must be weighed, because other people in the thread with more correct words got a not much higher estimate.

European Delusions and Danish Drones

My agent stack for automating my personal life

A Stressful Game

I feel like giving up on coding

Zeroshot, an open-source CLI for coding-agent verification loops

NY mayor, other leaders push to ban horse-drawn carriage rides after teen death

Why Do So Many Everyday Systems Feel Harder to Use Now?

Show HN: Claude Code for talking to potential customers

The AI slop refactor wave is coming

Bcachefs exits experimental status in new 'performance release'

DirectX Dump Files Preview Now Available

Blaise v0.11.0 Is Here

Measuring AI productivity yourself with gh, jq, and Git

The Accidental Framework

JEP: Embed Python in Java, the Polished Way

The Bottleneck in AI Was Never Intelligence

FlashAttention-4: Algorithm and Kernel Pipelining Co-Design

APTA Feature on Cellular in Transit Systems

Why doesn't Get­Last­Input­Info() return info for the user I'm impersonating?

Ghost Boxes: Reusing Abandoned Big-Box Superstores (2016)

Factorio: More Planet Deliveries

Cutlass Tutorial: Efficient GEMM Kernel Designs with Pipelining

Extend Claude limits by offloading AI tasks to Neo

P-Tau217 as a Reliable Blood-Based Marker of Alzheimer's Disease

I restarted a 10 year old Xeon 174 times to delete 12 flags and gain 4 tps

Italy's Meloni says Trump 'made up' story that she 'begged' him for photo at G7

An agile retrospective skill for your Claude collaboration sessions

Ask HN: How do you measure whether your coding agent follows its rules?

Toward Better Hip Kernel Generation for AMD GPUs

Moving to Portugal: Complete Guide for Americans