We've two letters that are visually similar to yogh 'ȝ': 'ვ' (an English 'v', or 'w' if you're Tbliseli), and 'პ' (a hard 'p'). Our 'gh' sound is 'ღ'.
I have no doubt that two very disparate languages and scripts will find a few similarities simply due to proximity. Georgia and England (UK) are close enough for a fair amount of cultural exchange.
People have an extremely distorted perspective on European history for many reasons, but the late industrial age nation state probably had the biggest impact on that mental model people still have today in many ways. By all evidence I’ve seen, the cultural exchange in the distant past was far more organic than most people can easily imagine today for many reasons. Trade and cultural communion, religious exchanges and defensive unions all made that possible in a world that was not at all as controlled and authoritarian as we even experience today. It all waxed and waned over the centuries and regions of course, in a rather organic manner; but due to practical limitations a lot of the authoritarian restrictions we are all subject to today simply did not exist.
In some ways the USA until about 1960, is probably the most similar analogue of how Europe seems to have generally been for the longest time leading up to the Industrial Revolution. It was a land of general regions of self-regulating, cultural clustering with local levels of varying jurisdictions and power structures which to a large extent kept most people in their home region, if not their place of birth. By the latter part of that period the identity with one’s state and region and local culture had already largely succumbed to the oppressive force of the centralized dominating power of the federal and global power, but your region was still largely your cultural identity as a person and community.
That of course has all been totally razed and destroyed now and the USA effectively exists in name only today, which has been the case for an even longer time, but that’s a different topic altogether.
And AFAIK Georgian alphabet predates Insular script by a few centuries.
It's not uncommon for otherwise unrelated alphabets to come with similar symbols simply because the trend over time is towards simplification of letter shapes, and there are only so many basic elements. So even originally quite different characters can end up looking very similar when reduced to a few squiggles.
In the last week, our most famous Menzies passed away - politician Sir "Ming" Campbell.
Some wag made a 'ming vase' of his face: https://www.portrait.gov.au/portraits/2004.176/ming-vase-sir...
different hill, but one I would die on is: as the letter "c" should make the "ch" sound, the letter "c" serves no purpose not already handled by "s" or "k" otherwise
that last one is hardly fair - gist and mirage are french words. might as well complain about the silent letters in rendezvous or faux pas.
So surgery is full of -ectomies instead of -cut-outs.
I wonder do German brains work on a much longer context window because of the language?
b) the official title of the law was "Gesetz zur Übertragung der Aufgaben für die Überwachung der Rinderkennzeichnung und Rindfleischetikettierung", so how again is it that English "gets to use a sentence" and German doesn't? German has the choice depending on context, sometimes having one word is convenient.
The composed word also has a specific meaning that the same words with space between doesn't. For example "das rote Kraut" – "red herb" and "das Rotkraut" – "red cabbage". Also suppose "red cabbage" was grown in abnormal conditions, so it doesn't have the color pigments, it is still "red cabbage", but not "red" "cabbage". This is awkward to state in English, but no problem in German.
Maybe, but more due to the spelling of numbers and long sentences. Compound words are not an example of this, since Germans can parse these words just fine as different things. It just means that the lowest "tokenization" in everyday use is not the word, but subcomponents of them.
Do English native speakers "tokenize" expressions in words? Do you see it as '(labelling) (of) (minced)' or '(label)l(ing) (of) (minc)(ed)' ?
I can't speak for most Germans, but the algorithm I think I use is just greedy from left to right. This is also consistent with how mistokenization in common puns works, so I think this is common.
In primary school we trained to recognize syllable boundaries. Is that just a German thing, or is this common in other countries? You need to know these for spelling and once you know these, separating word components becomes trivial.
Not entirely true. English, as any other Germanic language, still likes to compound words to produce a new meaning, the main difference is that, as opposed to most other Germanic languages, spaces are usually retained in writing. But this is just a spelling difference, the underlying process is the same.
Let's consider "scheepskapitein", "Schiffskapitän" and "ship captain". All three are formed the exact same way and mean roughly the same thing, but it's customary in Dutch and German to spell it without a space and in English it's considered correct to have a space in between. Note, that there are no spaces in speech, it's simply a writing convention. So, how many words are there in this example?
Sure, linguists can dissect everything and should, but how does the English laymen perceive it?
(To put it another way, most native speakers treat "ts" as two sounds but not "ch")
Luckily there are other wasted characters, like "x" and "q".
Maybe as a fun pet project someday!
For example, in Year 1 that useless letter "c" would be dropped to be replased either by "k" or "s", and likewise "x" would no longer be part of the alphabet.
The only kase in which "c" would be retained would be the "ch" formation, which will be dealt with later.
Year 2 might reform "w" spelling, so that "which" and "one" would take the same konsonant, wile Year 3 might well abolish "y" replasing it with "i" and iear 4 might fiks the "g/j" anomali wonse and for all.
Jenerally, then, the improvement would kontinue iear bai iear with iear 5 doing awai with useless double konsonants, and iears 6-12 or so modifaiing vowlz and the rimeining voist and unvoist konsonants.
Bai iear 15 or sou, it wud fainali bi posibl tu meik ius ov thi ridandant letez "c", "y" and "x" -- bai now jast a memori in the maindz ov ould doderez -- tu riplais "ch", "sh", and "th" rispektivli.
Fainali, xen, aafte sam 20 iers ov orxogrefkl riform, wi wud hev a lojikl, kohirnt speling in ius xrewawt xe Ingliy-spiking werld.But we'd still be arguing about how to pronounce "ᵹif"
Catalans seem to pronounce "caixa" fine, so I think they _could_ say "Shabi"... But this does back up your larger point about "x" -> "sh" in Catalan.
j -> dzh is more weird than anything.
Vowels, of course, are a cause of war between dialects; nobody can even agree how many there are.
Esperanto has a nice trick where they reserve "x" as a modifier letter, so if you can't use diacritics you write "cx", "sx", "jx" etc; but it does not have a sound value of its own and can never occur by itself. We could extend this to "tx" and "dx" with obvious values, and also to vowels - "a" for /æ/ vs "ax" for /ɑ/, "i" for "ɪ" vs "ix" for /i/ etc. Using "j" the way it is today feels somewhat wasteful given how rare it is. In the x-system it would probably be best represented by "gx", and then we could have a saner use for "j" like all other Germanic languages do. Which would free up "y" so we could use it for the schwa.
One thing that occurred to me the other day is that "x" is also a diacritic, so we could just say that e.g. "sx" and "s͓" are the same thing. Then again from a purely utilitarian perspective a regular dot serves just fine and looks neater (and would be a nice homage to Old English even if ċ and ġ are really just a modern convention).
Vowels, yeah... I think it's pretty much impossible to do a true phonemic orthography for English vowels that is not dialect-specific. As in, either some dialects will have homographs that are not homonyms, or else other dialects will not have the ability to "write it as you speak it" because they'll need to use different letters for the same (to them) sound. In the latter case, it would become more of a morphological orthography. Which would still be a massive improvement if it's at least consistent.
OTOH if you look at General American specifically, and treat [ə] and [ʌ] as stress-dependent allophones, then you can get away with 9 vowel characters in total (ɪiʊuɛəoæɑ). That's pretty easy with diacritics.
What’s the ch sound? My intuition from German class is that ch represents a throaty hhhh. Somehow that got spoiled into k in most English words.
Every c in Pacific Ocean is pronounced differently. C is a silly letter.
It varies between dialects. Swiss German speakers tend to stick out to Germans because we pronounce the ch in a much scratchier way than is accepted in Standard German.
If you mean the standard German from Germany, there a two variants. At the end of a syllable it is like you described (kind of throaty hhhh). For the beginning of syllables think of sh and open your mouth.
-- Caeser, seizer of the day
It's funny you use "tube" as an example though, as in my British accent I pronounce that as "chube", whereas I believe many Americans would use a "t" sound for that word. Not sure how you settle on a spelling in those cases.
That aside, what you describe is a distinction between yod-dropping and lack thereof, and whether and where it happens is highly dialect dependent.
cent / sent / scent
ceiling / sealing
cite / sight / site
colonel / kernel
carrot / karat
cue / queueHow is it that you can say these words without confusion?
Language is context sensitive and you understand the word based on the context around it. Likewise, you understand homographs based on the context. Because of this, spelling isn't as important as it might appear.
practice / practise licence / license
Most don't bother because context is nearly always sufficient.
Exceptions to this are generally loan words, particularly from French (eg chaise, which sounds more like "sh"). Others are harder to explain. "Lichen" springs to mind. Yes it technically comes from Latin but we're beyond the time range to truly consider it a loan word.
There are also some "ch" words of Greek origin (IIRC) that could simply be replaced with "c" or "k" (eg chemistry, school).
"Kh" on the other hand I think is entirely loan words, particularly from Arabic. Even then we have names like "Achmed" that would more consistently be written as "Akhmen". "Khan" is obviously a loan word but I think time has largely reduced the pronunciation to "karn" rather than "kharn" if it ever was that.
But I can't think of a single "kh" word that pronounced like "ch" in "chair".
"Sh" doesn't seem to crossover with any of these pronunciations.
What happened in short was that the greeks copied the ancient and now virtually defunct phoenican script, varieties of which were used across the region and kept the names even though they made no sense in the context of Greek, added vowels and wrote it from left to right.
The russians adapted the script in one way, Latin in another, Hebrew and arabic took entirely separate paths and now the only thing they share in common is alphabets that follows vaguely the same ordering.
It goes to show just how powerful the idea of writing is - once you have a society where it's pervasive, all its neighbors acquire it from them in short order, and they usually do so by adapting the original writing script to their needs. I strongly suspect that the reason why alphabets (and to a lesser extent syllabaries) spread especially widely is because they are easier to adapt to a different language - usually, once you've learned a new alphabet, it's more or less readily obvious how to use it to approximate any language that you already know.
(Which is also how you get imperfect spellings even in brand new orthographies. Practicality usually beats purity.)
In some way everything has it's roots there: language, numbers, math, philosophy, politics, religion. And earlier humans itself moved in the same direction. It's just were the large population and the high culture used to be. The last remnants were purged in the middle ages and now by islamic fundamentalists.
We pretend phonics exists, but it's just a lie we tell little kids to kickstart their learning. In reality, English spelling is more like learning Kanji. The original meanings of the words is warped beyond belief and we tell the specific pronunciation of specific letter sequences based on the surrounding letter sequences (much like telling which Kanji reading to use based on the surrounding Kanji). Words aren't so much sounded out as memorized and because English has such a massive vocabulary, the memorization work needed to be proficient is very extensive.
The classic example of this is "ough" which has NINE different pronunciations for the same letters and no real rules to indicate which one should be used. Spelling reform would make such situations completely unnecessary.
Languages with more phonetic alphabets tend to have much higher literacy rates for the same education quality and literacy can be achieved much faster. This works because once you memorize the sounds the letters make, you can sound out any word or write any word (provided you pronounce it correctly). The memorization process slowly kicks in where common words are still sight-read, but that process can happen much sooner and the individual can start independent reading much earlier with a focus on comprehension rather than memorizing weird rules and exceptions.
English departments have done massive damage in this regard. English started finalizing how words would be spelled around the same time the great vowel shift happened and completely screwed up everything. We then mass-adopted words with foreign spellings that used completely different phonetic systems. Despite the issues, English departments insist that these bugs are actually features despite the great harm they cause students and not only codify them, but denigrate all attempts to fix the problems.
English departments aren't the only ones. Even 150 years ago when Webster was trying small spelling reforms (some stuck around and some did not), people complained that the writing was childish. When Teddy Roosevelt tried a further spelling reform of getting rid of unneeded letters, he was turned into a laughing stock for the same reason (again with a handful sticking around). Modern "text speak" is yet another unofficial attempt to simplify spelling so it is more consistent, but once again, better, shorter alternatives are derided as making someone look unintelligent.
This still doesn't deal with the more fundamental phoneme/alphabet mismatch though. English has 44 common phonemes and a bunch of less common and regional sounds (for example, the χ sound in "cloCK"). Our adopted Latin alphabet has 26 letters of which at least 3 are unneeded (C as K or S, Q as KW, and X as KS). This leads to a horrible situation where a lot of sounds no longer have letters (Futhorc didn't get all the sounds, but still did better with 33 letters of which something like 11 were vowels). Some English sounds like the S in "treaSure" seem to have no real, unique spelling at all. Others like th and th have no indicator if it is supposed to be voiced like "THen" or unvoiced like "THink" (we used to have thorn and eth for this). We have 18 unique consonants and 24 common consonant sounds.
The vowel situation is even more dire. We have just 5 vowels and around 20 common vowels leaving each vowel desperately overloaded with all kinds of weird phonics "rules" and almost all of them having either multiple rules or different pronunciations for the same word (eg, "reed" vs "red" in "I read the book"). There needs to be massive vowel reform (either a ton of stable digraphs, diacritics, or more letters) so that sounds can be differentiated properly.
Spelling reform could all but eliminate our illiteracy problems and open a whole new world of possibilities to more than half of all Americans. In a world dominated by ever-increasing volumes of information, these people would have much better lives if we lowered the bar of learning to read to something more attainable.
There are certainly differences, but if you place current English spelling next to something like Shavian (or some other language with near-pure phonetic spelling), I'd say that Modern English learning patterns are closer to Kanji than the pure phonetic alphabets.
One thing that worries me is the widespread adoption of english words and nouns in many languages. The list is ever increasing, even though the word makes absolutely no sense out of the context of English, cannot be adapted by a mon english speaker to have anything more then a single, rigid meaning. It's annoying enough for me when some books use French words. I don't know how everyone else copes.
As for literacy, i find it hard to believe the true statistics are as dire as you say but i'm prepared to accept that it is. Firstly, what are the statistics for contemporary societies with more sensible spellings? And can better education help? A final point, you clearly know far more about this topic then i do, but would adding half a dozen letters to the alphabet really help with increasing literacy?
https://www.thenationalliteracyinstitute.com/2024-2025-liter...
> It's annoying enough for me when some books use French words.
From around 1060 to 1360, French was the official language of England. It wasn't normal French though as William the Conqueror spoke Norman French. Both French dialects mixed in what can only be considered English style. For example Norman French said Warder while other French speakers said Guarder. English adopted both Warden and Guard, but gave them two different meanings. Overall, some 30% of our words are French though over 800 of the most common 1000 words are English in origin.
> would adding half a dozen letters to the alphabet really help with increasing literacy?
ITA (International Teaching Alphabet) shows the benefits and problems.
ITA students rocketed ahead the first couple of years and could read way more words than their traditional counterparts. The problem was the transition. Learning both systems seems to have evened things up or maybe even caused a net negative for ITA students. I believe this was because they had to learn two sets of spelling for everything. If you would like to see the difficulty in learning a new way to read/write and have a bit of fun, try learning Shavian script.
In an ideal world, they would have phonetic spelling only. I believe under those conditions that their advantage would continue to grow all the way through school. The problem is that this study is unethical to conduct because even if it is correct, the students would graduate and be unable to read traditional English which would permanently harm them.
This leaves the tricky problem of bridging the gap. This can't be done too quickly or the older generations get left behind. There's also an issue of transcribing everything into the new spelling. Technology has made that easier than ever, but it would still be a very hard proposition.
The first and easiest step is cleaning up the spellings using the letters we currently have. Stuff like all those -ough endings get rewritten in sane letters as an accepted alternative spelling. Silent letters start going away. We start moving toward consistent vowel and consonant digraphs. This will take time for older people to adapt to, but more consistent rules will mean they will have an easier time sounding them out.
After this, we start adding back letters. Maybe eth and thorn come back for the two "th" sounds. We certainly need a new letter for the S in "treaSure" and maybe bring back the elongated S to use for SH. At some point, we then start working on slowly adding new characters to stand in for the vowel digraphs.
I don't think you could convince adults to do more than a couple of steps at a time each generation. Such a plan would likely take decades to maybe even a century or two. Until the creation of the printing press, such slow changes were considered normal. Only in recent times have we attempted to gate-keep what "real English" is. If we allow the language to grow more organically, I think it could be guided into something far better than we have today.
They all wrote the fancy 'g' rather than the simplified 'g' we use now. I assume they copied text from textbooks rather than (say) from teachers from England.
As a real young'un I used to attempt to do the same as an exercise but it's not easy to make it look good.
The other sound that ‘ȝ’ once spelled is the “harsh” or “guttural” sound made in the back of the mouth, which you hear in Scots loch or German Bach.4 This sound is actually the reason for the most famous bit of English spelling chaos: the sometimes-silent, sometimes-not sequence ‘gh’ that you see in laugh, cough, night, and daughter. Maybe one day I’ll tell you that story too.
[0] https://en.wikipedia.org/wiki/Consonant_cluster
[1] https://nl.wikipedia.org/wiki/Medeklinkerstapeling (Dutch wiki page for consonant clusters with more examples)
It's funny I just started reading and understanding the first paragraph, before recognizing that this is a foreign language, I don't know at all.
Quick reminder that writing != language. Even the highest fidelity writing systems are lossy encoding systems. In fact, the more phonologically accurate a writing system is to its language, the more it obscures the history of its words, especially words borrowed from other languages.
So from the perspective of someone interested in etymology, English writing's tendency to preserve old and foreign spellings is a good thing.
Of course it also depends on how conservative the language is, like Finnish orthography is practically IPA, and yet Finnish is a freaking time capsule for words like borrowed Proto-Germanic *kuningaz and *wīsaz, which became king and wise in English, but kuningas and viisas in modern Finnish. So you can have both phonemic writing as well as etymological transparency if your phonology doesn't change much.
And OTOH even modern English spelling often doesn't distinguish differences that are there in most dialects (e.g. "bear" vs "near"), so this isn't even a new problem. Realistically I suspect there's some "minimal reasonable set" of phonemes that need to be distinguished to reflect the most prominently distinct pairs in all major dialects, even if some subtle dialectal distinctions might not be reflected in spelling.
eg. Egyptian Heiroglyphs, or Asian characters (esp. Korean which has a relatively young alphabet - which IIRC is phoneme based, or Chinese which has a very old set, which is used across multiple languages (eg. Mandarin/Cantonese/etc)
Korean Hangul is not ideographic (I think what you meant by pictorial?). It's a morphophonemic alphabet that just happens to organize the basic phonemic units into larger graphemes representing whole syllables - but in a completely predictable way. And it is another example of this playing out: the original Hangul was entirely phonemic, but over time pronunciation diverged from spelling, and today it's morpho-phonemic, and even then not perfectly so. So they preserved the history at the cost of some mismatch between the spelling and the sound.
Chinese' pictorial writing completely obfuscates the historical state of the spoken language, to the extent that in order to reconstruct older phases of the spoken Chinese language, scholars have had to inspect old Chinese loan-words in foreign languages that do preserve the old phonetic structure.
An example of this is the discovery that Chinese tones developed from earlier final-consonants, which were lost in Mandarin, but are preserved in Cantonese, Japanese and Korean. i.e: Mandarin "guó" compared with the early borrowing into Japanese : "koku", both meaning "country".
This increases the complexity of learning to write the language -- 56 letters in alphabet and each combination of consonant+vowel and consonant+consonant takes on a different letter form instead of just being a string of independent letters like English.
But reading / pronunciation is straightforward. (No we don't have spelling bees :) )
Tibetan, Mon-Burmese and Thai scripts, as an example, all derive from the Brahmi script (through a long and sometimes windy ancestry), but neither reflects the modern pronunciation, hence mind numbing transcription systems.
Tibetan and Burmese languages are particularly notorious for codifying the archaic pronunciation of respective languages that has been frozen in time for centuries. It is a treasure trove for linguists that have got a time machine for free, but I don't think that the same can't be said modern speakers of both languages.
Indian languages are generally rich in phonemes though. My mind boggles at the notion of [n] [ɳ] [ɲ] [ŋ] all being distinct. I mean, I can reproduce each one of them on its own, but doing that in rapid speech, and worse yet, recognizing the same in others' speech...
They are phonetically distinct, but not phonemically distinct, which is to say that in most places they occur, they aren't used to distinguish words or meanings.
In particular, the velar nasal "ङ" or "ng" always appears adjacent to a velar sounds (k/kh/g/gh) and similarly the palatal nasal "ञ" always appears adjacent to palatal sounds (c/ch/j/jh), both as allophones of the nasal phonemes "m" (bilabial) and "n" (alveo-dental), basically just like we speak in English under the exact same conditions (like the nasal in the word "English"!)
You perceive a difference with Indic language and English because the Latin script doesn't distinguish nasals for velar and palatal points of articulation - it only distinguishes by bilabial (m) and alveolar (n), whereas Indic scripts do distinguish those, even though they offer no additional information.
The unique nasal sound which is often contrastive in many Indian languages is the retroflex nasal "ण" (ṇ). That's the one that it's easy to confuse in speech if you are not a native speaker, so it's the only one you need to pay extra attention to when learning.
But, as far as I know, the different nasals are phonemic in some languages of India. Which ones depends on which languages, but I do remember seeing at least one in which all four of these were distinct.
None of the major Indian languages I'm familiar with have 4 nasal phones, from either the Indo-Aryan or Dravidian language families.
In the Indo-Aryan languages, the convergence of the various nasals is so complete that they are all often represented with a single "dot" diacritic character when they occur at word junctions.
I'd be open to hearing examples of Indian languages that have 4 nasal phonemes, though.
m (ಮ) n (ನ) ɳ (ಣ) ɲ (ಞ) ŋ (ಙ)
There are 5 nasal glyphs, but like in other Indian languages, 2 (velar and palatal) are allophones of the others, leaving only 3 actual phonemes. Indian scripts are often overspecified, and not every glyph represents a phoneme.
I don't think that's true. From the northern Indian languages schwa deletion (https://en.wikipedia.org/wiki/Schwa_deletion_in_Indo-Aryan_l...) to the extreme divergence between the standard formal and spoken forms of languages in Southern languages (https://en.wikipedia.org/wiki/Diglossia), it's a stretch to say the scripts mirror what is spoken.
It's just that if you are a native speaker/reader, you are so fluent that you unconsciously auto-correct those inconsistencies - just like in English.
Even in the formal registers of each spoken Indian language, which should be in theory be more systematically consistent with their scripts, there are inconsistencies in spelling/pronunciation of loan-words from both foreign and other Indian languages (i.e. aspirates in South India and retroflex approximates in northern India, and any number of inconsistent renderings of English words in Indic script).
On the topic of screwball spelling is this video essay on silent letters. The fun takeaway for me was that a lot of silent letters were never pronounced. it is just that when some of the first dictionaries were being produced, and the spellings decided on, they decided to introduce silent letters to indicate the origin of the word. the b in debt is because it comes from the latin debitum. but it was not spelled that way until the 1500's prior to that it was dette.
https://www.youtube.com/watch?v=NXVqZpHY5R8 (RobWords: Why English is full of silent letters)
Is that true? Seems like it's in every other word when I visit Spain...
https://translate.google.com/?sl=da&tl=en&text=rødgrød%20med...
(click the speak icon)
c+a/o/u sounds like k (casa, cosa, cuchara) and z+e/i does not exist.
- Words that are only written with Z: zepelín, zigurat, zigzag.
- Words that can be written with either Z or C: zénit, zinc, zirconio, azimut.
My main point is that if you hear the sounds "th" (za) with e or i 99% it is cena or circo and will not be zena or zirco. It is an heuristic but very reliable.
P.S. Or pececín, as a random example :P
with
lang_sounds as (
select
lang,
unnest(string_to_array(ipa, null) ) as sound
from world_dictionary
),
totals as (
select
lang,
count(sound) as sound_count
from lang_sounds
group by lang
)
select
lang,
totals.sound,
count(sound) / totals.sound_count
from
lang_sounds join
totals on
lang_sounds.lang = totals.lang
where sound = 'θ' or sound = 'ð' or sound = 'θ̠' or sound = 'z'
group by lang, sound
order by count(sound) / totals.sound_countThere's no single th sound in English. There's a few different sounds you get from that letter combination in different words (and in different dialects).
English speaking Swedes often transform this silent b into a spoken p which is about as awkward a result as you're imagining.
Not as rare as the lateral fricative, the "ll" in Welsh and "hl" in Southern African languages. e.g. in the place names Llandudno in Wales and Hluhluwe in KwaZulu.
And then there's the clicks in Southern African languages. Which are usually written as a Q not followed by a u. e.g. in the place name Gqeberha or the Mbaqanga music style.
<q> is only one of the clicks in Bantu languages! The letters <c> and <x> are used for other kinds of clicks too (for dental and lateral clicks, respectively).
And then of course there are the Khoisan languages, which use a completely different set of click letters: <ʘ ǀ ǁ ǃ ǂ>.
It's literally in the name.
[1]: https://en.wikipedia.org/wiki/Voiceless_dental_and_alveolar_...
"I speak a number of languages, but I'm not a linguist. How could I get a list of sounds that aren't included in the languages I speak? Would be interesting to hear what sounds I'm not used to hearing"
<Long blurb about what a great idea I had, and an intro to how to do it>
If you’d like, you could just tell me the languages you speak, and I can give you a first-pass list of major sound types you’re probably missing (like clicks, ejectives, tones, pharyngeals, etc.).
Would you like me to sketch a concrete example with your languages?
"Sure, here's my list: <my list>"
And then it gives a list of sounds you know, and sounds you don't. Pretty cool. It even has links to a sound map site so you can hear them.
If you like, I could generate a concrete list of IPA symbols that are not in <my list> — a sort of “negative phoneme inventory”. Would you like me to build that for you?
"Yes please. Also what additional language would cover the most ground?"
One language that covers the most new ground
Amharic (Ethiopian Semitic) is a superb single add-on because it gives you, in one go: • Ejectives (tʼ, kʼ, qʼ, sʼ/tsʼ), • Pharyngeal consonants (ħ, ʕ), • Uvular stop (q), and • True gemination (contrastive long consonants).
If you want a two-language combo to “max out” the world’s rarities, add Zulu (or Xhosa) for clicks alongside Amharic.
"æ" even has the same obvious sound value still, it can even keep the name "ash"
"þ" and "ð" for the two th-sounds (the former unvoiced, the latter voiced).
And if "ᵹ" is readmitted specifically in its affricate capacity, i.e. for "g" in "gem" etc, then "j" could instead be used with the value it has in French, i.e. used to spell words like "measure" - this is one of the few English phonemes that doesn't have a definitive letter associated with it right now.
Always use "k" for that sound and repurpose c/q/x for something else, and we could ditch the digraphs completely.
I'm in. It would also clear up some ambiguity over the pronunciation of "gif" ;-)
"J as in jif"-p-e-"g as in gif"
Get it right you people!
even in Contemporary OE, there are very few minimal pairs between /þ/ and /ð/ ("thigh" vs. "thy" comes to mind, but not much else apart from rare noun/verb combos like "loath" vs. "loathe"). it could be argued that we don't really need both, but the (surely obvious by now) pedant in me desires both. saying "boð" rather than "boþ" doesn't strictly change the meaning, and is not likely to cause confusion, but it sure sounds off!
"ð" also has the nice property of being sufficiently similar in looks to "d" that someone reading it and not familiar with the letter might pronounce it as "d" and thereby make a decent approximation. Unfortunately that's not the case with "þ" though. If we ignored the historical letters and instead just picked something reasonably sensible, I'd probably go with "đ" and "ŧ" for this reason.
Was it, though? By comparing English and Dutch you can clearly see that one of the ways this harsh "gh" changed in English is it became "y" as in "yesterday". "Weg" (Dutch) - "way" (English), "gister[en]" (Dutch) - "yester[day] (English), etc. I wonder if at the time pronouncing it as "gh" was still common and this would make using the same letter in some words much more logical.
In Holland the g will be hard while in West Flanders "weg" is much closer to English "way" with a very soft g.
And to be honest, somebody that refuses to even try to understand that "Twee croissants en een koffie, alsjeblieft" means "Two croissants and a coffee, please", but replies with "English please" instead can kindly go fuck themselves.
At least TRY
Especially the word "croissant" is tricky. Chances are they they only got "koffie" in your example sentence and had no idea what came before it.
I don't know any Germanic languages, but it turns out I can understand a surprising amount of Dutch just by closing my eyes and listening to the flow of the sounds.
Afrikaans is even easier, as I get the impression that the word order is almost always the same as in English, whereas in Dutch it sometimes isn't (I think... as I say, I don't actually know either language!)
1) Dutch is not an easy language to learn if you speak English. Both the grammar and the vocabulary are way closer to German than to English. The grammar works in a different way, there is an actual grammatical gender, the vowels sound different, etc.
2) There is a lot of content to consume. Let's just start with programming oriented to older children, like the Klokhuis or Jeugdjournaal, which are both in simple language and interesting to adults as well. It's really all there.
3) when taking to people, they want to have a conversation and will switch to a language where that goal can be reached. If people consistently switch to English to you, the sad fact is that your pronunciation and/or grammar are not good enough yet, and you can't assume that the role of random people is to be your teachers.
1) Yes, Dutch is even closer to German, but it doesn't change the fact that it's very close to English. The grammar is very close, a lot of words are clearly related, learning to read Dutch when you are already profficient in English is a breeze. I never said they were mutually intelligible, but they are close enough to make the leap from one to the other much simpler.
Just as an example, let's consider three words from three different languages that mean the same thing: "gestolen", "volé" and "украденный". Which one of them is closer to the English word "stolen"? Not only in its stem form, but even in the way this irregular verb is conjugated.
2) Some content does exist, but not much. Again, comparing to a language with a much bigger speaker base, like French, let alone English. The difference is orders of magnitude, it's very noticeable when trying to find something interesting to watch or listen.
3) Every time I go to a country where English is less prevalent, I'm forced to learn and practice the language. The same pressure just does not exist in the Netherlands, and the sad fact is, people do not like making unnecessary effort. It is an obstacle and it doesn't help, no matter how you look at it
2) Obviously there's a huge difference in available content compared to bigger languages. But I do think there's enough, especially if you're not too picky (and you shouldn't be if your goal is to learn the language).
3) it is true that English proficiency is more prevalent in the Netherlands than elsewhere. People don't like making unnecessary effort anywhere, and even if they do, the conversation turns into a lesson, which is I guess not generally desirable.
The influence of English does exist in Western Dutch accents. Speaking like somebody that sounds like they're coming from Leiden or Vlaardingen should be much easier for people from English speaking countries. Like the r that sounds like a w, like the English pronounce "rare".
yesteday -> gestern way -> weg
But the mistery is: how was it pronounced in the old English? The modern y was possibly a norman induced change, and people started reading it as modern y instead of "g" ?
The process is actually fairly straightforward. First you start with a common language that has two allophones for /g/ which are ~ [g] and [ɣ], depending on context; in this case, "weg" was [weɣ].
In Old Dutch [ɣ] then becomes subject to final obstruent devoicing, giving [x] of the modern Dutch pronunciation of the same word.
Meanwhile in Old English /k/ and /g/ (in either of its incarnations) palatalized in various environments instead. For [ɣ] in particular, it became palatalized after [e] in most cases - thus we get [weʝ]. And then [ʝ] is already very similar to [j], and gradually evolved into the latter. This all has already happened by the time most Old English texts were written.
My understanding is that a modern Greek gamma (γ) is the same way: actually a voiced version of "loch". So before "back vowels", it sounds to modern English speakers like a hard 'g', but before "front vowels" it sounds to a modern English speaker like a 'y'.
As I watched my son learn to speak, then later to read and write, I paid attention to his misspellings and they all made perfect sense if you approached the language phonetically.
The first sentence he wrote for me was:
“my daddy and i tocd on d woki toki” (“My Daddy and I talked on the walkie talkie.”)
At first confusing, but it omits all the silent, or effectively silent because native speakers usually omit the sound, or abbreviate it so much that non-native speakers would miss it if they didn’t know it was there. Like “l” in “talked”.
And don’t get me started on irregular conjugations.
In English, it would frustrate me if every time I read marry it would be spelled the same as merry, because I say those differently, even if most Americans I come in contact with daily do not.
Spanish spelling is much more phonetic than English, but one thing I admire about it is when things are correctly spelled, you can adjust the text for accent based on spelling. The only area I can think of where spelling does not denote contrast in any accent is between b and v. (Edit: and that h is silent.) There are some contrasts that are only kept by a minority of speakers (Ll is merged with Y for most people, but maybe not in Bolivia or parts of Spain; s and z are merged in Latin America), but it's still a meaningful contrast for somebody.
Edit - I looked up and there is a translation thing online.
"this is text translated to a phonetic version" goes to "ðɪs ɪz tɛkst trænzˈleɪtɪd tuː ə fəʊˈnɛtɪk ˈvɜːʒᵊn" in British pronunciation and "ðɪs ɪz tɛkst trænˈsleɪtɪd tu ə fəˈnɛtɪk ˈvɜrʒən" in American.
Not sure it's going to catch on.
These samples are using the International Phonetic Alphabet, which isn’t really designed for the purpose of being a practical orthography — it’s used as a language-independent transcription method which can represent any sound. (That’s also why the British and American versions are different: different accents use different sounds, so they’re transcribed differently.) So this isn’t really a ‘phonetic spelling of English’, though it’s easy to see where the confusion comes from.
I don’t. The complexity and history is what makes life fun, and makes English one of the most interesting languages read, written, and spoken today.
Similar to Imperial versus Metric. Yes the metric system is better (aside from temperature ranges - Fahrenheit is superior) but it’s not nearly as fun. Why 12 inches in a foot? Who knows!
You're changing the definition of "better" so you can have your cake (admit X is better than Y) and eat it too (prefer Y over X). It's ok to like fun things! It's ok to like Imperial and weird grammar, but then you can't also claim that you agree it's worse.
Sure I can. Didn't I just do it?
There is no reason why English would be more interesting to native speakers than German, Spanish, Finnish or Bulgarian to their respective native speakers.
Not that German is "pure" but even with words you don't know you got a fair chance of being right most of the time
A French kid can reasonably spell words they hear, even if there are a lot of unpronounced or apparently useless letters.
I've heard from a Chinese friend that the same is true for Mandarin. Apparently most written words have a "meaning" component and a "pronunciation" component (excepting the most common words, which are easy to learn by rote).
https://en.wikipedia.org/wiki/Chinese_character_classificati...
I've since heard interesting comparisons as well from native Mandarin speakers, where there is a certain formulaic piece missing in English that let's them form ideas easier. Interestingly in that case they would describe opposites too where there are certain groupings of ideas that they found easier to form in Mandarin that they had to invent words for in English to get across. But they could do so - invent those words.
I'm a native English speaker and only a third rate French and Spanish speaker, so I have to take this on faith for the most part, but it is interesting to hear that common view across origins.
To me, Finnish is the only interesting language in your list, due to not being related to the rest of the Nordic languages or Russian (Finno-Ugric). It’s also purely phonetic, which is nice, but what a shame about all those case endings. I am definitely biased as I learned Finnish as a teenager.
Might as well take this opportunity to plug Bill Bryson’s book The Mother Tongue: English and How It Got That Way, it’s a fun read.
I think that simplified English is the most likely candidate for a universal human language. The “Unicode of languages”, if you will.
Of course both scales can go above and below the numbers mentioned above, but if you are going to do things based on units of 10 you might as well be consistent.
Really both C and F are silly, the minute temperature was found out to have a zero point, they should have been discarded, and we should have gone full in on Kelvin(or Rankine if that is your jam), or even better a new unit that better integrates 0, a scale from 0 to the boiling point of water? and give water boiling 1000? or would it be better to give water freezing 1000? Triple cells are a common way to calibrate temperature.
Lots of divisible factors.
With that said, there are clearly some things that could be changed.
There are a wide variety of accents in Britain alone - ‘wahtah’ isn’t recognisable to me for example. It’s a cliche to say brits don’t pronounce “r”, but all you have to do is watch Harry Potter to see all the different accents that do.
(Not actually by Mark Twain)
"a constructed alphabet conceived as a way to provide simple, phonemic orthography for the English language to replace the inefficiencies and difficulties of conventional spelling"
English, a language with over 40 different dialects in its country of origin.
English, the official language of over 60 countries?
English, the bastard child of millenia of Roman, Germanic, and French colonization?
English, a language with documented vowel shift that occured over 200-300 years?
THAT language would be easier if the words were spelled how they were pronounced?
Or the "pin=pen" merger of some accents, or "caught=cot". No one speaks "English", it's an ugly compromise of an army of dialects.
Is there a black person here?
--James D. Nicoll
Thick lousiana accent? Southeast london accent? Boston southie accent? Mid-atlantic?
No matter what you choose, it won't be phonetic for someone
But good luck getting the UK or India to adopt it.
By “lowest common denominator“, I mean omitting affectations like non-rotic R (Boston) and dropped g (urban) and exaggerated length of selected vowels (US southeast) and constant syllable length and other kinds of things. There are lots of videos on YouTube about how accents work.
But mainly phonetics is about eliminating silent letters, using the most unambiguous letter when there is a choice (eg K or S instead of C), etc.
Anyway I’m rambling but you probably get my point.
I sometimes discover that I mispronounce words I know very well. The reason is that I never heard those words used audibly, I only encountered them in books. So I "invented" a phonetic sound to them, and long ago forgot that I did that. Until whammy, I encounter it verbally, decades later.
(What was that essay/joke called again?)
Interesting, because at least for the last two, "tag" is "day" and "weg" is "way" in German. Replacing both of the G's there make a more recognizably English equivalent word.
Also, "-ig" is the suffix most commonly associated with "-y". For example, "hungrig" is "hungry", which tracks.
Super interesting article. Also I think "cat in a barbershop run by monkeys" is my new favorite classical painting.
x is k followed by s
c is sometimes k, sometimes s
q is k
So all of those could be replaced by k and s. Why are they needed? :)
[1] https://teara.govt.nz/en/document/40118/new-zild-and-how-to-...
There wis a young lassie named Menzies,
That askit her aunt whit this thenzies.
Said her aunt wi a gasp,
"Ma dear, it's a wasp,
An you're haudin the end whaur the stenzies!
https://en.wikipedia.org/wiki/Menzies
o11c•4mo ago
mminer237•4mo ago
Ekshef•4mo ago
awesome_dude•4mo ago
Super key could arguably apply to the shift keys, because you are using a super set of letters (or am I reaching too far)
eucyclos•4mo ago
Agentlien•4mo ago
But, the meta key already exists and on Windows it is the Windows key. We're going around in circles.
1718627440•4mo ago
Agentlien•4mo ago
1718627440•4mo ago
Approaching an Apple keyboard for the first time, I naively thought Command would be Ctrl and was quite confused, since there also is Ctrl. But once you start a terminal it became quite clear that this was not the case. This is quite neat, it directly solves the SIGINT,SIGSTOP / Copy, Ctrl-D? confusion. Also having these operation as OS command means that all programs support them.
lmm•4mo ago
You're not though. The original letters are not available, so uppercase is not a superset of lowercase. (Unless you're just stating your opinion that the set of uppercase letters happens to be super)
SllX•4mo ago
magarnicle•4mo ago
hackernewds•4mo ago