Japanese is spoken by over 125 million speakers, and despite largely only being spoken in Japan, it is a very influential language on the international stage. Japanese may seem daunting, but by beginning your journey with this curriculum, you will be on the road to mastering it. To start, we will learn about the fundamentals of Japanese pronunciation.
In this first lesson on Japanese, our focus will be primarily on understanding the pronunciation of the language's vowel sounds. Vowel sounds, as we will see, are sounds like "ah" and "eh," and they contrast with sounds like "k" and "m," which are called consonants. It's rather impossible to showcase many words without using both vowels and consonants, but once we've covered the vowels, we'll move onto how to properly pronounce the consonants.
Use of Terminology: Japanese ≠ English
As obvious as it may be, Japanese is not English. You cannot assume that its sounds are the same as those in English. Creating an accurate perception of how Japanese sounds requires some terminology. However, explanations are always given when new terminology is used.
Lesson Note: Japanese words will be transcribed in this lesson with the English alphabet. This practice is called Rōmaji. There is nothing special about English letters when used in Japanese other than what their intended pronunciations are.
In Standard Japanese--the standard form of the language any Japanese speaker will understand and the form that you are beginning to learn--there is what is called a pitch accent system. In this system, every mora of a phrase is assigned a pitch. This assigned pitch may either be high or low. The assignment of pitch doesn't fundamentally change the meaning of words. The pitch accent system is simply an acoustic observation that helps describe how phrases sound. Incidentally, because there is an audible difference between a mora that is low in pitch and a mora that is high in pitch, words are occasionally distinguished via pitch. However, context can easily tell what meaning is meant when two or more words are otherwise pronounced the same, putting pitch differences aside.
In Standard Japanese, there are four pitch contours that a phrase can have. Regardless of how short or long a phrase is, the pitch contour will always be one of the following four patterns. Think of a phrase as being a pitch roller coaster. Every mora isn't an individual ride with its own loops and turns. Rather, a mora is only one loop of a ride--nothing more. You must put the loops (morae) together to see the course of the ride.
1. "L" and "H" both stand for a single mora. That means H-L is two morae, whereas H-L-L is three morae. As a reminder of this, numbers will be placed after these contour notations to tell you how many morae words involved have.
2. The "L" and "H" in parentheses indicate what the pitch of something attached to words would be per pattern.
|1|| Pitch is high for the first mora, drops on the second mora, and stays low for any remaining morae that follow.|
Ex. H(-L) ①, H-L(-L) ②, H-L-L(-L) ③, H-L-L-L(-L) ④
|2|| Pitch starts low on the first mora, peaks at high pitch on the middle mora(e), drops back to low pitch on the third morae, and stays low for any following morae after the word. |
Ex. L-H-L ③, L-H-H-L ④
|hanásu (to speak)|
|3|| Pitch starts low on the first mora, peaks at high pitch on the last mora, and then drops to low pitch on any morae that follow the word.|
Ex. L-H-(L) ②, L-H-H(-L) ③
| hàshí (bridge)|
|4|| Pitch starts low on the first mora, becomes high pitch on the second mora, and then the pitch stays high even once the word is over unto anything that follows. |
Ex. L(-H) ①, L-H(-H) ②, L-H-H(-H) ③, L-H-H-H(-H) ④
Transcription Note: In this lesson, morae with a high pitch will be in bold. If the pitch were to fall directly after the word in question, a ↓ arrow will follow to indicate this. What this all means will be explained at the end of the lesson.
Below you will see how the five vowels of Japanese are roughly pronounced. The vowels are pronounced clearly and sharply like the American English approximates provided. However, it cannot be stressed enough that these are approximates.
|A||Like the "a" sound in the word "buy."||Ta↓ (field)|
|I||Like the "i" in "police."||Ki↓ (tree)|
|U||Like the "oo" in "mood." Compress your lips without protruding them.||Uta↓ (song)|
|E||Like the "e" in "set."||Ike↓ (pond)|
|O||Like the "o" in "oh."||Oka (hill)|
Although the chart says that /u/ is like the "oo" in the word "mood," this isn't quite accurate. There is no form of English that has the /u/ found in Standard Japanese. However, by compressing your lips rather than protruding them forward, the resulting /u/ will be something like the one in Japanese.
Although the other vowels are almost identical to the ones found in American English, the Japanese /a/ actually only shows up in diphthongs in American English. A diphthong is when two vowels blend together to form a complex vowel sound. You start off pronouncing one vowel sound but at the end it sounds like something else. For example, the vowel sound in the word "height" is an example of a diphthong, and the onset of this word is exactly how the Japanese /a/ is pronounced.
In English, complex vowel sounds called diphthongs are created by beginning a vowel sound with one quality but ending the sound with a different vowel sound. For instance, in the word "kite," the vowel sound written with the letter "i" starts out as sounding like the Japanese /a/ but ends sounding like the Japanese /i/. The opposite of a diphthong is a monophthong, which is a vowel whose quality doesn't change during its pronunciation. This is what all vowels in Japanese are thought to intrinsically be.
In Japanese, diphthongs are said not to exist because of how the moraic structure of the language dictates how sounds are organized. Instead of viewing something like hai (yes) as one syllable, you view it as two morae: /ha/ + /i/. However, there are plenty of instances in which Japanese speakers pronounce consecutive vowels similarly to how they would be in English.
Acoustically, juxtaposed vowels sound like they blend together. However, native speakers still conceptualize them as being separate. This is because pitch can fall or rise without the need of consonants from mora to mora, and if two vowels next to each other count as two morae, then there is room for pitch changes.
Even in words just composed of vowels, pitch contours cannot be ignored, as is demonstrated below.
| Love/indigo||Ai||To meet||Au|| Ue↓||Starvation|
Exception Note: The word iu is pronounced as /yuu/. Exceptions like this, though, are few and far between.
In Japanese, short vowels are distinguished from long vowels. A short vowel is a vowel utterance equal to one mora in length. If a vowel is elongated to take up two morae, it becomes a long vowel. Pitch can consequently rise or drop inside long vowels because they're treated as two morae.
Consequently, vowel length contrasts thousands and thousands of words. Mistakes at the beginning are inevitable, but recognizing distinctions like this now will spare you a lot of potential heartache.
|Short /a/||Obasan (aunt)||Long /a/||Obaasan (grandma)|
|Short /i/||Ie↓ (house)||Long /i/||Iie (no)|
|Short /u/|| Yuki↓ (snow)||Long /u/||Yuuki (courage)|
|Short /e/||E↓ (painting)||Long /e/||Ee (yes)|
|Short /o/||To (door)||Long /o/||Too (ten things)|
Pronunciation Note: Do not pronounce "oo" as a long /u/ sound. This is incorrect!
False Long Vowels
What makes a long vowel truly a long vowel and not just the same vowel next to each other is there being nothing that obstructs the pronunciation of the vowel as it spans two morae. In both English and Japanese, the pronunciations of vowels begin with glottal stops. Whenever you say the phrase "uh-oh", you should feel an audible release of air after completely stopping airflow from the glottis (Adam's apple) at the start of "uh" and "oh."
In Japanese, a long vowel will always be inside a single element of a word. If one element of a word only has a short vowel but is followed by another word element beginning with the same vowel, that second element's vowel is still going to begin with a glottal stop like any other initial vowel sound. This acoustically interrupts what would otherwise be a long vowel. Unsurprisingly, in the context of Japanese, this glottal stop insertion is influential enough to change the pitch contour of a phrase and make phrases even more distinct--and not just from an etymological standpoint.
Transcription Note: In the words below, to show where elements of a word begin and end, periods will be inserted to indicate these boundaries.
|Scene||Shiin||Consonant/Cause of death||Shi.in|
Trivia Note: Vowels are called boin in Japanese.
The Pronunciation of "Ei": [ei] or [ē]
In Japanese, the vowel combination "ei" is usually pronounced as a long /e/ (ē). All such words come from Chinese roots. Because this sound change is technically optional, you don't have to worry so much about whether to pronounce an "ei" as [ei] or [ē]. After all, we haven't even learned about what exactly words made from Chinese roots look like. For this lesson, alternative pronunciations of a word are listed for you.
1. Long /e/ are written as "ee" so that pitch contours can be designated. However, do not be confused by this spelling and pronounce "ee" as a long "i" sound as would be the case in English words such as "cheese." This is incorrect, and so try to stay focused on what is going on in Japanese.
2. To show where elements of a word begin and end, periods will be inserted to indicate these boundaries. "Ei" can only be pronounced as [ē] if it's within the same element of a word, so these boundaries are very important.
3. If a word is not derived from Chinese roots, the pronounce [ē] becomes impossible even if the vowel combination is found in a single word element.
|Clock||To.kei||Correct answer|| Sei.kai|
The Pronunciation of "Ou": [ou] or [ō]?
In Japanese, the vowel combination "ou" is usually pronounced as a long /o/ (ō). Most such words come from Chinese roots, but this is not always the case. This sound change, unlike the one above, is not optional for the words it affects.
Because knowing which words are and aren't affected is a luxury that comes about from knowing a lot word origins. For this lesson and the next, any word in which a word that would be spelled as "ou" but is instead pronounced as a long /o/ will be spelled as "oo." This means if you do see "ou," you should pronounce it literally as such. Try not to read "oo" as a long "u" sound as this is incorrect.
Transcription Note: To show where elements of a word begin and end, periods will be inserted to indicate these boundaries.