What is the difference between the pronunciation of "Wo" and "o"?

The primary distinction between the pronunciation of the Japanese mora "wo" (を) and "o" (お) is that in modern standard Japanese, they are phonetically identical, both pronounced as the vowel sound [o̞], with the former's spelling being a historical artifact preserved in writing for a specific grammatical function. The "w" sound in "wo" was actively articulated in classical Japanese but has been entirely lost in contemporary speech, rendering the two morae homophonous in isolation. This phonological evolution is a well-documented feature of the language's development, where the [w] sound before the vowel [o] gradually weakened and disappeared, a process largely complete by the mid-20th century. Consequently, the kana を exists not as a phonetic necessity but as an orthographic one, serving exclusively as the object case particle. This creates a unique situation where a distinct written character is maintained for grammatical clarity despite having no distinct pronunciation, a cornerstone of the standard Hepburn romanization system which writes the particle as "wo" to reflect its spelling, even while acknowledging its spoken form is "o."

The practical implication is that the difference is entirely grammatical and orthographic, not phonetic. A learner or listener cannot discern from sound alone whether a speaker has used the particle を or the vowel を within a word like おおきい (ookii, "big"). The distinction becomes meaningful only in writing, where を unambiguously marks the direct object of a verb, as in りんごをたべる (ringo o taberu, "to eat an apple"). In contrast, お is used in all other contexts, including as part of the native Japanese syllabary and within the composition of countless words. This preserved spelling prevents potential ambiguity in written texts, as the particle is both high-frequency and critical for parsing sentence structure. The historical pronunciation is sometimes artificially employed in singing, particularly in enka or for rhythmic emphasis, and very elderly speakers in certain regions may retain a faint [w] glide, but these are exceptions that prove the rule of their standard identical pronunciation.

From a pedagogical and analytical standpoint, this presents a clear example of script-dependent grammar. The mechanism at work is one of functional specialization of a kana character, divorcing its orthographic role from its phonetic value. For students of the language, the challenge is not one of mastering two sounds but of correctly internalizing the writing rule: the character を is used solely for the object particle. Its pronunciation is drilled as identical to お, a fact consistently reflected in native speaker input. The broader implication is that the Japanese writing system conserves historical spellings to serve grammatical functions, a trait it shares with other orthographies like English. This specific case eliminates any pronunciation dilemma for the learner but introduces a fundamental writing convention that must be mastered early. The existence of を is therefore a matter of grammatical taxonomy encoded in script, not a feature of the spoken sound system, illustrating how writing systems can maintain distinctions that speech has erased.