April 28, 2026 · 16 min read

Chinese Tones Explained: A Practical Guide to Mastering All Four Tones (Plus the Fifth)

A practical guide to Mandarin's four tones and the neutral tone — what they sound like, why they matter, which ones English speakers confuse, and how to actually practice them.

Tones & PronunciationGuide

If you've ever told someone you're learning Mandarin, you've probably gotten some version of the same reaction: a wince, then "Isn't Chinese a tonal language? That sounds impossible." Maybe you've said it to yourself. The tones get a reputation somewhere between "tricky" and "the reason I quit on Duolingo."

I want to take a little air out of that fear, because it deserves less than it gets.

You already use tone every day. When you say "Really?" with your pitch rising, that's a tone. When you say "Really." flat and unimpressed, that's a different tone. When your mother says your full name with a sharp drop on the last syllable, that is, unmistakably, a tone — and you understood it before you could read. English speakers use pitch movement constantly. They just use it to carry attitude: surprise, sarcasm, finality, doubt. Mandarin takes those same pitch shapes and assigns them a second job: the pitch shape is part of the word itself.

That's the real adjustment. Not learning to make new sounds — you can already make them — but learning to hear pitch as information, the way you currently hear vowels and consonants. Once that click happens, tones stop feeling exotic and start feeling like a normal feature of the language. This guide is about getting you to that click.

What Tones Actually Do in Mandarin (and Why They Matter)

In Mandarin, a syllable without a tone isn't really a full word. It's more like a half-assembled ingredient. The classic example — and there's a reason every textbook uses it — is the syllable ma:

  • 妈 mā (first tone) — mother
  • 麻 má (second tone) — hemp, or the feeling of "pins and needles"
  • 马 mǎ (third tone) — horse
  • 骂 mà (fourth tone) — to scold, to curse at someone
  • 吗 ma (neutral tone) — a question particle, like a spoken question mark

That's five different words from the same consonant-vowel combination. Chinese kids grow up with the joke sentence 妈妈骑马,马慢,妈妈骂马 (māma qí mǎ, mǎ màn, māma mà mǎ) — "Mom rides a horse, the horse is slow, Mom scolds the horse." A string of ma sounds, four of the tones, one small domestic drama.

The ma set looks like a party trick, but the pattern is everywhere:

  • 买 mǎi (third tone) — to buy
  • 卖 mài (fourth tone) — to sell

These two are next to each other on shop signs and mean opposite things. Foreign learners mix them up constantly. I once watched a tourist in Beijing cheerfully announce, "我要卖这个" ("I want to sell this") when he meant "I want to buy this." The shopkeeper froze for a few seconds trying to figure out whether this guy was proposing a business arrangement. (I've collected more stories like this one — some of them are genuinely spectacular.)

Another one:

  • 睡觉 shuìjiào (fourth + fourth) — to sleep
  • 水饺 shuǐjiǎo (third + third) — boiled dumplings

Same syllables, different tones, completely different evening. I've heard that mix-up in real life more than once. It's funny the first time, less so when you're tired and hungry and nobody can tell what you want.

The point is this: in English, if you say "cat" with odd intonation, people still know you mean cat; they just think you're being dramatic. In Mandarin, if you say mā with the wrong pitch shape, you're not saying "mother" oddly — you're saying a different word. Tones aren't expressive decoration on top of the word. They're part of the word's spelling, except the spelling lives in your voice.

That reframe tells you how to practice. Don't practice tones the way an actor practices emotion. Practice them the way you practice vowels — a fixed shape that has to come out right every time, regardless of how you feel.

The Four Tones (and the Neutral One Everyone Forgets)

Let's go through them one by one. I'll give you the pitch shape, an analogy that should feel natural to an English speaker, and a handful of real words so the tone has something to attach to in your memory.

Before we start, a small visual aid: Mandarin pitch is usually described on a 5-level scale, where 1 is the bottom of your comfortable range and 5 is the top. Each tone is a pattern across those levels. (If you're also learning the Pinyin system, the tone marks above vowels — ā á ǎ à — are visual reminders of these shapes.)

First Tone — The Steady High Note

Pitch shape: Held high and flat. Think 5-to-5. No movement.

English analogy: The sound the doctor asks you to make when they're looking in your throat — "aaaahhhh" — held level. Or the long "hmmmmm" you make when you haven't decided what to order. The key is steady: no drift up or down. English speakers often let it drift, because a flat monotone feels robotic or rude in English. In Mandarin, flat is the goal.

Examples:

  • 妈 mā — mother
  • 高 gāo — tall, high
  • 喝 hē — to drink
  • 天 tiān — sky, day
  • 猫 māo — cat

If your first tone is sagging at the end (many learners do this, because English falls at the end of a sentence), try exaggerating it higher than you think necessary. You want to sound a little like you're singing one stubborn note.

Second Tone — The Question

Pitch shape: Starts in the middle of your range, rises clearly to the top. Think 3-to-5.

English analogy: The pitch shape of a one-word question. "What?" when you didn't quite hear. "Me?" when you're surprised you got called on. "Huh?" when you've just been told something mildly insulting. That upward sweep — that's your second tone. It should feel like genuine inquiry, not a polite uptick.

Examples:

  • 麻 má — hemp / numb
  • 来 lái — to come
  • 学 xué — to study, to learn
  • 人 rén — person
  • 十 shí — ten

A warning: the second tone is not just "slightly rising." It's a real rise. If you underplay it, it flattens into something that sounds like the first tone, and then 十 (ten) starts sounding like 失 (shī, to lose). If you overplay it, you still sound more or less correct, so when in doubt, rise more.

Third Tone — The Misunderstood One

Pitch shape: This is the tone textbooks lie to you about. They'll tell you it "dips down and then comes back up," drawing you a smooth U-shape on the page. In actual spoken Mandarin, most of the time the third tone is just low. It dips down and stays there — what linguists call the "half third." The full rising-back-up version mostly shows up when the syllable is said in isolation, slowly, or at the end of a sentence.

So here's the honest version: when a native speaker says a third-tone syllable in normal conversation, their voice drops to the bottom of its range and kind of stays parked there.

English analogy: The low, slightly growly "uhhhh" you make when someone asks you a question you don't want to answer. Or a very reluctant "yeahhh." It should almost feel like your voice is settling at the bottom of a chair.

Examples:

  • 马 mǎ — horse
  • 好 hǎo — good
  • 我 wǒ — I, me
  • 你 nǐ — you
  • 小 xiǎo — small

If you've been practicing the third tone as a big dramatic U-shape, your Mandarin will sound like a cartoon. Let it be low. Save the rise for when the syllable is genuinely alone.

Fourth Tone — The Order

Pitch shape: Starts at the top, drops sharply to the bottom. Think 5-to-1. Fast and firm.

English analogy: The way you say "No." when you really mean it. The "Stop!" you shout when someone's about to walk into traffic. The exasperated "Fine." at the end of an argument. Strong fall, short distance, full commitment.

Examples:

  • 骂 mà — to scold
  • 卖 mài — to sell
  • 去 qù — to go
  • 是 shì — to be
  • 对 duì — correct, right

English speakers often soften the fourth tone because sounding that forceful feels impolite. It isn't impolite in Mandarin. A well-pronounced fourth tone is just how the word is shaped. If you feel like you're being a little rude, you're probably doing it right.

The Neutral Tone — The Afterthought

Pitch shape: Short, light, unstressed. It leans on whatever tone came before it and takes whatever pitch is convenient.

English analogy: The unstressed second syllable in English words like "sofa" or "table" — you don't hit it, you barely land on it, it's just there. The neutral tone is the Mandarin equivalent of that casual, throwaway syllable.

Examples:

  • 妈妈 māma — mom (the second ma is neutral)
  • 谢谢 xièxie — thank you
  • 朋友 péngyou — friend
  • 椅子 yǐzi — chair
  • 吗 ma — question particle, as in 你好吗? (How are you?)

Beginners sometimes treat the neutral tone as a secret fifth tone with its own shape. It really isn't. Its main feature is not being stressed. If you pronounce 妈妈 as two crisp first tones — MĀ MĀ — it sounds oddly formal, like you're calling your mother to testify in court. The second syllable should be light and short, almost swallowed.

Tone Change Rules You Need Early, Not Late

Here's something that annoys me about how tones are usually taught: textbooks spend three chapters drilling you on isolated syllables, and then months later, in small print, mention that actually, when tones come next to each other, they change. By that point you've built muscle memory on a version of the language that doesn't quite exist. Better to know the main rules from the start.

Third Tone Sandhi — Two Third Tones in a Row

When you have two third-tone syllables next to each other, the first one shifts and is pronounced as a second tone. Written pinyin usually doesn't reflect this — the tone marks stay the same on paper — but in speech, it changes.

The most famous example: 你好 is written nǐ hǎo, but it's pronounced ní hǎo. Both syllables are marked third tone, but you say the first one with the rising pitch of a second tone.

More examples:

  • 我很好 (wǒ hěn hǎo, "I'm fine") → spoken as wǒ hén hǎo
  • 请你 (qǐng nǐ, "please help me") → qíng nǐ
  • 可以 (kěyǐ, "can / may") → kéyǐ

Why does this happen? Because saying two full low-dropping third tones back-to-back is physically awkward, even for native speakers. The language just smooths it out. You don't need to memorize this as a conscious conversion — if you imitate native speakers, you'll pick it up naturally. You just need to know it exists, so you're not confused when 你好 doesn't sound like the third tones you were taught.

不 (bù) and 一 (yī) — The Shape-Shifters

Two characters, 不 ("not") and 一 ("one"), change their tones depending on what comes after them. These appear constantly, so you'll hear the changes early.

不 (bù) is normally fourth tone, but when followed by another fourth tone, it shifts to second tone (bú):

  • 不好 bù hǎo — not good (stays fourth)
  • 不来 bù lái — not coming (stays fourth)
  • 不是 bú shì — is not (shifts, because 是 is fourth)
  • 不去 bú qù — not going (shifts, because 去 is fourth)

一 (yī) is even more of a chameleon:

  • Alone, counting, or at the end of a number: first tone — . 一 (one), 第一 dì yī (first).
  • Before a fourth tone: second tone — . 一个 yí gè (one piece), 一样 yí yàng (the same).
  • Before first, second, or third tones: fourth tone — . 一天 yì tiān (one day), 一年 yì nián (one year), 一起 yì qǐ (together).

There's a pattern underneath: 不 and 一 are trying to avoid colliding with whatever tone comes next. The language is doing phonetic traffic control.

You don't need to drill these rules with flashcards. Drill the phrases as phrases — 不是, 一个, 一起 — and the correct pronunciation comes along for free. The reason to know the rules is so you're not thrown when a textbook writes bù shì but your teacher says bú shì. Both are correct: one is the dictionary form, the other is the spoken form.

The Tones English Speakers Actually Confuse

In my experience listening to learners, certain tone confusions show up over and over. Knowing which ones you are probably getting wrong is more useful than drilling all four equally.

Second tone vs. third tone. The big one. English speakers often produce a lazy rising tone for both, so 麻 (má) and 马 (mǎ) sound nearly identical. The fix is counterintuitive: don't focus on making your second tone rise more. Focus on making your third tone lower and flatter — let it sag to the bottom of your voice and stay there. Once the third tone sits low, the second tone's rise becomes obvious by contrast.

First tone vs. fourth tone. English sentences tend to end with falling intonation, so when learners try to say a first tone at the end of a word, it drifts downward and starts sounding like a fourth tone. The fix: when practicing first tones, slightly raise the end of the syllable as a counterweight. It will feel strange. You're correcting a lifelong habit.

Pulled-punch fourth tones. Learners who feel self-conscious about sounding aggressive water down their fourth tones. The clean drop becomes a polite dip, and a weak fourth tone often gets misheard as a first tone. The fix: over-commit. Let the fall go all the way down. You're not being rude, you're being legible.

Upgraded neutral tones. English speakers are used to pronouncing every syllable, so they say 谢谢 as xiè-XIÈ — two full fourth tones — when the second should barely register. The fix is mental: remind yourself that the neutral syllable is allowed to be small.

How to Actually Practice Tones (Not How Textbooks Tell You To)

Okay — how do you actually build this skill so it sticks? Here's what works, based both on what I've seen learners do well and on how Chinese kids are drilled in elementary school.

Exaggerate, then shrink. When you're learning a tone, produce it more extreme than feels natural. Bigger drop, higher hold, deeper low. Once your mouth and ear can find the exaggerated version reliably, start dialing it back toward normal. Trying to start at "natural" first is how you end up with mushy tones that no one can distinguish. We built a free tone trainer on this site specifically so you can hear the exaggerated model, try to match it, and see how close you got — which is the kind of feedback loop you really can't get from a textbook.

Train in tone pairs, not single syllables. This is maybe the most undervalued tip in tone practice. In real Mandarin, syllables almost never appear alone. A fourth tone after a first tone sounds different in your mouth than a fourth tone after a third tone. There are only sixteen two-tone combinations (four tones × four tones), and if you drill those, you cover the shapes that almost all real spoken Mandarin actually uses. Pick a pattern — say, second + fourth — and collect words that fit it: 学校 xuéxiào (school), 回去 huíqù (go back), 农业 nóngyè (agriculture). Then switch to another pair tomorrow.

If you do nothing else, do these first two.

Record yourself, then wait a day. Your ear is worse than you think in the moment. Record a sentence, don't play it back immediately, and come back the next day to compare with a native recording. The delay matters: right after speaking, your brain is still remembering what you meant to say, not what you did say. A day later, you hear your recording the way a stranger would.

Use rhythm and rhyme. Mandarin tones get partially flattened when sung (melody overrides pitch), but rhythm drills, rhymes, and tongue twisters still anchor tones in your memory. The classic 四是四,十是十 (sì shì sì, shí shì shí) — "four is four, ten is ten" — forces your mouth to switch cleanly between fourth and second tones, and it's a surprisingly effective workout.

Shadow native speech. Pick a short clip of clear Mandarin — a podcast intro, a news sentence, a line from a drama — and say it in unison with the speaker, matching pitch as closely as you can. Don't translate, don't analyze, just mimic. This trains your ear and your mouth together, which is what you actually need.

Let your ear lead your mouth. If you can't hear the difference between two tones, you can't reliably produce it or correct yourself. Spend time on pure listening drills — two syllables, same consonant and vowel, different tones, guess which is which. It's boring. It also works faster than almost anything else.

A Note Before You Go

Tones aren't the mountain they look like from the trailhead. They're a habit — an unfamiliar one, but a habit — and habits respond to repetition and honest feedback, not willpower. The learners I've seen do well with tones aren't the ones with musical backgrounds or "a good ear." They're the ones who treat tones as part of the word, practice in pairs and short phrases instead of single syllables, and don't panic when they slip up in real conversations. Shopkeepers have heard worse.

If you want a structured way to drill what you've just read, the tone trainer on this site runs through single tones, minimal pairs, and two-tone combinations, with native audio and instant feedback on what you actually produced. It's free and focused on exactly this one problem.

Start low-stakes. Get your ear comfortable first; the mouth follows. And the next time someone winces and says "but Chinese has tones," you'll know something they don't: so does English. We just kept ours for drama.

Tones are one piece of the pronunciation puzzle. The other piece is the sounds themselves — the consonants and vowels that Pinyin maps out for you. And if it helps to know: Chinese grammar is dramatically simpler than what you're used to in European languages, so the effort you spend on tones now is effort you won't need to spend on verb tables later.