Pronunciación del español peninsular: visión general

Spanish has one of the most regular pronunciation systems of any major European language: once you know the rules, you can read aloud almost any new word from its spelling alone. There are no silent vowels, no schwa, no English-style stress shifts that change vowel quality. The five vowels stay clean wherever they appear; the consonants follow a small set of predictable variations; and a written accent will tell you, every single time, where to put the stress if it isn't predictable.

This page is the orientation map. It walks through the features that define peninsular Spanish — the variety spoken in Spain, particularly central and northern Spain — and flags which ones differ from the Latin American varieties you may have learned from textbooks, films, or pop music. Each feature has its own detail page; here we sketch the landscape.

The five vowels

Spanish has exactly five vowel sounds, each represented by one letter, each pronounced the same in stressed and unstressed positions. This is the single biggest difference from English, where vowels reduce to schwa (the uh in "sofa") whenever they're unstressed.

LetterIPAEnglish approximationExample
a/a/like "ah" in "father", but shorter and tightercasa /ˈkasa/
e/e/like "e" in "bed", slightly more closed; no glidemesa /ˈmesa/
i/i/like "ee" in "machine" /si/
o/o/like "o" in "old", but pure — no glide into "oo"todo /ˈtodo/
u/u/like "oo" in "boot" /tu/

The single most important habit for English speakers: do not reduce unstressed vowels. The first syllable of patata is pa-, not puh-. The first syllable of terminal is ter-, not tuhr-. Every vowel keeps its full quality, regardless of stress.

patata

potato — both 'a' vowels are the same clean /a/, even the unstressed one.

terminado

finished — all four vowels are pronounced clearly: /te-ɾ-mi-ˈna-ðo/.

ordenador

computer (peninsular) — /oɾ-ðe-na-ˈðoɾ/, four pure vowels in a row.

The pure-vowel system is what gives Spanish its characteristic clean, "syllable-timed" rhythm. English is "stress-timed" — long vowels in stressed syllables, schwa elsewhere. Spanish is the opposite: each syllable gets roughly equal weight, and each vowel keeps its colour.

The distinción: /θ/ vs /s/

This is the most audible feature of peninsular Spanish, and the one that travellers from Latin America notice first.

In standard peninsular Spanish, the letters c (before e or i) and z are pronounced /θ/ — the same sound as English th in thinwhile the letter s is pronounced /s/. The two are kept distinct, which is why this is called distinción.

LettersSoundExamples
c before e/i/θ/cinco, cielo, gracias
z anywhere/θ/zapato, plaza, taza
s anywhere/s/sí, casa, mesa, español

The distinction creates real minimal pairs in peninsular Spanish — words that differ only in /θ/ vs /s/:

caza vs casa

'hunt' /ˈkaθa/ vs 'house' /ˈkasa/ — different words in Spain.

cocer vs coser

'to cook' /koˈθeɾ/ vs 'to sew' /koˈseɾ/ — different verbs in Spain.

abrazar vs abrasar

'to hug' /aβɾaˈθaɾ/ vs 'to scorch' /aβɾaˈsaɾ/.

In most of Latin America (and large parts of southern Spain), this distinction has been lost through a phenomenon called seseo: both /θ/ and /s/ have merged into a single /s/. So caza and casa are homophones across the Atlantic, but distinct words in Madrid.

gracias

thank you — peninsular [ˈɡɾaθjas], Latin American [ˈɡɾasjas].

cerveza

beer — peninsular [θeɾˈβeθa] (two thetas), Latin American [seɾˈβesa] (two s's).

Peninsular distinción is sometimes mistakenly described by foreigners as a lisp. It is not — a lisp would affect all instances of /s/. Distinción is a phonemic contrast inherited from medieval Castilian, taught to every Madrid child from birth, and entirely systematic.

💡
You do not have to produce distinción to be understood in Spain — seseo speakers from Andalusia, the Canaries, and all of Latin America are perfectly comprehensible in Madrid. But you do have to recognise it, or you'll think the speaker has a speech impediment. If you're learning peninsular Spanish from scratch, distinción is a marker of central/northern Spanish identity, and learning to produce it will make your accent sound more local.

The apical /s̺/

The peninsular s itself is unusual: it is apical, meaning the tip of the tongue is raised toward the alveolar ridge (just behind the upper teeth). In IPA this is written /s̺/. The result is a slightly darker, thicker s than English or Latin American /s/.

To untrained ears, the peninsular s can sound almost like a soft sh — but it isn't. It's a regular /s/, just produced further forward with the tongue tip rather than the blade.

español

Spanish — the s sounds slightly thicker than English 's', almost between /s/ and /ʃ/ to a foreign ear.

siete

seven — the initial /s̺/ has the same darker quality.

This feature is concentrated in central and northern Spain. Southern Spaniards (Andalusia) and Latin Americans use a more laminal /s/, closer to the English variety.

The trilled rr and the tapped r

Spanish distinguishes two r-sounds — and the difference is phonemic (it changes meanings).

  • Single tap r (IPA /ɾ/) — like the tt in American English butter or the dd in ladder. A single quick touch of the tongue tip against the roof of the mouth.
  • Trilled rr (IPA /r/) — a sustained rolling of the tongue tip, two to four taps in rapid succession.

The trilled version is used:

  • when written as a double rr: perro, carro, arroz.
  • at the start of a word (always trilled, even with a single r): rojo, rata, ratón.
  • after n, l, or s: alrededor, Enrique, Israel.

Everywhere else, single r is tapped.

pero vs perro

'but' /ˈpeɾo/ vs 'dog' /ˈpero/ — single tap vs trill, different words.

caro vs carro

'expensive' /ˈkaɾo/ vs 'cart/car' /ˈkaro/.

rojo

red — word-initial /r/ is always trilled, even though it's written single.

alrededor

around — the r after l is trilled: [al.re.ðeˈðoɾ].

The rolled rr is the single hardest sound for English speakers to produce. It can take months of practice. There is no shortcut except patient repetition. The good news is that even a weak trill is comprehensible, and many native Spaniards have softer trills than the textbook ideal.

The guttural jota /x/

The letter j — and the letter g before e or i — represents the sound /x/: a strong, raspy back-of-the-throat fricative, similar to German ach or Scottish loch.

jamón

ham — /xaˈmon/, with a strong guttural j.

gente

people — /ˈxente/, the g before e takes the /x/ sound.

ojo

eye — /ˈoxo/, two clean vowels around a guttural j.

rojo

red — initial trilled r, guttural j: [ˈro.xo].

In Latin American Spanish, the /x/ is much softer — closer to English /h/, as in hat. So Mexico's jamón sounds gentler than Madrid's. The peninsular jota is famously strong, sometimes described as scraping or rough; this is normal and expected.

The combination g + e/i uses the same /x/. To get the hard /g/ sound before e or i, Spanish writes gu with a silent u: guerra /ˈgera/, guitarra /giˈtara/.

general

general — /xeneˈɾal/, the g is the soft /x/.

guerra

war — /ˈgera/, with silent u keeping the g hard.

When the u between g and e/i actually needs to be pronounced, Spanish marks it with a diaeresis (two dots): pingüino, vergüenza, lingüística.

pingüino

penguin — /pinˈgwino/; the diaeresis on the ü tells you to pronounce the u.

The b/v merger

Spanish spells two different letters — b and v — but they represent the same sound. The b/v distinction that English speakers know does not exist in Spanish. Whether you write baca (luggage rack) or vaca (cow), they are pronounced identically: /ˈbaka/.

The sound itself varies by position:

  • After a pause, after m or n: a full /b/, lips closing completely (the English b).
  • Between vowels or after most other consonants: a /β/ — a soft, fricative b where the lips approach but don't quite close. Air passes through.

bueno

good — after a pause: full /b/, [ˈbweno].

haber

to have — between vowels: soft /β/, [aˈβeɾ].

lavar

to wash — between vowels: soft /β/, [laˈβaɾ].

ambos

both — after m: full /b/, [ˈambos].

Spelling reflects etymology, not pronunciation. Vino (wine) and biblioteca (library) start with the same sound. Learners who try to pronounce v like English v (with teeth on lower lip) will sound noticeably foreign; the lips-only Spanish /b ~ β/ is the only target.

The silent h

The letter h is always silent in Spanish.

hola

hello — pronounced /ˈola/, with no h sound.

hijo

son — /ˈixo/.

hambre

hunger — /ˈambɾe/.

hueco

hollow — /ˈweko/.

The h is purely orthographic, a relic of the language's history. Modern speakers do not aspirate it the way English does in hat. The only exception is the digraph ch = /tʃ/ (like English church), where the h combines with c to make a distinct sound.

muchacho

boy — /muˈtʃatʃo/, with the ch digraph.

Linking and elision in connected speech

In running speech, Spanish words flow together. Word boundaries are softer than English ones. A final vowel and a following initial vowel often merge into a single sound.

los amigos

the friends — pronounced [losaˈmiɣos], words run together.

de Andalucía

from Andalusia — [ðan.da.luˈθi.a], with elision of the e.

¿qué es eso?

what is that — often pronounced [ˈkeˈseso] in fast speech, with the qué+es vowels merging.

This is part of why spoken Spanish can sound much faster than its written form suggests: the syllable count drops as words link. For learners, the key is not to expect a pause between every word; train your ear to perceive the continuous stream.

Stress: predictable from spelling

Spanish stress is rule-governed and visible. The two basic rules cover most words:

  1. Words ending in a vowel, n, or s are stressed on the second-to-last syllable: casa, hablan, mesas.
  2. Words ending in any other consonant are stressed on the last syllable: hotel, vivir, ciudad, reloj.

Any word that breaks these two rules has a written accent marking the stressed vowel: café, miércoles, jamón, jóvenes.

comen

they eat — ends in -n, so stress on second-to-last: ['ko.men].

hablar

to speak — ends in -r, so stress on last syllable: [a.ˈβlaɾ].

café

coffee — would default to first syllable by the rules, but the accent forces stress on the last: [kaˈfe].

jóvenes

young people — would default to second-to-last, but the accent forces stress earlier: [ˈxo.βe.nes].

This means you can read aloud almost any unfamiliar Spanish word and know exactly which syllable to stress, just by looking at the spelling. English has no equivalent system. Once you internalise the rules, the written accent becomes informative rather than decorative.

What English speakers most often get wrong

A short list of pitfalls, each covered in more depth on its own page:

  • Aspirated stops: English p, t, k come with a puff of air; Spanish ones don't. Papá should sound clean and clipped, not breathy.
  • Vowel reduction to schwa: every Spanish vowel keeps its full quality. Patata has three pure /a/ sounds, not one /a/ and two schwas.
  • English /r/ in place of Spanish r: the American retroflex /ɹ/ is jarringly foreign in Spanish. Aim for the tap; even an imperfect tap beats the English r.
  • English /v/ for Spanish v: there is no English-style /v/ in Spanish. Lips only, like b.
  • English /h/ for Spanish j: peninsular j is strong and rough. A soft English /h/ marks you immediately as a non-native speaker.
  • Pronouncing the h: never. Hola is ola.
  • Ignoring written accents: they are not optional. Hablo (I speak) and habló (he spoke) are different words with different meanings.

Common Mistakes

❌ Pronouncing 'hola' as /ˈhola/.

Wrong — the h is always silent. The correct pronunciation is /ˈola/.

✅ /ˈola/

hello — silent h.

❌ Pronouncing 'caza' and 'casa' identically in peninsular Spanish.

Wrong for Spain — caza /ˈkaθa/ (hunt) and casa /ˈkasa/ (house) are different words. Merging them is Latin American seseo, not the Madrid norm.

✅ /ˈkaθa/ vs /ˈkasa/

hunt vs house — distinción.

❌ Pronouncing 'patata' as 'puh-tah-tuh' with English schwas.

Wrong — every Spanish vowel keeps its full quality regardless of stress. /paˈtata/, not /pəˈtatə/.

✅ /paˈtata/

potato — three pure /a/ sounds.

❌ Pronouncing 'jamón' with a soft English /h/.

Too soft for peninsular Spanish — the j is a strong /x/, scraping at the back of the throat.

✅ /xaˈmon/

ham — strong guttural jota.

❌ Pronouncing 'vino' with an English /v/ (lower lip on teeth).

Wrong — Spanish has no English-style /v/. Both b and v are pronounced with the lips alone: /ˈbino/.

✅ /ˈbino/

wine — b/v merger, lips only.

Key takeaways

  • Five clean vowels, each pronounced the same in stressed and unstressed positions. No schwa.
  • The peninsular distinción: /θ/ for c (before e/i) and z, distinct from /s/ for s. Latin America merges them (seseo).
  • The peninsular apical /s̺/ is slightly darker than English /s/, produced with the tongue tip on the alveolar ridge.
  • Trilled rr vs tapped r is phonemic (perropero).
  • Guttural jota /x/ — strong, raspy, like German ach. Soft English /h/ doesn't approximate it.
  • b and v are pronounced identically. h is always silent.
  • Stress is predictable from spelling, with written accents marking the exceptions.

Now practice Spanish

Reading grammar gets you part of the way. The exercises are where it sticks — free, no signup needed.

Start learning Spanish

Related Topics

  • El alfabeto españolA1The 27 letters of the Spanish alphabet — including the defining ñ — with peninsular letter names (uve, uve doble, ye), pronunciation notes per letter, and a clear account of why ch and ll are no longer separate letters since the 1994 RAE reform.
  • Las cinco vocalesA1Spanish has exactly five vowel sounds — /a, e, i, o, u/ — pure, short, and unreduced in every position. The single biggest pronunciation habit for English speakers to break is the schwa: Spanish vowels never weaken in unstressed syllables.
  • Distinción: la /θ/ peninsular vs el seseoA2The signature sound of peninsular Spanish — the interdental /θ/ (like English 'th' in 'think') for c before e/i and z, kept distinct from /s/. The phonemic contrast that makes casa /ˈkasa/ (house) and caza /ˈkaθa/ (hunt) different words in Madrid but homophones across Latin America.
  • R y RR: tap vs vibrante múltipleA1Spanish has two r-sounds: a quick tap /ɾ/ between vowels (pero) and a sustained trill /r/ when written rr, at the start of a word, or after n/l/s (perro, rosa, alrededor). The trill is the canonical English-speaker challenge.
  • Reglas de acentuaciónA1Spanish stress is predictable from spelling: words ending in a vowel, n, or s are stressed on the second-to-last syllable; words ending in any other consonant are stressed on the last. Exceptions are marked with a written accent. Three pattern names cover every word: aguda, llana, esdrújula.
  • Rasgos fonéticos del español peninsularB1A bird's-eye view of the constellation of features that together define peninsular pronunciation — distinción /θ/, the apical /s/ of the centre and north, the guttural jota, generalised yeísmo, robust trilled rr, and the characteristic intonation cadence — and how each contrasts with Latin American Spanish.