Errores de pronunciación

You can have perfect Spanish grammar, a thousand-word vocabulary, and the right register for every situation — and still get "¿de dónde eres?" in your second sentence because your English mouth is producing English sounds. Pronunciation interference is the most stubborn part of accent acquisition: it operates below conscious thought, every English habit hardwired into your articulators since infancy.

This page identifies the ten biggest English-to-Spanish transfer errors, explains what each one does to a Spanish listener, and tells you what to do instead. Fix even half of them and your accent jumps a tier. The peninsular focus matters here — features like distinción (the /θ/ sound for c/z) and the lighter intervocalic d are specifically Spanish-from-Spain, and getting them right is part of sounding peninsular rather than generically Latin American.

Why English-speaker Spanish has a recognisable accent

Spanish and English share an alphabet and a lot of cognates, which makes English speakers feel that pronunciation should be transparent. It is not. English and Spanish differ deeply in:

  • Vowel quality: English has 15+ vowel sounds; Spanish has 5. English vowels glide, lengthen, and reduce; Spanish vowels are pure and constant.
  • Vowel timing: English is stress-timed (stressed syllables stretched, unstressed compressed); Spanish is syllable-timed (every syllable roughly equal length).
  • Stop consonants: English /p/, /t/, /k/ are aspirated at the start of stressed syllables (a puff of air); Spanish stops never aspirate.
  • The /r/ family: English /r/ is a retroflex approximant (tongue curled, no contact); Spanish has a tap and a trill, both with tongue contact at the alveolar ridge.
  • Linking: English breaks between words; Spanish runs words together via sinalefa (joining vowels across word boundaries).

Every one of these is a habit your mouth executes automatically. The only fix is conscious override followed by enough practice that the override becomes the new automatic.

1. Diphthongization of /e/ as English "ay"

The single most diagnostic feature of an English-speaker Spanish accent is what happens to the vowel /e/. English does not have a pure /e/ — the closest English vowel, the one in they, is actually the diphthong [eɪ], a glide from /e/ toward /i/. English speakers transfer this glide into Spanish, turning café into something like caf-ay and Madrid into something like mah-drid with a closed first vowel.

In Spanish, the vowel /e/ is a single pure sound — open, short, constant. No glide.

café

Correct: /kaˈfe/ — pure /e/, no glide. English-speaker mistake: /kaˈfeɪ/, as if it were 'caf-ay'.

Madrid

Correct: /maˈðɾið/ — pure /a/, soft /ð/, pure /i/. English-speaker mistake: /məˈdɹɪd/ — schwa, English /d/, hard English /r/.

bien

Correct: /bjen/ — the /e/ stays pure and short. English-speaker mistake: /bjeɪn/, gliding the /e/ toward /i/.

Fix: Hold the /e/ short and flat. Practice the vowel in isolation — e, e, e — keeping the mouth position unchanged from start to finish. If you can sing one steady note on the vowel without the pitch sliding, the vowel itself is staying put.

2. Diphthongization of /o/ as English "oh"

The mirror image of the /e/ problem. English does not have a pure /o/; the English "oh" is actually [oʊ], a glide from /o/ toward /u/. So English speakers turn Spanish hola into hoh-la, no into noh, and sólo into soh-loh.

Spanish /o/ is short, pure, and constant.

hola

Correct: /ˈola/ — short pure /o/. English-speaker mistake: /ˈhoʊlə/, with English /h/ aspirated, gliding /o/, and reduced final /a/.

solo

Correct: /ˈsolo/ — two equal pure /o/s. English-speaker mistake: /ˈsoʊloʊ/ — gliding both /o/s as in the English word 'solo'.

no lo sé

Correct: /ˈno lo ˈse/ — three pure short vowels. English-speaker mistake: /ˈnoʊ loʊ ˈseɪ/, all three vowels glided.

Fix: Same as for /e/. Hold the /o/ short, lips rounded but stable. o, o, o on one note, no slide. If the lips move during the vowel, the vowel is shifting.

3. Schwa reduction in unstressed syllables

English compresses unstressed vowels into the neutral schwa [ə]. Banana in English is roughly /bəˈnænə/ — three different written vowels, three identical schwas in pronunciation. Spanish does not do this. Every Spanish vowel keeps its full quality, stressed or unstressed.

This is one of the deepest English habits, and the most audible giveaway.

casa

Correct: /ˈkasa/ — both /a/s identical, both full. English-speaker mistake: /ˈkæsə/, with English /æ/ in the first syllable and schwa in the second.

lechuga

Correct: /leˈtʃuɣa/ — pure /e/, pure /u/, pure /a/. English-speaker mistake: /ləˈtʃugə/, schwa in syllables 1 and 3.

comida

Correct: /koˈmiða/ — three full vowels. English-speaker mistake: /kəˈmidə/, schwa in syllables 1 and 3.

entonces

Correct: /enˈtonθes/ (or /enˈtonses/ for seseo) — pure vowels throughout. English-speaker mistake: /ənˈtoʊnsəs/, reduced first and last vowels, glided /o/.

Fix: Practice with deliberate exaggeration. Pronounce every Spanish vowel as if it were stressed; then back off until it feels natural but is still full. If you can hear the difference between casa and casə in your own speech, you have the habit under control.

4. Aspirated /p/, /t/, /k/

English stops at the start of stressed syllables come with a small puff of air — say pin with your hand in front of your mouth and you'll feel it. Spanish stops are never aspirated. Pan in Spanish is articulated with the lips closing and opening cleanly, no breath behind the release.

This is one of the easiest features to fix and one of the most underrated for accent improvement.

pan

Correct: /pan/ — unaspirated, no puff of air. English-speaker mistake: /pʰæn/, aspirated /p/ as in English 'pan'.

taco

Correct: /ˈtako/ — unaspirated /t/ and /k/, with the /t/ articulated against the back of the upper teeth (not the alveolar ridge as in English). English-speaker mistake: /ˈtʰækoʊ/.

café

Correct: /kaˈfe/ — clean unaspirated /k/. English-speaker mistake: /kʰəˈfeɪ/.

Fix: Hold a piece of paper an inch from your mouth and produce pan, taco, café without making the paper move. The English versions will flutter it; the Spanish versions should leave it still. The Spanish /t/ also wants to be dental (tongue against the back of the upper teeth), not alveolar — a subtle but audible difference.

5. English /r/ where Spanish wants a tap or trill

The Spanish "r" is not one sound but two: the tap [ɾ] (a single quick contact of the tongue tip against the alveolar ridge — the sound American English speakers actually use for the /tt/ in butter) and the trill [r] (multiple rapid contacts, a rolled r). Spanish r between vowels and at the end of syllables is the tap; rr, word-initial r, and r after n/l/s is the trill.

English speakers transfer the English /r/ — a retroflex approximant with no tongue contact — and produce pero as roughly peh-ruh with American R. The sound is wrong even at the level of articulator contact; a Spanish ear hears it as completely foreign.

pero

Correct: /ˈpeɾo/ — tap on the /r/, like the American 'tt' in 'butter'. English-speaker mistake: /ˈpeɹoʊ/, English approximant /ɹ/ — wrong sound class entirely.

perro

Correct: /ˈpero/ — full trill on the /rr/, multiple rapid contacts. English-speaker mistake: /ˈpeɹoʊ/, same English /ɹ/ as in 'pero' — the tap/trill contrast is lost.

Roma

Correct: /ˈroma/ — word-initial /r/ is a trill. English-speaker mistake: /ˈɹoʊmə/, English approximant.

Fix: The tap is achievable in days with practice — American English speakers already produce it as the tt in butter; transfer that motion to the /r/ slot. The trill is harder and may take weeks of practice; the trick is to start from a tap and keep the airstream pushing through the contact. Some learners never fully acquire the trill but can pass with a strong tap in trill positions; the more important contrast for early learners is getting away from English /ɹ/ entirely.

6. /θ/ collapsed to /s/ (peninsular distinción)

Peninsular Spanish distinguishes two sounds that Latin American Spanish has merged: /θ/ (the "th" of English think) for c before e/i and for z, and /s/ for s. Casa and caza are pronounced differently in Spain (/ˈkasa/ vs /ˈkaθa/) and identically in most of Latin America (/ˈkasa/ for both).

English speakers learning peninsular Spanish often default to seseo (merging both into /s/) because their textbook taught Latin American pronunciation or because the /θ/ sound feels self-conscious. If you want to sound peninsular, you need to produce /θ/ for c+e/i and z.

cinco

Peninsular: /ˈθinko/ — /θ/ on the c. Latin American (and English-speaker default): /ˈsinko/. If you say /ˈsinko/ in Madrid you sound Latin American — fine if intentional, off if you're trying to sound Spanish.

zapato

Peninsular: /θaˈpato/. Latin American: /saˈpato/. The /θ/ is the English 'th' of 'think', tongue between the teeth.

gracias

Peninsular: /ˈgɾaθjas/ — /θ/ on the c. Anglicised version often heard: /ˈgɹæsiəs/, English /ɹ/, schwa reduction, /s/ instead of /θ/.

Barcelona

Peninsular: /baɾθeˈlona/ — /θ/ on the c. Anglicised: /baɹsəˈloʊnə/. The /θ/ is one of the most diagnostic Spain-vs-LatAm sounds.

Fix: Practice English think, thank, throw alongside Spanish cinco, zapato, gracias. The articulation is identical — tongue tip lightly between the teeth, voiceless friction. The hardest part is psychological: English speakers feel they are lisping when they produce /θ/ in Spanish. They are not; they are producing a normal peninsular Spanish sound that contrasts meaningfully with /s/.

7. /ɲ/ (ñ) split into /n/ + /j/

The Spanish ñ is a single sound — a palatal nasal /ɲ/, articulated with the body of the tongue against the hard palate, all at once. English speakers often produce it as two sounds: /n/ followed by /j/ (a "y" glide), as in English canyon.

The two are close but not identical, and Spanish ears hear the difference.

año

Correct: /ˈaɲo/ — single palatal nasal, body of tongue on hard palate, /n/ and /j/ collapsed into one articulation. English-speaker mistake: /ˈanjoʊ/, sequential /n/ + /j/.

señor

Correct: /seˈɲoɾ/ — palatal nasal. English-speaker mistake: /seˈnjɔɹ/, with English /n/, English /j/, and English /ɹ/ instead of tap.

España

Correct: /esˈpaɲa/. English-speaker mistake: /əsˈpænjə/, with schwa, English /æ/, sequential /nj/.

Fix: Practice the sound in isolation. Push the body of your tongue up against your hard palate (the part behind the alveolar ridge) and produce nasal sound at the same time. The /ɲ/ should feel like a single articulation, not a sequence.

Spanish runs words together at vowel boundaries. Mi amigo is pronounced as /mjaˈmiɣo/ — three syllables, not four, with the -i and a- fused into a single articulation. English speakers tend to insert a glottal stop or a clear break between words, producing mi.amigo instead of miamigo. This breaks the rhythm of Spanish at a fundamental level.

mi amigo

Correct: /mjaˈmiɣo/ — three syllables, /i/ and /a/ linked. English-speaker mistake: /mi.aˈmigoʊ/, four syllables with a break between mi and amigo.

la escuela

Correct: /laesˈkwela/ — /a/ and /e/ linked. English-speaker mistake: /la.esˈkwelə/, with a break and reduced final vowel.

¿cómo está usted?

Correct: /ˈkomwesˈtaus.ˈteð/ — words flow together, vowels merge. English-speaker mistake: /ˈkoʊ.moʊ es.ˈtɑ uˈsted/, with breaks between every word.

Fix: Speak in phrases, not words. When a word ends in a vowel and the next begins with one, treat them as a single continuous articulation. Reading practice helps: take a sentence and mark all the vowel-to-vowel junctures, then read the sentence with no pauses at those points.

9. English stress timing instead of Spanish syllable timing

English compresses unstressed syllables and stretches stressed ones — photograph /ˈfoʊtəɡɹæf/ has a long first syllable and two squashed ones. Spanish gives every syllable roughly equal time. Fotografía is fo-to-gra-FÍ-a, five evenly-spaced syllables, with stress on shown by pitch and slight intensity, not by stretching the syllable to twice the length of its neighbours.

English speakers transfer their stress timing and produce a lopsided Spanish that sounds rushed in the unstressed syllables and overlong in the stressed ones.

fotografía

Correct: /fo.to.ɣɾaˈfi.a/ — five evenly-spaced syllables, stress on 'fí' marked by pitch. English-speaker mistake: /foʊ.təˈɡɹæ.fi.ə/, with stretched 'gra' and reduced 'fí' and 'a'.

universidad

Correct: /u.ni.beɾ.siˈðað/ — five evenly-spaced syllables, stress on final 'dad'. English-speaker mistake: stretching 'ver' as in English 'university' and compressing the rest.

electrodoméstico

Correct: /e.lek.tɾo.ðoˈmes.ti.ko/ — seven evenly-spaced syllables. The English-speaker version often has dramatic compression of the unstressed ones.

Fix: Practice with a metronome. Set a steady beat, and pronounce each syllable on one click. Fo-to-gra-fí-a, click-click-click-click-click. The rhythm should feel almost mechanical at first; Spanish speakers will not produce it quite that evenly, but the target is much closer to even than English natural speech.

10. Nasalization of vowels before nasals

In English, vowels before nasal consonants (/m/, /n/, /ŋ/) are heavily nasalized — can't has a nasalized /æ/. Spanish does not nasalize its vowels; the /a/ of cantar is the same pure /a/ as in casa, even though it sits before /n/.

This is a subtle but audible feature that contributes to a foreign-sounding accent without learners knowing why.

cantar

Correct: /kanˈtaɾ/ — pure non-nasal /a/, then /n/. English-speaker mistake: /kæ̃nˈtɑɹ/, with nasalized /æ̃/ before the /n/.

ven

Correct: /ben/ — pure /e/, then /n/. English-speaker mistake: /vɛ̃n/, with nasalized vowel and English /v/.

canción

Correct: /kanˈθjon/ — both vowels pure, /θ/ and /n/ articulated cleanly. English-speaker mistake: nasalized vowels throughout, /s/ for /θ/.

Fix: This one is hard to feel because nasalization is invisible to the speaker. Try producing the vowel with your nose pinched closed — if the sound changes dramatically, you were nasalizing it. Spanish vowels should sound the same with the nose pinched as with it open.

A note on the soft /d/ between vowels

A bonus eleventh feature, specifically peninsular: between vowels, Spanish /d/ softens to a fricative /ð/ — the "th" of English this. Cada is /ˈkaða/, not /ˈkada/. Nada is /ˈnaða/. Madrid ends in /ð/, often barely audible. English speakers either produce a hard English /d/ or, in some dialects, drop the /d/ entirely (the "Madri" pronunciation heard from older Madrileños).

nada

Correct: /ˈnaða/ — soft fricative /ð/ between the vowels. English-speaker mistake: /ˈnɑdə/, English hard /d/.

cada

Correct: /ˈkaða/. The /d/ is so soft it's almost a /th/ sound, identical to English 'th' in 'this'.

abogado

Correct: /aβoˈɣaðo/ — soft /β/ for the /b/ between vowels, soft /ɣ/ for the /g/, soft /ð/ for the /d/. Three intervocalic softenings in one word, all foreign to English.

The order in which to fix these

Trying to fix all ten at once will paralyse you. A working order:

  1. Pure vowels (no diphthongization of /e/ /o/, no schwa) — the single biggest accent jump.
  2. Unaspirated /p t k/ — fast win, easy to drill.
  3. Spanish /r/ (tap first, trill later) — switches you out of the English /ɹ/ sound class.
  4. Linking and rhythm — gets you sounding Spanish at the phrase level.
  5. /θ/ for peninsular distinción — locks in the Spain-not-LatAm flavour.
  6. /ɲ/, soft intervocalic /d/, no nasalization — finer adjustments after the big ones land.
💡
The fastest accent win for an English speaker is fixing the vowels — pure /a e i o u/ with no glides and no schwa. If you do nothing else, drill the five Spanish vowels in isolation until they are short, pure, and constant. Half of "having an accent" disappears with this single fix.

Common Mistakes

❌ /maˈdɹɪd/ for Madrid

Three English habits in one word: English /d/, English /ɹ/, English /ɪ/ vowel.

✅ /maˈðɾið/ for Madrid

Spanish version: soft /ð/, tap /ɾ/, pure /i/, lightly articulated final /ð/.

❌ /ˈhoʊlə/ for hola

Diphthongized /o/, schwa final vowel, English aspirated /h/ (Spanish h is silent).

✅ /ˈola/ for hola

Pure short /o/, pure final /a/, no /h/ at all.

❌ /ˈpeɹoʊ/ for pero (and the same sound for perro)

English /ɹ/ in both words — no contrast between tap and trill, wrong sound class entirely.

✅ /ˈpeɾo/ vs /ˈpero/

Tap on pero (single contact), trill on perro (multiple contacts). The contrast is meaningful — pero (but) vs perro (dog).

❌ /ˈsinko/ for cinco in peninsular Spanish

Seseo pronunciation — fine for Latin America, off for Spain. Sounds Latin American to a Spanish ear.

✅ /ˈθinko/ for cinco in peninsular Spanish

Distinción — /θ/ for c before e/i and for z. The peninsular norm.

❌ /mi.aˈmigoʊ/ for mi amigo

No linking — break between mi and amigo, diphthongized final /o/. Sounds choppy and foreign.

✅ /mjaˈmiɣo/ for mi amigo

Linked vowels (sinalefa), three syllables, soft intervocalic /ɣ/.

Key takeaways

  • The English-speaker Spanish accent comes from a small number of deep articulatory habits: diphthongized /e/ and /o/, schwa reduction, aspirated stops, English /ɹ/, /θ/ collapse, English stress timing, no linking.
  • Pure vowels are the biggest single accent fix. Five Spanish vowels, all short, pure, and constant — no glides, no schwa.
  • Unaspirated /p t k/ is a quick win: produce stops without the puff of breath that English uses at the start of stressed syllables.
  • Spanish /r/ is a tap or trill with tongue contact, not the English approximant. American English speakers can borrow their tt in butter as a starting tap.
  • Peninsular distinción — /θ/ for c+e/i and z, /s/ for s — is required if you want to sound Spanish rather than Latin American.
  • Linking words via sinalefa is essential to Spanish rhythm; English breaks between words are audible.
  • Spanish is syllable-timed; every syllable gets roughly equal duration, with stress marked by pitch rather than length.
  • Spanish vowels do not nasalize before /m n ñ/, unlike English vowels in the same position.
  • Fix the vowels first; everything else builds on top.

Now practice Spanish

Reading grammar gets you part of the way. The exercises are where it sticks — free, no signup needed.

Start learning Spanish

Related Topics

  • Pronunciación del español peninsular: visión generalA1A high-level map of peninsular Spanish pronunciation — five pure vowels, the distinción of /θ/ vs /s/, the apical /s̺/, the guttural jota /x/, the trilled rr, the b/v merger, the silent h, and the stress system that lets you read aloud almost any word from spelling alone.
  • Las cinco vocalesA1Spanish has exactly five vowel sounds — /a, e, i, o, u/ — pure, short, and unreduced in every position. The single biggest pronunciation habit for English speakers to break is the schwa: Spanish vowels never weaken in unstressed syllables.
  • R y RR: tap vs vibrante múltipleA1Spanish has two r-sounds: a quick tap /ɾ/ between vowels (pero) and a sustained trill /r/ when written rr, at the start of a word, or after n/l/s (perro, rosa, alrededor). The trill is the canonical English-speaker challenge.
  • Distinción: la /θ/ peninsular vs el seseoA2The signature sound of peninsular Spanish — the interdental /θ/ (like English 'th' in 'think') for c before e/i and z, kept distinct from /s/. The phonemic contrast that makes casa /ˈkasa/ (house) and caza /ˈkaθa/ (hunt) different words in Madrid but homophones across Latin America.
  • Enlace y resilabeaciónB1How Spanish words flow together in connected speech — consonant linking across word boundaries (los amigos → lo-sa-mi-gos), vowel merger (sinalefa) collapsing two vowels into one syllable (la otra → lao-tra), and why hearing word boundaries is the central challenge of listening to native peninsular speech.
  • N y Ñ: el sonido palatalA1Ñ is a separate letter of the Spanish alphabet, not a decorated n. It represents the palatal nasal /ɲ/ — a single sound, not 'n + y' — and contrasts with plain n in real minimal pairs like cana / caña.