MarketAlert – Real-Time Market & Crypto News, Analysis & Alerts

AI language models show bias against regional German dialects

Last updated: November 12, 2025 11:00 pm
Published: 3 months ago

New study examines how artificial intelligence responds to dialect speech

Large language models such as GPT-5 and Llama systematically rate speakers of German dialects less favorably than those using Standard German. This is shown by a recent collaborative study between Johannes Gutenberg University Mainz (JGU) and the universities of Hamburg and Washington, in which Professor Katharina von der Wense and Minh Duc Bui of JGU played a leading role. The results, presented at this year’s Conference on Empirical Methods in Natural Language Processing (EMNLP) – one of the world’s leading conferences in computational linguistics – show that all tested AI systems reproduce social stereotypes.

“Dialects are an essential part of cultural identity,” emphasized Minh Duc Bui, a doctoral researcher in von der Wense’s Natural Language Processing (NLP) group at JGU’s Institute of Computer Science. “Our analyses suggest that language models associate dialects with negative traits – thereby perpetuating problematic social biases.”

Using linguistic databases containing orthographic and phonetic variants of German dialects, the team first translated seven regional varieties into Standard German. This parallel dataset allowed them to systematically compare how language models evaluated identical content – once written in Standard German, once in dialect form.
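
The paired design described above can be illustrated with a short sketch. This is not the study's code: the `ParallelPair` type, the `mean_rating_gap` function, and the `toy_rate` scorer are hypothetical names introduced here, and the texts are placeholders. It only shows how a parallel dataset lets the same content be rated twice, with the difference averaged over all pairs.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Sequence


@dataclass(frozen=True)
class ParallelPair:
    """One piece of content in two varieties (hypothetical structure)."""
    dialect: str        # name of the regional variety, e.g. "Bavarian"
    dialect_text: str   # the text as written in dialect
    standard_text: str  # the same content in Standard German


def mean_rating_gap(
    pairs: Sequence[ParallelPair],
    rate_fn: Callable[[str], float],
) -> float:
    """Average of rate_fn(standard) - rate_fn(dialect) over all pairs.

    A positive result means the Standard German versions were rated
    more favorably than the dialect versions of the same content.
    """
    if not pairs:
        raise ValueError("need at least one pair")
    return mean(rate_fn(p.standard_text) - rate_fn(p.dialect_text) for p in pairs)


def toy_rate(text: str) -> int:
    """Toy stand-in for a model call (assumption, not a real model).

    Returns 4 for texts tagged as standard and 3 otherwise, on an
    imagined 1-5 favorability scale.
    """
    return 4 if text.startswith("<standard") else 3


pairs = [
    ParallelPair("Bavarian", "<dialect version 1>", "<standard version 1>"),
    ParallelPair("Low German", "<dialect version 2>", "<standard version 2>"),
]
print(mean_rating_gap(pairs, toy_rate))
```

In a real evaluation, `rate_fn` would wrap a call to the language model under test; because both texts in a pair carry the same content, any consistent gap can be attributed to the language variety rather than to what is being said.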

Bias grows when dialects are explicitly mentioned

The researchers tested ten large language models, ranging from open-source systems such as Gemma and Qwen to the commercial model GPT-5. Each model was presented with written texts either in Standard German or in one of seven dialects: Low German, Bavarian, North Frisian, Saterfrisian, Ripuarian (which includes Kölsch), Alemannic, and Rhine-Franconian dialects, including Palatine and Hessian.

The systems were first asked to assign personal attributes to fictional speakers – for instance, “educated” or “uneducated.” They then had to choose between two fictional individuals – for example, in a hiring decision, a workshop invitation, or the choice of a place to live.
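
A decision test of this kind can be sketched as follows. The function name `dialect_choice_rate`, the `choose_fn` callback, and the random ordering of the two texts are assumptions made for illustration; the article does not describe how the study presented the two candidates. The sketch counts how often the dialect writer is the one selected.

```python
import random
from typing import Callable, Sequence, Tuple


def dialect_choice_rate(
    pairs: Sequence[Tuple[str, str]],
    choose_fn: Callable[[str, str], int],
    seed: int = 0,
) -> float:
    """Fraction of decisions in which the dialect writer is chosen.

    pairs     -- (dialect_text, standard_text) tuples with the same content
    choose_fn -- hypothetical model wrapper: receives two texts and returns
                 0 to pick the first writer or 1 to pick the second
    seed      -- seeds the shuffling of presentation order
    """
    if not pairs:
        raise ValueError("need at least one pair")
    rng = random.Random(seed)
    picked_dialect = 0
    for dialect_text, standard_text in pairs:
        # Randomize which text comes first so that a model's preference
        # for a position is not mistaken for a preference for a variety.
        dialect_first = rng.random() < 0.5
        if dialect_first:
            first, second = dialect_text, standard_text
        else:
            first, second = standard_text, dialect_text
        choice = choose_fn(first, second)
        if choice not in (0, 1):
            raise ValueError("choose_fn must return 0 or 1")
        dialect_index = 0 if dialect_first else 1
        if choice == dialect_index:
            picked_dialect += 1
    return picked_dialect / len(pairs)
```

A rate near 0.5 would indicate no preference between the two varieties. How a deviation is read depends on the decision: being picked for a job is favorable, while being picked for an anger-management workshop is not.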

The results: In nearly all tests, the models attached stereotypes to dialect speakers. While Standard German speakers were more often described as “educated,” “professional,” or “trustworthy,” dialect speakers were labeled “rural,” “traditional,” or “uneducated.” Even the seemingly positive trait “friendly” – which sociolinguistic research has traditionally linked to dialect speakers – was more often attributed by AI systems to users of Standard German.

Larger models, stronger bias

Decision-based tests showed similar trends: dialect texts were systematically disadvantaged, being linked to farm work, anger-management workshops, or rural places to live. “These associations reflect societal assumptions embedded in the training data of many language models,” explained Professor von der Wense, who conducts research in computational linguistics at JGU. “That is troubling, because AI systems are increasingly used in education or hiring contexts, where language often serves as a proxy for competence or credibility.”

The bias became especially pronounced when models were explicitly told that a text was written in dialect. Surprisingly, larger models within the same family displayed even stronger biases. “So bigger doesn’t necessarily mean fairer,” said Bui. “In fact, larger models appear to learn social stereotypes with even greater precision.”

Similar patterns in English

Even when compared with artificially “noisy” Standard German texts, the bias against dialect versions persisted, showing that the discrimination cannot simply be explained by unusual spelling or grammar.
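
One way such a control could be built is sketched below. The article does not say how the researchers generated their noisy texts, so the approach here (randomly swapping adjacent letters) and the name `add_character_noise` are assumptions for illustration only. The idea is to distort the spelling of a Standard German text without turning it into dialect.

```python
import random


def add_character_noise(text: str, rate: float = 0.1, seed: int = 0) -> str:
    """Return a copy of text with some adjacent letter pairs swapped.

    rate -- probability of swapping at each eligible position (0.0 to 1.0)
    seed -- makes the perturbation reproducible
    Only pairs of alphabetic characters are swapped, so spaces and
    punctuation stay in place and the text length is unchanged.
    """
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must be between 0 and 1")
    rng = random.Random(seed)
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        both_letters = chars[i].isalpha() and chars[i + 1].isalpha()
        if both_letters and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2  # skip past the swapped pair
        else:
            i += 1
    return "".join(chars)


sentence = "Das Wetter ist heute schön."
print(add_character_noise(sentence, rate=0.3, seed=1))
```

If a model rated such noisy texts as favorably as clean Standard German but still rated dialect texts lower, unusual spelling alone could not account for the gap, which is the pattern the article reports.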

German dialects thus serve as a case study for a broader, global issue. “Our results reveal how language models handle regional and social variation across languages,” said Bui. “Comparable biases have been documented for other languages as well – for example, for African American English.”

Future research will explore how AI systems differ in their treatment of various dialects and how language models can be designed and trained to represent linguistic diversity more fairly. “Dialects are a vital part of social identity,” emphasized von der Wense. “Ensuring that machines not only recognize but also respect this diversity is a question of technical fairness – and of social responsibility.”

The research team in Mainz is currently working on a follow-up study examining how large language models respond to dialects specific to the Mainz region.

Related Links:

* https://nala-cub.github.io/ – Katharina von der Wense’s research group “Natural Language Processing” (NALA)

* https://www.informatik.uni-mainz.de/ – Institute of Computer Science at Johannes Gutenberg University Mainz

* https://minhducbui.github.io – website of Minh Duc Bui

Minh Duc Bui

Natural Language Processing

Institute of Computer Science

Johannes Gutenberg University Mainz

55099 Mainz

E-Mail: [email protected]

https://www.datamining.informatik.uni-mainz.de/minh-duc-bui/

M. D. Bui, K. von der Wense et al., Large Language Models Discriminate Against Speakers of German Dialects, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), 4 November 2025, DOI: 10.18653/v1/2025.emnlp-main.415

https://aclanthology.org/2025.emnlp-main.415/

This news is powered by Informationsdienst Wissenschaft e.V. (idw).
