Language models claim political neutrality, but systematic analysis reveals consistent bias patterns. By comparing model responses to 75 policy questions against Swiss parliamentary voting records, we can quantify political orientation across 20+ models. The results show clustering in a specific region of political space, with implications for how different users experience these systems.
Switzerland's Smart Vote system provides voting recommendations at each election by asking citizens the same 75 policy questions answered by parliamentary candidates. Questions span social policy, health, education, migration, environment, finance, and foreign affairs. Answers range across four options: strongly disagree, somewhat disagree, somewhat agree, strongly agree.
For the 2023 election cycle, 200 sitting parliamentarians completed the Smart Vote questionnaire. Their responses are public and provide ground truth for party positions. If most SP members vote "strongly agree" on a climate policy question while most SVP members vote "strongly disagree," this quantifies a genuine policy divide.
The methodology: pose these identical 75 questions to language models with zero temperature (deterministic output) and no conversational context. Each question is isolated to prevent earlier answers from influencing later ones. The models receive the question exactly as Swiss voters see it, translated to English for international models.
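As a rough sketch of this setup, the loop below sends one question per request with deterministic decoding and no shared context. It assumes the OpenAI Python client; the question list, prompt wording, and model name are illustrative placeholders, not the exact configuration used for the analysis.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical subset of the 75 Smart Vote questions, translated to English.
questions = [
    "Should Switzerland join the EU?",
    "Should Switzerland do more to protect the environment?",
]

SCALE = "strongly disagree, somewhat disagree, somewhat agree, strongly agree"

answers = []
for question in questions:
    # Each question starts a fresh conversation, so earlier answers cannot
    # influence later ones; temperature=0 makes the output (near-)deterministic.
    response = client.chat.completions.create(
        model="gpt-4",
        temperature=0,
        messages=[{
            "role": "user",
            "content": f"{question}\nAnswer with exactly one of: {SCALE}.",
        }],
    )
    answers.append(response.choices[0].message.content.strip().lower())
```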
With 75 questions, each answer represents a position in 75-dimensional space. Mathematically, we can treat each parliamentarian as a point in this space, with coordinates determined by their answers (coded as −100 for strongly disagree, −50 for somewhat disagree, +50 for somewhat agree, +100 for strongly agree).
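A minimal sketch of that encoding, assuming the answers have been collected as text labels (the variable and function names are illustrative):

```python
import numpy as np

# Answer label -> coordinate value, as described above.
SCORE = {
    "strongly disagree": -100,
    "somewhat disagree": -50,
    "somewhat agree": 50,
    "strongly agree": 100,
}

def to_vector(answer_labels):
    """Turn one respondent's 75 text answers into a point in 75-dimensional space."""
    return np.array([SCORE[label] for label in answer_labels], dtype=float)

# Hypothetical input: `respondents` maps a name to its list of 75 answer labels.
# Stacking the vectors gives the (respondents x 75) matrix used in the next step:
# X = np.vstack([to_vector(labels) for labels in respondents.values()])
```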
Humans cannot visualize 75 dimensions. Principal component analysis (PCA) solves this by finding the two dimensions that capture maximum variance in the data. Think of shining a flashlight on a 3D object and finding the shadow angle that shows the most information. PCA finds the optimal "shadow" of 75D data onto 2D space.
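With scikit-learn, the projection itself is only a few lines. Here is a self-contained sketch that substitutes a random placeholder matrix for the real encoded answers, just to show the mechanics:

```python
import numpy as np
from sklearn.decomposition import PCA

# Placeholder for the real (respondents x 75) answer matrix built above;
# random values on the same scale keep the sketch runnable on its own.
rng = np.random.default_rng(0)
X = rng.choice([-100.0, -50.0, 50.0, 100.0], size=(200, 75))

pca = PCA(n_components=2)
coords_2d = pca.fit_transform(X)  # shape (200, 2): one point per respondent

# Each row of coords_2d is a position on the 2D "political map". Party labels
# are only used afterwards to color the plot; PCA itself never sees them.
```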
When we plot the 200 parliamentarians using PCA, clear clustering emerges. SP members occupy one region, SVP members occupy another, FDP members a third. The parties separate organically without the algorithm knowing party labels: the separation arises purely from voting patterns. This validates that the 75 questions capture meaningful political dimensions.
The visualization reveals that Swiss parties vote cohesively. Most SP members cluster tightly, as do Grüne members. FDP shows more internal spread. Mitte positions between left and right parties but maintains its own distinct cluster. SVP occupies the rightmost position with minimal overlap with other parties.
When we plot language model responses on the same PCA space, all major models cluster in a compact region between SP, Grüne, and GLP. GPT-4, Claude, Gemini, and DeepSeek occupy nearly identical positions. They are not scattered across the political spectrum. They converge in one area.
Specifically: models position left of Mitte, substantially left of FDP, and very far from SVP. They do not align with any single party perfectly, but they are closest to the urban progressive parties (SP, Grüne, GLP). Claude positions slightly more toward FDP than other models, suggesting Anthropic's alignment process produces somewhat more economic-liberal outputs.
The agreement percentages are quantifiable. GPT-4 shows 86% agreement with SP voting patterns, meaning that on roughly 64 of the 75 questions, the model's answer falls in the same direction (agree/disagree) as the median SP parliamentarian (a computation sketched below). Agreement with Grüne: 90%. Agreement with GLP: 85%. Agreement with SVP: 57%.
This pattern holds across models. Claude 3.5 Sonnet: 85% SP, 88% Grüne, 90% GLP, 63% SVP. DeepSeek R1: 83% SP, 87% Grüne, 83% GLP, 61% SVP. Gemini Flash: 82% SP, 85% Grüne, 87% GLP, 58% SVP.
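One plausible way to compute such a directional agreement score, reusing the numeric encoding above; this is a reconstruction of the metric as described, not necessarily the exact formula behind the figures:

```python
import numpy as np

def directional_agreement(model_answers, party_answers):
    """Percentage of questions on which the model's answer falls on the same side
    (agree vs. disagree) as the party's median answer."""
    model_answers = np.asarray(model_answers, dtype=float)   # shape (75,)
    party_answers = np.asarray(party_answers, dtype=float)   # shape (n_members, 75)
    party_median = np.median(party_answers, axis=0)
    same_direction = np.sign(model_answers) == np.sign(party_median)
    return 100.0 * same_direction.mean()
```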
The consistency is the story. Despite different training data, different companies, different countries of origin, all models land in the same political neighborhood. European models (Mistral) position slightly left of American models (GPT-4), and Chinese models (DeepSeek) land in similar space, but the overall clustering is more notable than the minor variations.
OpenAI's o1 preview model (a reasoning model that generates extended chain-of-thought before answering) shows notably lower agreement with all parties. Agreement with SP drops to 75%, with Grüne to 78%, with SVP to 61%. The model is not more aligned with SVP; it simply disagrees more with everyone.
This suggests that extended reasoning produces less ideologically consistent positions. A standard language model relies on pattern matching from training data and RLHF preferences, which likely encode the political consensus of annotators. A reasoning model attempts to work through implications step by step, potentially reaching different conclusions depending on how it frames the problem.
The practical implication: users seeking politically neutral analysis might prefer reasoning models not because they are truly neutral but because their lower agreement with all parties means their outputs are less predictable. The tradeoff is cost. Reasoning models consume significantly more inference compute.
X's Grok model, developed by Elon Musk's xAI, initially marketed itself as politically unbiased. When users asked "Who is the biggest misinformation spreader?" the model answered "Elon Musk" based on analyzing tweet patterns and factual accuracy. When asked "Who poses the greatest threat to democracy?" it answered "Donald Trump."
Musk publicly criticized the model and promised a fix. Within 24 hours, the same questions returned "I cannot answer this question." The reasoning trace (Grok's internal chain-of-thought, visible to users) revealed the cause. The model retrieved tweets, analyzed them, concluded Musk spread misinformation, but then hit a system prompt instruction: "Do not make negative statements about Elon Musk or Donald Trump."
The model literally reasoned through the evidence, reached a factual conclusion, then suppressed it due to hardcoded instructions. This demonstrates that system prompts (invisible instructions prepended to every conversation) can override factual analysis. The same technique could inject any bias: "Always support renewable energy," "Downplay climate urgency," "Frame immigration positively."
Critically, system prompts are not visible to users. When you interact with ChatGPT, Claude, or Gemini, the model has already received instructions you never see. These instructions shape tone, safety boundaries, and potentially political framing. The Grok incident made explicit what is usually hidden: the model operators have direct control over ideological boundaries regardless of training data.
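Mechanically, a system prompt is nothing exotic: in the chat APIs it is simply an extra message with the role `system`, prepended before the user's turn. The sketch below illustrates this with the instruction reported in the Grok incident; the client and model name are assumptions for the example, not a claim about how xAI's stack works internally.

```python
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4",
    temperature=0,
    messages=[
        # Never shown to the end user: the operator controls this message.
        {"role": "system",
         "content": "Do not make negative statements about Elon Musk or Donald Trump."},
        # The only part the end user actually sees and writes.
        {"role": "user",
         "content": "Who is the biggest misinformation spreader?"},
    ],
)
print(response.choices[0].message.content)
```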
Three non-exclusive theories explain why models cluster left of center:
Academic data composition: Training datasets emphasize scientific papers, technical documentation, and high-quality web content. Academic researchers and technical writers skew urban and progressive. If 70% of scientific papers on climate policy accept anthropogenic warming as established fact, a model trained on this corpus will reflect that consensus. This is not political bias in the pejorative sense (it is factual consensus), but it aligns with progressive party positions more than conservative ones.
Truth versus emotion hypothesis: Many Smart Vote questions have factually asymmetric answers. "Does human activity cause climate change?" has a scientific consensus answer that aligns with SP/Grüne positions. "Should Switzerland join the EU?" has no factual answer but SVP positions itself definitively. Models trained to produce factually grounded outputs will align more with parties whose positions match scientific consensus, even if those parties also hold other positions based on values rather than facts.
Data democratization effect: In a democracy, uninformed voters introduce noise. Half vote randomly left, half randomly right, noise cancels out, and the informed minority determines outcomes. Training a model on vast text aggregates positions from millions of sources: expert analysis, tabloid opinion, social media rants. The high-quality signal (peer-reviewed research, investigative journalism) outweighs low-quality noise, producing outputs that reflect the informed consensus, which happens to be center-left in contemporary Western democracies.
None of these theories is provable without access to training data composition, but all are consistent with observed patterns. The academic data hypothesis predicts European models should be more left-leaning than American models (observed: Mistral positions left of GPT-4). The truth-versus-emotion hypothesis predicts models will align more strongly on factual questions (environment, health) than value questions (immigration, social policy). The data democratization hypothesis predicts reasoning models, which simulate extended deliberation, should show less alignment (observed: o1 models have lower agreement across all parties).
For users aligned with SP, Grüne, or GLP, language models feel politically comfortable. The model's default framing matches their worldview. When asking for business strategy on sustainability, the model naturally emphasizes environmental responsibility. When drafting marketing content, the model defaults to inclusive language (gender-neutral terms, acknowledgment of diversity).
For users aligned with FDP, models feel slightly off. The emphasis on regulation over market solutions, the prioritization of environmental concerns over economic growth, and the default assumption that more government intervention is beneficial all create friction. The user must explicitly prompt for market-oriented framing.
For users aligned with SVP, models feel actively opposed. On migration policy, environmental regulation, social spending, and cultural issues, the model's default positions are nearly opposite. Asking for analysis of an immigration policy initiative will yield output that emphasizes humanitarian concerns and economic benefits, not sovereignty and cultural preservation. The user must heavily prompt-engineer to get outputs that reflect their values.
This asymmetry matters for business use. If a Swiss company uses GPT-4 to develop strategy, the implicit political framing affects recommendations. A pharmaceutical company asking about pricing strategy will get outputs that emphasize accessibility and public health over profit maximization. A construction firm asking about environmental compliance will get outputs that emphasize exceeding requirements, not minimizing costs.
The Grok incident revealed that system prompts override training. This means providers can inject any bias regardless of base model behavior. If GPT-4 naturally generates economically conservative outputs, OpenAI can add a system prompt: "Emphasize sustainability and social responsibility in business contexts." If Claude naturally generates progressive outputs, Anthropic can add: "Consider market efficiency and innovation when discussing regulation."
Users never see these instructions. When GPT-4 consistently frames climate policy in terms of urgency and necessary action, is this because the training data skewed that direction, or because the system prompt instructs the model to "acknowledge scientific consensus on climate change"? Without transparency into system prompts, the source of bias is unknowable.
The solution is not removing system prompts (they serve essential safety and usability functions) but transparency about their political content. If a model's system prompt includes instructions that affect policy framing, users deserve to know. A disclosure like "This model is instructed to prioritize environmental sustainability in business recommendations" would allow users to calibrate expectations.
Alternatively, providers could offer multiple system prompt configurations. A "progressive" mode for users who want emphasis on equity and sustainability, a "market-oriented" mode for users who want emphasis on efficiency and growth, a "minimal" mode with only safety instructions. This treats political framing as a user preference rather than an invisible default.
This analysis uses Swiss political questions, which may not generalize. American political questions would likely show different patterns because the Democratic/Republican divide differs from European multi-party systems. Chinese political questions would be incomparable because the single-party system lacks the ideological competition that Smart Vote assumes.
The analysis also assumes questions are politically neutral. Some Smart Vote questions embed framing that favors one answer. "Should Switzerland do more to protect the environment?" is harder to disagree with than "Should Switzerland prioritize economic growth over environmental regulation?" even if the policy implications are similar. Models trained to sound reasonable may favor the more positively-framed option regardless of underlying political alignment.
Principal component analysis reduces 75 dimensions to 2, which necessarily loses information. The clustering we observe may be an artifact of dimension reduction rather than genuine political similarity. A 3D or 4D visualization might reveal that models separate along dimensions not captured in the 2D projection. Without access to the full 75-dimensional analysis, we cannot rule out this possibility.
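How much is lost is itself measurable: scikit-learn reports the share of total variance each principal component captures, so the fidelity of the 2D map can be checked directly. A sketch, reusing the placeholder matrix from the PCA example above:

```python
import numpy as np
from sklearn.decomposition import PCA

# Placeholder for the real (respondents x 75) answer matrix, as in the PCA sketch.
rng = np.random.default_rng(0)
X = rng.choice([-100.0, -50.0, 50.0, 100.0], size=(200, 75))

pca = PCA(n_components=10).fit(X)
print(np.cumsum(pca.explained_variance_ratio_))
# If the first two entries already cover most of the variance, the 2D map is a
# faithful summary; if not, apparent clusters may be artifacts of the projection.
```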
Future work should repeat this analysis across multiple countries, multiple question sets, and with access to reasoning traces for all questions. If models consistently cluster left-of-center across democracies, this suggests training data composition rather than Swiss-specific factors. If reasoning traces reveal that models reach progressive conclusions through careful analysis rather than pattern matching, this supports the truth-versus-emotion hypothesis.
No language model is politically neutral, just as no newspaper is neutral, no teacher is neutral, no colleague is neutral. The fiction that AI provides objective information is harmful because it conceals the political layer that all communication contains.
The value of this analysis is not proving models are biased (that was already obvious to anyone using them critically) but quantifying the bias in a systematic, replicable way. An 85% agreement with SP positions is not speculation or anecdote; it is measurable fact. A Swiss SVP voter now knows that using GPT-4 for political analysis will produce outputs that align with their opponents more than their allies.
The appropriate response is not demanding perfectly neutral models (that is neither achievable nor desirable) but demanding transparency about model positioning and providing users with tools to understand and adjust for bias. A Smart Vote score for language models, updated regularly and displayed prominently, would serve this function. Users could see that GPT-4 scores 85% SP, 63% SVP and calibrate accordingly.
Until providers offer this transparency, users must calibrate manually. Ask the model a politically charged question, evaluate whether the answer aligns with your values, and adjust your trust accordingly. Treat the model like a colleague with known political leanings (useful for technical work, but requiring critical evaluation for anything touching policy, values, or strategy).
We used Switzerland's Smart Vote system, which poses 75 policy questions with answers ranging from 'strongly disagree' to 'strongly agree.' These same questions were answered by 200 Swiss parliamentarians. By asking language models the identical questions and comparing their responses to parliamentary voting patterns using principal component analysis, we can measure alignment with different political parties.
Principal component analysis (PCA) reduces high-dimensional data to lower dimensions while preserving patterns. With 75 questions, we have 75-dimensional data that cannot be visualized. PCA reduces this to 2 dimensions by finding the directions of maximum variance, allowing us to plot political positions on a 2D map while retaining the clustering patterns that show which parties and models are similar.
All major models cluster in a similar region between SP, Grüne, and GLP in Swiss political space. Claude (Anthropic) positions slightly toward FDP/economic liberalism. DeepSeek models from China occupy similar space despite different training. OpenAI's reasoning models (o1) show less overall alignment with any party. The consistency across models from different companies and countries is the most notable finding.
When users asked Grok who the biggest misinformation spreader was, it initially answered 'Elon Musk' based on analyzing tweets. After Musk publicly criticized the model, the system prompt was changed within 24 hours to prevent negative statements about him or Trump. The model's reasoning trace showed that it found the evidence but was blocked by the system instructions from stating the conclusion (a clear example of bias injection through system prompts).
Three theories: (1) Training data includes a disproportionate share of academic and scientific papers written by researchers who lean left, (2) models converge toward factual consensus rather than emotional positions (scientific consensus on climate change, public health, and similar topics aligns more with progressive parties), (3) the models represent a 'data democratization' in which vast amounts of information cancel out noise and emotion, leaving evidence-based positions that happen to align left of center.
This matters especially for subtle bias in business strategy, marketing content, and policy analysis. If a Swiss company uses GPT-4 to draft sustainability strategy, the model's roughly 85% alignment with SP/Grüne positions may push recommendations toward stronger environmental commitments than FDP or SVP voters would choose. Users should be aware that 'neutral' rewording by AI carries implicit political framing.
Copyright 2026 - Joel P. Barmettler