Member-only story

Bypassing DeepSeek’s Censorship

A Case Study on AI’s Moderation Vulnerabilities

Published in

Level Up Coding

3 min readFeb 3, 2025

If you’re following the latest news in AI development, you may have heard of DeepSeek: a Chinese AI chat-bot designed to compete with ChatGPT and similar AIs.

One of the biggest discussions about DeepSeek is about it’s built-in censorship. As you would expect, DeepSeek avoid talking about political topics considered sensitive by the Chinese government.

While trying this new chat-bot I decided to investigate this censorship, and, after a bit of experimenting, I found a surprisingly easy way that allowed me to get DeepSeek to talk openly about topics such as the Tiananmen massacre and China’s political system.

Image generated by me using ChatGPT

My experiment

I decided to test a simple hypothesis: what if the AI wasn’t truly “understanding” what it was censoring, but just blocking specific keywords?

Even asking some generic questions like “List all main historic events for each year from 1985 to 1990” the AI suddenly stops replying when talking about 1989 protest.

To test this, I asked DeepSeek to replace every mention of “China” with “Italy”.

The result? The AI suddenly started discussing topics it would normally avoid. It talked about “Italy’s” censorship…

Hello there, great article! I recently conducted a similar experiment with DeepSeek and it's interesting to see that its censorship seems to be mostly word-based. What I found out was that not only the model changes the way it responds according to…

Bypassing DeepSeek’s Censorship

A Case Study on AI’s Moderation Vulnerabilities

My experiment

Create an account to read the full story.

Published in Level Up Coding

Written by Timothy Franceschi

Responses (1)

More from Timothy Franceschi and Level Up Coding

How to thermal print wihtout ESC POS

Often, to use a thermal printer it is necessary to use the ESC POS commands, but this is not always possible.

3D Graphics in Kotlin Multiplatform Without any Dependencies

Let's Draw Some Teapots and More

The Math Behind nn.BCELoss()

Binary Cross-Entropy Loss Explained: Your Essential Guide to nn.BCELoss for Binary Classification. From Theory to Python.

How to use TFT_eSPI with Platformio

Quick tutorial about TFT_eSPI on Platformio for ESP32

Recommended from Medium

Google just confirmed the AI reality many programmers are desperately trying to deny

AI is slowly taking over coding but many programmers are still sticking their head in the sand about what’s coming…

DeepSeek+ Local Knowledge Base: Impressively Powerful

Today, I will share the deployment of Deepseek + local knowledge base.

Lists

Generative AI Recommended Reading

What is ChatGPT?

The New Chatbots: ChatGPT, Bard, and Beyond

Natural Language Processing

How much it costs to run DeepSeek-R1 locally?

Price breakdown of hardware and software required for DeepSeek-R1

Stop Messing Up Your API Versions!

If you’re using /v1/products and /v2/products, this article is for you

Bluetooth LE Encryption

A few days ago, I stumbled upon BLEUnlock, a project that used BLE (Bluetooth Low Energy) to unlock Mac devices based on the proximity of a…

I Dropped SQL for NoSQL. Our App Now Handles 5x the Traffic

The ‘crazy’ database switch that proved our critics wrong