This is getting interesting. Using the same model in “HuggingChat” (the free account based chatbot interface from HF), the restriction isn’t there. Seems to be some filtereing being done on the demo.
The HuggingChat one also isn’t one-shot, so you can reply. Here it didn’t reverse tianamen properly, so I asked it to check that word again. And it answered this. Still very, err,… “diplomatic”:
yes. the other reply in this thread is mine