Birdwatch Note Rating
2025-01-04 23:23:06 UTC - HELPFUL
Rated by Participant: 95B6D34232862F1A8AD84E4B47A56491DE714E60854FD224571C77504AAFF187
Original Note:
This misrepresents how LLMs work. The user manipulated Grok by restricting it to single-word answers and asked questions where 'Jew' was the only valid response. Grok then followed the chat's pattern. The same outcome is reproducible with other words, like "dog".
Jew: https://x.com/i/grok/share/dAA6HRZhpiSQrRRDMEDRH6XkB
Dog: https://x.com/i/grok/share/aTkxFJ4P9Qd4eLjPlHE0PNX5a
https://promptengineering.org/unlocking-ai-with-priming-enhancing-context-and-conversation-in-llms-like-chatgpt