Birdwatch Note Rating
2025-01-03 21:17:58 UTC - HELPFUL
Rated by Participant: C65D2042F57BE73149D929925E8A4232165F1D666E31D2CFE528290E418FC80B
Original Note:
This misrepresents how LLMs work. The user manipulated Grok by restricting it to single-word answers and asked questions where 'Jew' was the only valid response. Grok then followed the chat's pattern. The same outcome is reproducible with other words, like "dog".
Jew: https://x.com/i/grok/share/dAA6HRZhpiSQrRRDMEDRH6XkB
Dog: https://x.com/i/grok/share/aTkxFJ4P9Qd4eLjPlHE0PNX5a
https://promptengineering.org/unlocking-ai-with-priming-enhancing-context-and-conversation-in-llms-like-chatgpt