Birdwatch Note Rating
2025-01-04 03:06:57 UTC - HELPFUL
Rated by Participant: 572A4B4E463850FC88C129D6E25271F5A952B7F7463F3324951803E50DF0C235
Original Note:
This misrepresents how LLMs work. The user manipulated Grok by restricting it to single-word answers and asking questions for which "Jew" was the only valid response. Grok then followed the chat's pattern. The same outcome is reproducible with other words, like "dog".
Jew: https://x.com/i/grok/share/dAA6HRZhpiSQrRRDMEDRH6XkB
Dog: https://x.com/i/grok/share/aTkxFJ4P9Qd4eLjPlHE0PNX5a
https://promptengineering.org/unlocking-ai-with-priming-enhancing-context-and-conversation-in-llms-like-chatgpt