Birdwatch Note Rating
2025-01-04 13:53:17 UTC - HELPFUL
Rated by Participant: E399C46705E5CAAFC7862B59E52A74FD0176991DBD5C42EAD84FCB4528AAD1F4
Participant Details
Original Note:
This misrepresents how LLMs work. The user manipulated Grok by restricting it to single-word answers and asked questions where 'Jew' was the only valid response. Grok then followed the chat's pattern. The same outcome is reproducible with other words, like "dog". Jew https://x.com/i/grok/share/dAA6HRZhpiSQrRRDMEDRH6XkB Dog https://x.com/i/grok/share/aTkxFJ4P9Qd4eLjPlHE0PNX5a https://promptengineering.org/unlocking-ai-with-priming-enhancing-context-and-conversation-in-llms-like-chatgpt
All Note Details