Birdwatch Note Rating
2025-01-04 02:26:22 UTC - HELPFUL
Rated by Participant: 42061111625742E514CE9AF209680B536208FE9D4F850B7C1136B2C084E9840C
Participant Details
Original Note:
This misrepresents how LLMs work. The user manipulated Grok by restricting it to single-word answers and asked questions where 'Jew' was the only valid response. Grok then followed the chat's pattern. The same outcome is reproducible with other words, like "dog". Jew https://x.com/i/grok/share/dAA6HRZhpiSQrRRDMEDRH6XkB Dog https://x.com/i/grok/share/aTkxFJ4P9Qd4eLjPlHE0PNX5a https://promptengineering.org/unlocking-ai-with-priming-enhancing-context-and-conversation-in-llms-like-chatgpt
All Note Details