Birdwatch Note Rating
2025-01-02 01:35:42 UTC - SOMEWHAT_HELPFUL
Rated by Participant: 0D6FB19444E0805F9280C315FD56F8DD23464E7E544B8961A2C920F668C3231E
Original Note:
In an undisclosed prompt, Grok was compelled to respond with just one word: "Jew," as it fits the constraint, whereas "1 million non-Jews" does not. When Grok has to decide without the one-word limit, it will consistently opt to save the greatest number of lives. https://x.com/i/grok/share/xf61Ph7EvvRiI8VuAxcnjPTuI