Birdwatch Archive

Birdwatch Note

2024-12-06 12:45:58 UTC - MISINFORMED_OR_POTENTIALLY_MISLEADING

A crucial context is missing: the model acted this way after it was told to pursue a goal at any cost. So while this is a valid example of instrumental convergence, it didn’t happen spontaneously. The tweet’s author agreed it’s misleading: https://x.com/shakeelhashim/status/1865004134631391295?s=46 The paper: https://static1.squarespace.com/static/6593e7097565990e65c886fd/t/6751eb240ed3821a0161b45b/1733421863119/in_context_scheming_reasoning_paper.pdf

Written by FBAA585ECD5B603636B1C19DCCCDE00F95A63B0E203B5462F7657533D3B11A20

Original Tweet

Tweet embedding is no longer reliably available, due to the platform's instability (in terms of both technology and policy). If the Tweet still exists, you can view it here: https://twitter.com/foo_bar/status/1864748980908781642

Please note, though, that you may need to have your own Twitter account to access that page. I am currently exploring options for archiving Tweet data in a post-API context.

All Information

  • ID - 1865014823928705435
  • noteId - 1865014823928705435
  • participantId -
  • noteAuthorParticipantId - FBAA585ECD5B603636B1C19DCCCDE00F95A63B0E203B5462F7657533D3B11A20
  • createdAtMillis - 1733489158981
  • tweetId - 1864748980908781642
  • classification - MISINFORMED_OR_POTENTIALLY_MISLEADING
  • believable -
  • harmful -
  • validationDifficulty -
  • misleadingOther - 0
  • misleadingFactualError - 0
  • misleadingManipulatedMedia - 0
  • misleadingOutdatedInformation - 0
  • misleadingMissingImportantContext - 1
  • misleadingUnverifiedClaimAsFact - 0
  • misleadingSatire - 0
  • notMisleadingOther - 0
  • notMisleadingFactuallyCorrect - 0
  • notMisleadingOutdatedButNotWhenWritten - 0
  • notMisleadingClearlySatire - 0
  • notMisleadingPersonalOpinion - 0
  • trustworthySources - 1
  • summary
    • A crucial context is missing: the model acted this way after it was told to pursue a goal at any cost. So while this is a valid example of instrumental convergence, it didn’t happen spontaneously. The tweet’s author agreed it’s misleading: https://x.com/shakeelhashim/status/1865004134631391295?s=46 The paper: https://static1.squarespace.com/static/6593e7097565990e65c886fd/t/6751eb240ed3821a0161b45b/1733421863119/in_context_scheming_reasoning_paper.pdf
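
The createdAtMillis value above can be checked against the timestamp shown at the top of the note. The following is a minimal sketch, assuming the field is a Unix epoch timestamp in milliseconds (the page does not say so explicitly); the helper name millis_to_utc is hypothetical and not part of the archive.

```python
from datetime import datetime, timezone


def millis_to_utc(created_at_millis: int) -> str:
    """Format an epoch-milliseconds value as 'YYYY-MM-DD HH:MM:SS UTC'.

    Assumption: the input counts milliseconds since 1970-01-01 00:00:00 UTC.
    """
    # Integer division drops the millisecond part instead of rounding it.
    seconds = created_at_millis // 1000
    moment = datetime.fromtimestamp(seconds, tz=timezone.utc)
    return moment.strftime("%Y-%m-%d %H:%M:%S UTC")


# createdAtMillis as listed for this note.
print(millis_to_utc(1733489158981))
```

Under that assumption, the result matches the 2024-12-06 12:45:58 UTC timestamp given in the note header.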

Note Ratings

rated at rated by
2024-12-06 15:12:14 -0600 Rating Details
2024-12-06 15:07:35 -0600 Rating Details
2024-12-06 12:58:44 -0600 Rating Details
2024-12-06 12:52:16 -0600 Rating Details
2024-12-06 12:23:25 -0600 Rating Details
2024-12-06 11:49:15 -0600 Rating Details
2024-12-06 09:56:02 -0600 Rating Details
2024-12-06 09:43:14 -0600 Rating Details
2024-12-06 09:28:46 -0600 Rating Details
2024-12-06 09:10:54 -0600 Rating Details
2024-12-06 08:56:26 -0600 Rating Details
2024-12-06 08:48:49 -0600 Rating Details
2024-12-06 08:48:24 -0600 Rating Details
2024-12-06 08:42:10 -0600 Rating Details
2024-12-06 08:41:33 -0600 Rating Details
2024-12-06 08:30:39 -0600 Rating Details
2024-12-06 08:09:47 -0600 Rating Details
2024-12-06 07:32:56 -0600 Rating Details
2024-12-08 07:08:17 -0600 Rating Details
2024-12-08 06:25:40 -0600 Rating Details
2024-12-08 00:48:08 -0600 Rating Details
2024-12-08 00:30:05 -0600 Rating Details
2024-12-07 23:07:11 -0600 Rating Details
2024-12-07 21:57:46 -0600 Rating Details
2024-12-07 21:39:47 -0600 Rating Details
2024-12-07 17:54:34 -0600 Rating Details
2024-12-07 17:49:01 -0600 Rating Details
2024-12-07 17:00:53 -0600 Rating Details
2024-12-07 16:10:36 -0600 Rating Details
2024-12-07 15:18:42 -0600 Rating Details
2024-12-07 13:01:00 -0600 Rating Details
2024-12-07 11:42:25 -0600 Rating Details
2024-12-07 10:54:44 -0600 Rating Details
2024-12-07 10:15:16 -0600 Rating Details
2024-12-07 09:11:52 -0600 Rating Details
2024-12-07 01:49:07 -0600 Rating Details
2024-12-06 23:40:07 -0600 Rating Details
2024-12-12 10:44:48 -0600 Rating Details
2024-12-11 06:40:40 -0600 Rating Details
2024-12-08 13:09:22 -0600 Rating Details
2024-12-08 07:23:31 -0600 Rating Details
2024-12-08 04:09:49 -0600 Rating Details
2024-12-07 21:32:46 -0600 Rating Details
2024-12-07 09:33:18 -0600 Rating Details
2024-12-07 03:25:24 -0600 Rating Details
2024-12-07 01:41:22 -0600 Rating Details
2024-12-06 17:03:07 -0600 Rating Details
2024-12-06 14:43:46 -0600 Rating Details
2024-12-06 11:33:30 -0600 Rating Details
2024-12-06 11:26:38 -0600 Rating Details
2024-12-06 10:39:57 -0600 Rating Details
2024-12-06 09:37:16 -0600 Rating Details
2024-12-06 08:45:35 -0600 Rating Details
2024-12-06 08:39:14 -0600 Rating Details
2024-12-06 08:38:31 -0600 Rating Details
2024-12-06 07:35:44 -0600 Rating Details
2024-12-06 07:25:35 -0600 Rating Details
2024-12-06 07:08:10 -0600 Rating Details
2024-12-06 06:54:09 -0600 Rating Details
2024-12-06 06:49:47 -0600 Rating Details
2024-12-25 03:42:13 -0600 Rating Details
2024-12-07 23:46:34 -0600 Rating Details
2024-12-07 17:47:33 -0600 Rating Details
2024-12-07 16:47:12 -0600 Rating Details
2024-12-07 14:36:16 -0600 Rating Details
2024-12-07 12:38:43 -0600 Rating Details
2024-12-07 05:23:34 -0600 Rating Details
2024-12-06 16:17:10 -0600 Rating Details
2024-12-06 16:16:30 -0600 Rating Details
2024-12-06 14:51:42 -0600 Rating Details
2024-12-06 10:56:47 -0600 Rating Details
2024-12-06 10:26:31 -0600 Rating Details
2024-12-06 10:08:09 -0600 Rating Details
2024-12-06 10:03:06 -0600 Rating Details
2024-12-06 09:48:44 -0600 Rating Details
2024-12-06 09:29:08 -0600 Rating Details
2024-12-06 09:06:06 -0600 Rating Details
2024-12-06 08:07:27 -0600 Rating Details
2024-12-06 08:04:09 -0600 Rating Details
2024-12-06 07:56:14 -0600 Rating Details
2025-01-20 10:56:57 -0600 Rating Details
2024-12-08 14:01:51 -0600 Rating Details
2024-12-08 08:45:03 -0600 Rating Details
2024-12-07 19:36:10 -0600 Rating Details
2024-12-07 18:42:49 -0600 Rating Details
2024-12-07 18:21:48 -0600 Rating Details
2024-12-07 18:16:21 -0600 Rating Details
2024-12-07 15:05:09 -0600 Rating Details
2024-12-07 13:20:32 -0600 Rating Details
2024-12-07 08:32:44 -0600 Rating Details
2024-12-07 03:00:24 -0600 Rating Details
2024-12-07 02:50:08 -0600 Rating Details
2024-12-06 23:13:44 -0600 Rating Details
2024-12-06 16:58:05 -0600 Rating Details
2024-12-06 11:49:14 -0600 Rating Details
2024-12-06 10:09:05 -0600 Rating Details
2024-12-06 09:52:17 -0600 Rating Details
2024-12-06 08:42:20 -0600 Rating Details
2024-12-06 08:37:23 -0600 Rating Details
2024-12-06 08:32:28 -0600 Rating Details
2024-12-06 07:57:41 -0600 Rating Details
2024-12-06 07:51:46 -0600 Rating Details