Birdwatch Archive - Rating Details

Birdwatch Note Rating

2024-12-20 20:12:22 UTC - HELPFUL

Rated by Participant: 261A01490BEAF6745DFBC76D21BD3BAB64F3622D6916622CDBFD5A2DCD09C89C
Participant Details

Original Note:

IT DID NOT! o3 model achieved 87.5% with high compute (172x than what's allowed) and it scored 75.7% within the limits. From ARC-AGI: "I don't think o3 is AGI yet. o3 still fails on some very easy tasks, indicating fundamental differences with human intelligence" Source: https://arcprize.org/blog/oai-o3-pub-breakthrough

All Note Details

Original Tweet

All Information

noteId - 1870197809716326755
participantId -
raterParticipantId - 261A01490BEAF6745DFBC76D21BD3BAB64F3622D6916622CDBFD5A2DCD09C89C
createdAtMillis - 1734725542305
version - 2
agree - 0
disagree - 0
helpful - 0
notHelpful - 0
helpfulnessLevel - HELPFUL
helpfulOther - 0
helpfulInformative - 0
helpfulClear - 1
helpfulEmpathetic - 0
helpfulGoodSources - 1
helpfulUniqueContext - 0
helpfulAddressesClaim - 1
helpfulImportantContext - 1
helpfulUnbiasedLanguage - 1
notHelpfulOther - 0
notHelpfulIncorrect - 0
notHelpfulSourcesMissingOrUnreliable - 0
notHelpfulOpinionSpeculationOrBias - 0
notHelpfulMissingKeyPoints - 0
notHelpfulOutdated - 0
notHelpfulHardToUnderstand - 0
notHelpfulArgumentativeOrBiased - 0
notHelpfulOffTopic - 0
notHelpfulSpamHarassmentOrAbuse - 0
notHelpfulIrrelevantSources - 0
notHelpfulOpinionSpeculation - 0
notHelpfulNoteNotNeeded - 0
ratingsId - 1870197809716326755261A01490BEAF6745DFBC76D21BD3BAB64F3622D6916622CDBFD5A2DCD09C89C