Birdwatch Archive - Rating Details

Birdwatch Note Rating

2024-12-20 22:30:48 UTC - HELPFUL

Rated by Participant: 74FBBD24C4829D574AC2E7A6D334D569BA9F528DE5E080D4BE7E8F6B76286297

Original Note:

IT DID NOT! o3 model achieved 87.5% with high compute (172x than what's allowed) and it scored 75.7% within the limits. From ARC-AGI: "I don't think o3 is AGI yet. o3 still fails on some very easy tasks, indicating fundamental differences with human intelligence" Source: https://arcprize.org/blog/oai-o3-pub-breakthrough

All Information

noteId - 1870197809716326755
participantId -
raterParticipantId - 74FBBD24C4829D574AC2E7A6D334D569BA9F528DE5E080D4BE7E8F6B76286297
createdAtMillis - 1734733848041
version - 2
agree - 0
disagree - 0
helpful - 0
notHelpful - 0
helpfulnessLevel - HELPFUL
helpfulOther - 0
helpfulInformative - 0
helpfulClear - 0
helpfulEmpathetic - 0
helpfulGoodSources - 0
helpfulUniqueContext - 0
helpfulAddressesClaim - 0
helpfulImportantContext - 0
helpfulUnbiasedLanguage - 0
notHelpfulOther - 0
notHelpfulIncorrect - 0
notHelpfulSourcesMissingOrUnreliable - 0
notHelpfulOpinionSpeculationOrBias - 0
notHelpfulMissingKeyPoints - 0
notHelpfulOutdated - 0
notHelpfulHardToUnderstand - 0
notHelpfulArgumentativeOrBiased - 0
notHelpfulOffTopic - 0
notHelpfulSpamHarassmentOrAbuse - 0
notHelpfulIrrelevantSources - 0
notHelpfulOpinionSpeculation - 0
notHelpfulNoteNotNeeded - 0
ratingsId - 187019780971632675574FBBD24C4829D574AC2E7A6D334D569BA9F528DE5E080D4BE7E8F6B76286297
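
Two of the fields above can be cross-checked against the rest of the record. The sketch below assumes createdAtMillis is a Unix timestamp in milliseconds, and that ratingsId is the noteId followed directly by the raterParticipantId. Both assumptions hold for this record but are observations, not a documented rule. The function names are hypothetical and chosen for illustration only.

```python
from datetime import datetime, timezone


def created_at_utc(created_at_millis: int) -> str:
    """Format a millisecond Unix timestamp as 'YYYY-MM-DD HH:MM:SS UTC'."""
    # Integer division drops the millisecond part and avoids float rounding.
    seconds = created_at_millis // 1000
    dt = datetime.fromtimestamp(seconds, tz=timezone.utc)
    return dt.strftime("%Y-%m-%d %H:%M:%S UTC")


def ratings_id(note_id: str, rater_participant_id: str) -> str:
    """Join noteId and raterParticipantId (assumed layout, seen in this record)."""
    return f"{note_id}{rater_participant_id}"


# Values copied from the record above.
NOTE_ID = "1870197809716326755"
RATER_ID = "74FBBD24C4829D574AC2E7A6D334D569BA9F528DE5E080D4BE7E8F6B76286297"

# The formatted timestamp matches the rating time shown at the top of the page.
assert created_at_utc(1734733848041) == "2024-12-20 22:30:48 UTC"

# The joined identifier matches the ratingsId field.
assert ratings_id(NOTE_ID, RATER_ID) == (
    "187019780971632675574FBBD24C4829D574AC2E7A6D334D569BA9F528DE5E080D4BE7E8F6B76286297"
)
```

The identifiers are kept as strings so that the 19-digit noteId is never passed through a numeric type that could lose precision.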