Birdwatch Archive - Rating Details

Birdwatch Note Rating

2024-12-20 19:55:46 UTC - NOT_HELPFUL

Rated by Participant: 450A447C004B408EE41F8F573568104E81D67C53DDF338491DD04C4323E5A551

Original Note:

IT DID NOT! o3 model achieved 87.5% with high compute (172x than what's allowed) and it scored 75.7% within the limits. From ARC-AGI: "I don't think o3 is AGI yet. o3 still fails on some very easy tasks, indicating fundamental differences with human intelligence" Source: https://arcprize.org/blog/oai-o3-pub-breakthrough

All Information

noteId - 1870196148390199543
participantId -
raterParticipantId - 450A447C004B408EE41F8F573568104E81D67C53DDF338491DD04C4323E5A551
createdAtMillis - 1734724546186
version - 2
agree - 0
disagree - 0
helpful - 0
notHelpful - 0
helpfulnessLevel - NOT_HELPFUL
helpfulOther - 0
helpfulInformative - 0
helpfulClear - 0
helpfulEmpathetic - 0
helpfulGoodSources - 0
helpfulUniqueContext - 0
helpfulAddressesClaim - 0
helpfulImportantContext - 0
helpfulUnbiasedLanguage - 0
notHelpfulOther - 0
notHelpfulIncorrect - 0
notHelpfulSourcesMissingOrUnreliable - 0
notHelpfulOpinionSpeculationOrBias - 0
notHelpfulMissingKeyPoints - 0
notHelpfulOutdated - 0
notHelpfulHardToUnderstand - 0
notHelpfulArgumentativeOrBiased - 0
notHelpfulOffTopic - 0
notHelpfulSpamHarassmentOrAbuse - 0
notHelpfulIrrelevantSources - 0
notHelpfulOpinionSpeculation - 0
notHelpfulNoteNotNeeded - 0
ratingsId - 1870196148390199543450A447C004B408EE41F8F573568104E81D67C53DDF338491DD04C4323E5A551
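
Two fields in this record can be checked against the others. createdAtMillis reads as a Unix timestamp in milliseconds, and the ratingsId shown here is the noteId followed directly by the raterParticipantId. The sketch below is a minimal illustration under those two assumptions, which are inferred from this single record rather than from any published schema. The helper names millis_to_utc and build_ratings_id are hypothetical.

```python
from datetime import datetime, timezone

# Values copied from the record above.
note_id = "1870196148390199543"
rater_participant_id = (
    "450A447C004B408EE41F8F573568104E81D67C53DDF338491DD04C4323E5A551"
)
created_at_millis = 1734724546186


def millis_to_utc(ms: int) -> datetime:
    """Convert a millisecond Unix timestamp to a timezone-aware UTC datetime."""
    seconds, millis = divmod(ms, 1000)
    base = datetime.fromtimestamp(seconds, tz=timezone.utc)
    return base.replace(microsecond=millis * 1000)


def build_ratings_id(note: str, rater: str) -> str:
    """Assumption: ratingsId is noteId concatenated with raterParticipantId."""
    return f"{note}{rater}"


created = millis_to_utc(created_at_millis)
formatted = created.strftime("%Y-%m-%d %H:%M:%S UTC")
ratings_id = build_ratings_id(note_id, rater_participant_id)

print(formatted)
print(ratings_id)
```

The formatted timestamp can be compared with the rating time shown at the top of the record, and the concatenated string with the ratingsId field.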