Birdwatch Archive - Rating Details

Birdwatch Note Rating

2024-12-21 00:02:56 UTC - HELPFUL

Rated by Participant: F3DCB6C06E0BD254437A8416573752FBAFEAAFF241BA9122B95451919F60E074
Participant Details

Original Note:

Terence Tao was referring only to the hardest questions in the benchmark, not all the questions. The result is a significant improvement, but tweet’s presentation is misleading. https://x.com/__nmca__/status/1870191873249181825?s=46 https://epoch.ai/frontiermath

All Note Details

Original Tweet

All Information

noteId - 1870236606860263894
participantId -
raterParticipantId - F3DCB6C06E0BD254437A8416573752FBAFEAAFF241BA9122B95451919F60E074
createdAtMillis - 1734739376316
version - 2
agree - 0
disagree - 0
helpful - 0
notHelpful - 0
helpfulnessLevel - HELPFUL
helpfulOther - 0
helpfulInformative - 0
helpfulClear - 1
helpfulEmpathetic - 0
helpfulGoodSources - 0
helpfulUniqueContext - 0
helpfulAddressesClaim - 1
helpfulImportantContext - 1
helpfulUnbiasedLanguage - 1
notHelpfulOther - 0
notHelpfulIncorrect - 0
notHelpfulSourcesMissingOrUnreliable - 0
notHelpfulOpinionSpeculationOrBias - 0
notHelpfulMissingKeyPoints - 0
notHelpfulOutdated - 0
notHelpfulHardToUnderstand - 0
notHelpfulArgumentativeOrBiased - 0
notHelpfulOffTopic - 0
notHelpfulSpamHarassmentOrAbuse - 0
notHelpfulIrrelevantSources - 0
notHelpfulOpinionSpeculation - 0
notHelpfulNoteNotNeeded - 0
ratingsId - 1870236606860263894F3DCB6C06E0BD254437A8416573752FBAFEAAFF241BA9122B95451919F60E074