Birdwatch Archive - Rating Details

Birdwatch Note Rating

2023-08-15 22:53:57 UTC - SOMEWHAT_HELPFUL

Rated by Participant: AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F

Original Note:

This article does not disclose that the study it cites uses the much less capable GPT-3.5-turbo model, rather than the more advanced GPT-4. Additionally, the study used the API rather than the website and so intentionally chose to use a worse model. https://arxiv.org/pdf/2308.02312.pdf (model on bottom of page 3)

All Information

noteId - 1691502802231501280
participantId -
raterParticipantId - AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F
createdAtMillis - 1692140037395
version - 2
agree - 0
disagree - 0
helpful - 0
notHelpful - 0
helpfulnessLevel - SOMEWHAT_HELPFUL
helpfulOther - 0
helpfulInformative - 0
helpfulClear - 0
helpfulEmpathetic - 0
helpfulGoodSources - 1
helpfulUniqueContext - 0
helpfulAddressesClaim - 0
helpfulImportantContext - 0
helpfulUnbiasedLanguage - 0
notHelpfulOther - 0
notHelpfulIncorrect - 0
notHelpfulSourcesMissingOrUnreliable - 0
notHelpfulOpinionSpeculationOrBias - 0
notHelpfulMissingKeyPoints - 1
notHelpfulOutdated - 0
notHelpfulHardToUnderstand - 0
notHelpfulArgumentativeOrBiased - 0
notHelpfulOffTopic - 0
notHelpfulSpamHarassmentOrAbuse - 0
notHelpfulIrrelevantSources - 0
notHelpfulOpinionSpeculation - 0
notHelpfulNoteNotNeeded - 0
ratingsId - 1691502802231501280AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F
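
Two relationships in the fields above can be checked mechanically: createdAtMillis appears to be a Unix epoch timestamp in milliseconds matching the UTC time shown at the top of the record, and ratingsId appears to be noteId followed directly by raterParticipantId. The sketch below is a minimal illustration of both checks. The helper names parse_fields and millis_to_utc are hypothetical, and the "key - value" line layout is assumed from this page only, not from any official export format.

```python
from datetime import datetime, timezone


def millis_to_utc(millis: int) -> str:
    """Format a Unix epoch timestamp in milliseconds as 'YYYY-MM-DD HH:MM:SS UTC'."""
    # Integer division drops the millisecond part and avoids float rounding.
    dt = datetime.fromtimestamp(millis // 1000, tz=timezone.utc)
    return dt.strftime("%Y-%m-%d %H:%M:%S UTC")


def parse_fields(lines):
    """Parse 'key - value' lines (layout assumed from this page) into a dict.

    A line with no value, such as 'participantId -', maps to an empty string.
    """
    fields = {}
    for line in lines:
        key, _, value = line.partition(" -")
        fields[key.strip()] = value.strip()
    return fields


# Fields copied from the record above.
record = parse_fields([
    "noteId - 1691502802231501280",
    "participantId -",
    "raterParticipantId - AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F",
    "createdAtMillis - 1692140037395",
    "helpfulnessLevel - SOMEWHAT_HELPFUL",
    "ratingsId - 1691502802231501280AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F",
])

created_at = millis_to_utc(int(record["createdAtMillis"]))
id_matches = record["ratingsId"] == record["noteId"] + record["raterParticipantId"]
print(created_at, id_matches)
```

For this record the converted timestamp agrees with the 2023-08-15 22:53:57 UTC header, and the concatenation check holds. Whether ratingsId is always built this way is an inference from this single record.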