Birdwatch Archive

Birdwatch Note Rating

2023-08-15 22:53:57 UTC - SOMEWHAT_HELPFUL

Rated by Participant: AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F
Participant Details

Original Note:

This article does not disclose that the study it cites uses the much less capable GPT-3.5-turbo model, rather than the more advanced GPT-4. Additionally, the study used the API rather than the website and so intentionally chose to use a worse model. https://arxiv.org/pdf/2308.02312.pdf (model on bottom of page 3)

All Note Details

Original Tweet

All Information

  • noteId - 1691502802231501280
  • participantId -
  • raterParticipantId - AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F
  • createdAtMillis - 1692140037395
  • version - 2
  • agree - 0
  • disagree - 0
  • helpful - 0
  • notHelpful - 0
  • helpfulnessLevel - SOMEWHAT_HELPFUL
  • helpfulOther - 0
  • helpfulInformative - 0
  • helpfulClear - 0
  • helpfulEmpathetic - 0
  • helpfulGoodSources - 1
  • helpfulUniqueContext - 0
  • helpfulAddressesClaim - 0
  • helpfulImportantContext - 0
  • helpfulUnbiasedLanguage - 0
  • notHelpfulOther - 0
  • notHelpfulIncorrect - 0
  • notHelpfulSourcesMissingOrUnreliable - 0
  • notHelpfulOpinionSpeculationOrBias - 0
  • notHelpfulMissingKeyPoints - 1
  • notHelpfulOutdated - 0
  • notHelpfulHardToUnderstand - 0
  • notHelpfulArgumentativeOrBiased - 0
  • notHelpfulOffTopic - 0
  • notHelpfulSpamHarassmentOrAbuse - 0
  • notHelpfulIrrelevantSources - 0
  • notHelpfulOpinionSpeculation - 0
  • notHelpfulNoteNotNeeded - 0
  • ratingsId - 1691502802231501280AA3C29D32E6821065984294D978D56D9F9A8D2C0799F333963D5A3FD1D58635F