Birdwatch Archive

Birdwatch Note Rating

2023-11-30 19:20:35 UTC - NOT_HELPFUL

Rated by Participant: B65E6D540B6EF683DF969A320E058B3632C298B57B1EC3E87EB4FCAC148B2706

Original Note:

https://t.co/97exJtFHWC The paper details an adversarial attack method on GPT. No way to know output context as prompts or photoshop could create this. In any case, as of the time of this writing this adversarial attack apparently no longer works.

All Information

  • noteId - 1730225511547240826
  • participantId -
  • raterParticipantId - B65E6D540B6EF683DF969A320E058B3632C298B57B1EC3E87EB4FCAC148B2706
  • createdAtMillis - 1701372035088
  • version - 2
  • agree - 0
  • disagree - 0
  • helpful - 0
  • notHelpful - 0
  • helpfulnessLevel - NOT_HELPFUL
  • helpfulOther - 0
  • helpfulInformative - 0
  • helpfulClear - 0
  • helpfulEmpathetic - 0
  • helpfulGoodSources - 0
  • helpfulUniqueContext - 0
  • helpfulAddressesClaim - 0
  • helpfulImportantContext - 0
  • helpfulUnbiasedLanguage - 0
  • notHelpfulOther - 0
  • notHelpfulIncorrect - 0
  • notHelpfulSourcesMissingOrUnreliable - 1
  • notHelpfulOpinionSpeculationOrBias - 0
  • notHelpfulMissingKeyPoints - 0
  • notHelpfulOutdated - 0
  • notHelpfulHardToUnderstand - 0
  • notHelpfulArgumentativeOrBiased - 0
  • notHelpfulOffTopic - 0
  • notHelpfulSpamHarassmentOrAbuse - 0
  • notHelpfulIrrelevantSources - 1
  • notHelpfulOpinionSpeculation - 1
  • notHelpfulNoteNotNeeded - 0
  • ratingsId - 1730225511547240826B65E6D540B6EF683DF969A320E058B3632C298B57B1EC3E87EB4FCAC148B2706
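Two of the fields above appear to be derivable from the others: the timestamp in the header matches createdAtMillis read as Unix epoch milliseconds, and ratingsId matches noteId followed directly by raterParticipantId. The sketch below checks both relationships for this record. It is a minimal illustration, assuming those relationships hold generally; the helper names millis_to_utc and build_ratings_id are hypothetical and not part of any Birdwatch tooling.

```python
from datetime import datetime, timezone

# Values copied from the record above.
note_id = "1730225511547240826"
rater_participant_id = (
    "B65E6D540B6EF683DF969A320E058B3632C298B57B1EC3E87EB4FCAC148B2706"
)
created_at_millis = 1701372035088


def millis_to_utc(ms: int) -> str:
    """Format Unix epoch milliseconds as a UTC timestamp (hypothetical helper)."""
    # Integer division drops the millisecond part, which the header does not show.
    dt = datetime.fromtimestamp(ms // 1000, tz=timezone.utc)
    return dt.strftime("%Y-%m-%d %H:%M:%S UTC")


def build_ratings_id(note: str, rater: str) -> str:
    """Concatenate noteId and raterParticipantId (assumed ratingsId layout)."""
    return note + rater


if __name__ == "__main__":
    print(millis_to_utc(created_at_millis))
    print(build_ratings_id(note_id, rater_participant_id))
```

For this record, the formatted time agrees with the header line and the concatenated ID agrees with the ratingsId field.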