Birdwatch Note Rating
2024-12-20 20:12:22 UTC - HELPFUL
Rated by Participant: 261A01490BEAF6745DFBC76D21BD3BAB64F3622D6916622CDBFD5A2DCD09C89C
Participant Details
Original Note:
IT DID NOT! o3 model achieved 87.5% with high compute (172x than what's allowed) and it scored 75.7% within the limits. From ARC-AGI: "I don't think o3 is AGI yet. o3 still fails on some very easy tasks, indicating fundamental differences with human intelligence" Source: https://arcprize.org/blog/oai-o3-pub-breakthrough
All Note Details