Birdwatch Note Rating
2024-12-20 22:30:48 UTC - HELPFUL
Rated by Participant: 74FBBD24C4829D574AC2E7A6D334D569BA9F528DE5E080D4BE7E8F6B76286297
Original Note:
IT DID NOT! The o3 model achieved 87.5% with high compute (172x what's allowed), and it scored 75.7% within the limits. From ARC-AGI: "I don't think o3 is AGI yet. o3 still fails on some very easy tasks, indicating fundamental differences with human intelligence." Source: https://arcprize.org/blog/oai-o3-pub-breakthrough