Birdwatch Note Rating
2024-12-20 19:55:46 UTC - NOT_HELPFUL
Rated by Participant: 450A447C004B408EE41F8F573568104E81D67C53DDF338491DD04C4323E5A551
Participant Details
Original Note:
IT DID NOT! o3 model achieved 87.5% with high compute (172x than what's allowed) and it scored 75.7% within the limits. From ARC-AGI: "I don't think o3 is AGI yet. o3 still fails on some very easy tasks, indicating fundamental differences with human intelligence" Source: https://arcprize.org/blog/oai-o3-pub-breakthrough
All Note Details