home.social

#arcprize — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #arcprize, aggregated by home.social.

  1. There are two realities in AI today: one sees AGI just around the corner, the other urges caution. The #ARCPrize offers a clear test — can an AI truly understand and generalize, or is it just memorizing patterns?
    #divaexchange
    Learn more: diva.exchange/en/privacy/will-

  2. #OpenAI’s o3 model reportedly scores 76–88% on ARC benchmarks. François Chollet calls it a breakthrough - but warns: this still isn’t AGI. The #ARCPrize helps us tell real intelligence from clever shortcuts. #divaexchange

    More: diva.exchange/en/privacy/will-

  3. Chollet says today’s AI isn’t brilliant - it’s just “big databases.” These systems retrieve information well but fail at novel problems. The #ARCPrize sets real challenges that demand understanding and generalization, not just memorization.
    #divaexchange

    More info: diva.exchange/en/privacy/will-

  4. "Yesterday OpenAI announced some very impressive results from their not-yet-released o3 model. According to the announcement, o3 has made enormous progress over its predecessors on several “reasoning” benchmarks, in particular, two quite difficult ones: Frontier Math, a benchmark containing hundreds of unpublished math problems that are known to be hard even for human math whizzes, and the Abstraction and Reasoning Corpus (ARC), a collection of concept-induction tasks which I’ve written about here, here, and here.

    In this post I’ll discuss the o3 results on ARC. If you’re interested in AI and active on social media, you’ve likely already heard about these results, but I’ll try to add more context and my own thoughts here."

    aiguide.substack.com/p/did-ope

    #AI #GenerativeAI #OpenAI #o3 #ArcPrize #AbstractReasoning #LLMs

  5. arcprize.org/blog/oai-o3-pub-b

    “You'll know AGI is here when the exercise of creating tasks that are easy for regular humans but hard for AI becomes simply impossible.”

    #ai #arcprize #arcagi #agi #openai #o3

  6. Today a jaw-dropping score of "38" was achieved on the ARC-AGI leaderboard.

    Interview with the team behind it.
    youtube.com/watch?v=jSAT_RuJ_C
    #agi #arcprize #arcagi #kaggle