Screenshot of this question was making the rounds last week. But this article covers testing against all the well-known models out there.

Also includes outtakes on the ‘reasoning’ models.

  • Jax@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    4
    ·
    edit-2
    18 hours ago

    Absolutely not, I’m still just scratching my head at how something like this is allowed to happen.

    Has any human ever said that they’re worried about their car getting dirtied on the way to the carwash? Maybe I could see someone arguing against getting a carwash, citing it getting dirty on the way home — but on the way there?

    Like you would think it wouldn’t have the basis to even put those words together that way — should I see this as a hallucination?

    Granted, I would never ask an AI a question like this — it seems very far outside of potential use cases for it (for me).

    Edit: oh, I guess it could have been said by a person in a sarcastic sense

    • _g_be@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      11 hours ago

      you understand the context, and can implicitly understand the need to drive to the car wash’, but these glorified auto-complete machines will latch on to the “should I walk there” and the small distance quantity. It even seems to parrot words about not wanting to drive after having your car washed. There’s no ‘thinking’ about the whole thought, and apparently no logical linking of two separate ideas

      • Jax@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        2
        ·
        16 hours ago

        I guess I’ll know to be impressed by AI when it can distinguish things like sarcasm.