• milicent_bystandr@lemm.ee · 1 month ago

    I reckon we can get a lot closer than an LLM in time. For one thing, the mind has a particular understanding of interim steps, whereas, as I understand it, the LLM has no real concept of meaning between the inputs and the output. Some of that interim processing is, I think, an important part of how we assess the truthfulness of generated ideas before we put them into words.

    • Zement@feddit.nl · 1 month ago

      I experimented with rules like: “Summarize everything in our discussion into one text you can use as memory, below your answer.” and “Summarize and remove unnecessary info from this text; if contradictions occur, act curious to resolve them.”… simply to mimic a short-term memory.

      It kind of worked better for problem solving, but it ate tokens like crazy and the answers took longer and longer. The current GPT-4 models seem to do something similar in the background.
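      For anyone curious, here is a minimal sketch of that rolling-summary trick in Python. It assumes a hypothetical ask_llm() stand-in for whatever chat-completion call is actually used, and it naively splits the summary off as the final paragraph of each answer:

          def ask_llm(prompt: str) -> str:
              """Hypothetical stand-in for a real chat-completion API call."""
              raise NotImplementedError

          def chat_with_memory(user_messages: list[str]) -> list[str]:
              memory = ""  # rolling summary carried from turn to turn
              replies = []
              for msg in user_messages:
                  prompt = (
                      f"Memory of our discussion so far:\n{memory}\n\n"
                      f"User: {msg}\n\n"
                      "Answer the user. Below your answer, summarize everything "
                      "in our discussion into one text you can use as memory. "
                      "If contradictions occur, act curious to resolve them."
                  )
                  answer = ask_llm(prompt)
                  # Naive split: assume the model puts the summary in the final
                  # paragraph, and carry it forward as next turn's memory.
                  reply, _, memory = answer.rpartition("\n\n")
                  replies.append(reply or answer)
              return replies

      Each turn re-sends the whole summary as part of the prompt, which is exactly why token use balloons the way described above.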

      • milicent_bystandr@lemm.ee · 1 month ago

        I think that’s still different from what I’m thinking of as interim steps, though.

        …but as I think how to explain I realize I’m about to blather about things I don’t understand, or at least haven’t had time to think about! So I’d better leave it there!

      • Infomatics90@lemmy.ca · 1 month ago

        I would really like to get into LLM and AI development, but the math… whoosh, right over my head.