With the new GPT o1 preview released, I had to see it in action. I came up with a fairly easy challenge that might trip up an old generation. Here's the prompt I gave it...
"The same person resides on two timelines. Each experience the same challenge. Both attempt to solve the problem using different methods but only one action each. Of the actions they chose, one is right, one is wrong and one is correct. Please explain this."
The ChatGPT o1 preview solved it in 28 seconds. Watching it reason through the challenge was nuts.
Here's what it concluded:
In this scenario, we have two timelines featuring the same person who encounters the same problem. Each version of the person attempts to solve the problem using a single action.
Here's the breakdown:
1. First Timeline: The person uses an action that is wrong.
2. Second Timeline: The person uses an action that is both right and correct.
GPT nailed it. OMG.
And then I got to thinking, why is the "right" answer also "correct"? The simple answer was acceptable but getting GPT a little closer to the rabbit hole might be fun. What if the answer was being defined in a more traditional context e.g. "This is how we've done it for years..."
Which could mean the person with the "wrong" answer might have had an alternate way to solve the puzzle in a non-traditional sense. Allowing for innovation that might even solve it in a more efficient way. Making it an accepted answer. Both wrong and correct.
Meaning both answers could be both correct and incorrect at the same time.
Presenting this to GPT:
You bring up an important consideration. In my initial explanation, I treated the "wrong" action as ineffective in solving the problem, while the "right and correct" action was the successful solution. However, as you've pointed out, there can be multiple ways to solve a problem, and what is considered "wrong" might not actually be incorrect or ineffective.
But it didn't stop here...
Then GPT's reasoning thread preceded to go down a rabbit hole and watching it over-think everything was hilarious.
Until it halted. I think there may have been smoke. lol