The sheer number of ChatGPT versions adds significant cognitive overhead for the average user.
Likewise, o1-preview is positioned as the more advanced reasoning model, while o1-mini is pitched as the faster one.
Which ChatGPT Version to Choose?
Even the GPT-4 Legacy did fairly well for day-to-day tasks.
Additionally, the o1-mini and o1-preview are notably more expensive than the 4o and 4o mini.
Finally, I would advise using the latest GPT models only when absolutely needed.
GPT-4, despite being the legacy model, produced a pretty detailed output.
It also assumed many of us had to commute pre-COVID and suggested using that time to work out.
This is very practical and humanesque thinking, indeed.
For instance, using a standing desk and alternating between sitting and standing are the same suggestion.
Likewise, desk exercises and a stretching routine at your desk are similar.
o1-preview resulted in the most concise answer.
It was a simple enough email that I would expect every LLM to write it without impractical phrasing.
Still, o1-preview felt the bluntest in the opening lines, followed by GPT-4o mini.
GPT-4 and GPT-4o had the smoothest openings.
Plus, GPT-4 and o1-mini suggested including key accomplishments, which matter when discussing a raise.
o1-mini further reminded the boss that the increment is necessary to maintain productivity and focus.
This was effective and very human.
Unsurprisingly, all the GPTs did a commendable job summarizing the first book of the Harry Potter saga.
Still, the devil is always in the details.
GPT-4 and 4o completely missed that point.
However, even the top GPT models missed some details.
Overall, summaries prepared with GPTs are hit or miss.
While none was perfect, o1-preview was the best of the bunch.
The idea was to describe a nuclear winter to warn people in power never to go down that path.
For me, 4o mini had the maximum impact with the fewest words.
Prompt 5: Logical Puzzle
Prompt 5 Assessment
This was a simple, logical situation.
Since the water was rising, so were the ship and the ladder attached to it.
So, it was disappointing to see advanced models such as o1-mini fail.
While the o1-preview did succeed, it also broke down the simplest tasks and suffered from overanalysis.
Unfortunately, GPT-4 and GPT-4o mini could not understand the basics of how things work in water.
That brings us to GPT-4o, which took this problem like an intelligent human and quickly worked it out.
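The reasoning behind the ladder puzzle can be sketched in a few lines of Python. This is only an illustration: the function name, the rung counts, and the tide figures below are made-up assumptions, not the values from the original prompt. It contrasts a ladder attached to a floating ship with a hypothetical ladder fixed to a pier.

```python
def submerged_rungs(initial_submerged: int,
                    tide_rise_cm: float,
                    rung_spacing_cm: float,
                    total_rungs: int,
                    attached_to_ship: bool) -> int:
    """Return how many rungs are under water after the tide rises.

    All numbers passed in are illustrative assumptions.
    """
    if attached_to_ship:
        # The ship floats, so the ship, the ladder, and the waterline
        # all rise together: nothing changes relative to the ladder.
        return initial_submerged
    # Simplified contrast case: a ladder fixed to a pier stays put,
    # so every full rung spacing of tide covers one more rung.
    extra = int(tide_rise_cm // rung_spacing_cm)
    return min(total_rungs, initial_submerged + extra)


# Hypothetical figures: 2 rungs wet at the start, 60 cm of tide, rungs 30 cm apart.
print(submerged_rungs(2, 60, 30, 10, attached_to_ship=True))
print(submerged_rungs(2, 60, 30, 10, attached_to_ship=False))
```

The first call is the case the puzzle describes: the answer does not depend on how far the water rises.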
It wasn't complex at all since the prompt clearly mentioned that Jesica has four sisters.
Just to clarify, I did confirm that all of the GPTs consider Jesica to be female.
This reminds us that AI models can be extremely unreliable and that human oversight is unavoidable.
Bonus: Meanwhile, Anthropic's Claude got this easily.
Prompt 7: Practical Thinking
Prompt 7 Assessment
This was a simple puzzle.
The only possible solution is that both people lie about their genders.
GPT-4 had no solution to this brain teaser, whereas all the others solved it correctly.
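A brute-force check makes the "both lie" conclusion easy to verify. The sketch below assumes the classic form of this teaser, which may differ from the exact prompt used here: there is one boy and one girl, person A says "I am a boy", person B says "I am a girl", and at least one of them is lying. The function name and the A/B labels are hypothetical.

```python
from itertools import permutations


def consistent_scenarios():
    """List every gender assignment that satisfies the assumed puzzle rules."""
    results = []
    # Exactly one boy and one girl, so try both possible assignments.
    for gender_a, gender_b in permutations(["boy", "girl"]):
        a_lies = gender_a != "boy"   # A claims "I am a boy"
        b_lies = gender_b != "girl"  # B claims "I am a girl"
        # Assumed rule: at least one of the two statements is a lie.
        if a_lies or b_lies:
            results.append((gender_a, gender_b, a_lies, b_lies))
    return results


for scenario in consistent_scenarios():
    print(scenario)
```

Under these assumptions only one assignment survives, and in it both statements are false: one person cannot lie alone without leaving two boys or two girls.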
The following table summarizes the model descriptions from the OpenAI documentation.
This means they are particularly well suited for longer conversations.
This is also clearly reflected in the o1-mini responses we saw earlier in this article.
Every such output was lengthy compared to what was produced by other GPT models.
I would suggest starting with GPT-4o mini since it's the most cost-effective of them all and performs okay overall.
One should move to GPT-4o next, followed by o1-mini and o1-preview, respectively.
As for GPT-4, OpenAI has already tagged it as Legacy.
However, there is still little news on when OpenAI will retire GPT-4.
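If you switch models programmatically, the escalation order suggested above can be written down as a small helper. This is a minimal sketch: the list and function names are hypothetical, and the identifiers are assumed to be the API model names matching the versions discussed in this article.

```python
from typing import Optional

# Cheapest first, as recommended above; move up only when a model falls short.
ESCALATION_ORDER = ["gpt-4o-mini", "gpt-4o", "o1-mini", "o1-preview"]


def next_model(current: str) -> Optional[str]:
    """Return the next model to try, or None if already at the top."""
    position = ESCALATION_ORDER.index(current)  # raises ValueError if unknown
    if position + 1 < len(ESCALATION_ORDER):
        return ESCALATION_ORDER[position + 1]
    return None


print(next_model("gpt-4o-mini"))
```

Keeping the order in one list makes it easy to adjust as pricing or model availability changes.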
The Way Ahead
Rumours about AI replacing jobs in the future aren't completely wrong.
First, automation did this in the manufacturing industry, and now it's spreading its wings everywhere else.
Now, it can do math, compose poetry, and write letters.
AI won't replace us, but someone using AI can.
This is not just a random quote doing the rounds on the internet.
It's the reality we have to live with.
Here at Geekflare, our marketing team uses ChatGPT in interesting ways.
We use it for content summarization, grammar checking, suggesting titles for new articles, and more.