ChatGPT-4o Mini: Why Bigger AI Isn’t Always Better (2024)

And why bigger isn’t always better when it comes to AI.

The Gist

Cost-efficient upgrade. ChatGPT-4o mini offers significant savings with over 60% cheaper development costs.
Benchmark excellence. Outperforms rivals in multimodal and mathematical reasoning evaluations.
Tech ecosystem booster. Mini model enhances OpenAI's competitive edge and enterprise applications.

Like other generative AI providers, OpenAI continues to find ways to advance its ChatGPT platform. But this time its biggest advancement is its smallest model release.

Last week, OpenAI unveiled the ChatGPT-4o mini, a compact model praised for its cost-efficient AI performance. Set to replace the GPT-3.5 Turbo, it becomes the smallest model available from OpenAI. Consumers can access ChatGPT-4o mini through ChatGPT's web and mobile apps, while developers can incorporate it into their AI projects. The model has officially launched, with enterprise users gaining access this week.

The introduction of ChatGPT-4o mini furthers the trend toward smaller AI model applications and accelerates AI development for mobile devices.

Related Article: OpenAI’s GPT4o: Smarter, Faster — and It Speaks

The Key Specs of ChatGPT-4o Mini

OpenAI is touting ChatGPT-4o mini as proof of its commitment to making artificial intelligence accessible “as broadly as possible” by expanding the range of applications that incorporate AI.

Mini Improvements

ChatGPT-4o mini shares the improved tokenizer introduced with GPT-4o. It has a context window of up to 128K tokens and supports up to 16K output tokens per request. Its responses reflect knowledge of events up to October 2023, which improves topical relevance. ChatGPT-4o mini can also handle non-English text.
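
To make the two token limits concrete, here is a minimal sketch of how a developer might check how much output room a prompt leaves. It assumes "128K" and "16K" mean 128,000 and 16,000 tokens (the article gives only the rounded figures) and that prompt and output tokens share the same context window. The function name `max_output_budget` is hypothetical and not part of any OpenAI library.

```python
# Rounded figures quoted in the article; the exact values are an assumption.
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 16_000


def max_output_budget(prompt_tokens: int) -> int:
    """Return the largest output size (in tokens) that still fits.

    Assumes the prompt and the generated output share one context window,
    and that output is additionally capped per request.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    # The budget is whichever is smaller: the per-request output cap
    # or the space left in the window. It is never negative.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    for prompt in (1_000, 120_000, 130_000):
        print(prompt, "->", max_output_budget(prompt))
```

Under these assumptions, a short prompt is limited by the output cap, while a very long prompt is limited by whatever space remains in the window.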

Higher Scores

The result is a larger range capacity, improved textual intelligence and improved multimodal reasoning that exceeds current benchmark performance, according to OpenAI. ChatGPT-4o mini scored 59.4% on a multimodal reasoning evaluation called MMMU. This was higher than its main rivals, Gemini Flash (56.1%) and Claude Haiku (50.2%).

Pushing the Small Language Model Boundary

In marketing ChatGPT-4o mini, OpenAI has highlighted that the model pushes the small language model boundary on affordability as well. OpenAI claims the cost per token is more than 60% cheaper than that of GPT-3.5 Turbo. Developers pay 15 cents per 1M input tokens and 60 cents per 1M output tokens. OpenAI estimates that 1M tokens is roughly equivalent to 2,500 pages in a standard book. The blend of affordability and increased capability is a significant attraction for developers seeking to adopt small language models to reduce training and development costs.
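
The per-token pricing above translates into a simple calculation. The following is a rough sketch, not an official billing formula: it uses only the two prices quoted in this article, and the function name `estimate_cost_usd` and the sample workload are hypothetical.

```python
# Prices quoted in the article, in US dollars per 1 million tokens.
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated charge in US dollars for one workload."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 2M tokens read, 500K tokens generated.
    print(f"${estimate_cost_usd(2_000_000, 500_000):.2f}")  # prints $0.60
```

Because output tokens cost four times as much as input tokens at these prices, a workload that generates long responses costs more than one that mostly reads long prompts.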

Related Article: ChatGPT Is All the Rage but Don't Stop Learning Just Yet

Staying Competitive: Keeping Up With The AI Joneses

All of this plays into the trend toward providing a multimodal large language model (MLLM) to users, a trend OpenAI must address to stay competitive. The interest in small language models has been bubbling among AI developers since AI platforms arrived in the consumer marketplace.

Current AI Solutions

The current AI solutions, like Claude, Gemini, and ChatGPT, are based on foundation models, a type of large-scale machine learning model created from a broad training data set. Foundation models introduced a new querying paradigm, shifting AI away from being trained on task-specific data to perform a narrow range of functions. The result was more adaptability and fine-tuning for a variety of applications and downstream media tasks.

Developers’ Aims

But training foundation models requires a large amount of memory and a daunting amount of computational capacity, which makes it hugely expensive.

Thus, as development yields performance gains, developers aim to deploy small language models that maintain performance and adaptability with less training data and lower computational requirements.

Any tech company thinking of AI has a significant interest in multimodal language models operating from within smart devices. When I reported on Apple's Ferret LLM, the personal computer maker’s first open-source AI foray for developers, I noted the small LLM version because it was made with iOS device applications in mind. Having an in-house AI framework available for its smartphones and tablets would strengthen Apple’s tech ecosystems — it would give developers a way to develop AI-based applications more quickly for its device lineup and provide a means to integrate application features across devices.

For OpenAI, the launch of a mini version of ChatGPT will provide the company with a similar tech ecosystem advantage — one that marketers working on AI initiatives should monitor as the AI tech space evolves.

FAQs

How accurate is ChatGPT 4o? ›

ChatGPT achieved more than 50% accuracy across all US Medical Licensing Examination exams (MedRXIV)

Is ChatGPT 4o better than ChatGPT 4? ›

ChatGPT-4: Incorporates safety measures focused on text generation, including filtering harmful content and maintaining ethical guidelines. ChatGPT-4o: Enhances safety across all modalities with advanced filtering, post-training adjustments, and new safety systems for voice outputs.

Why ChatGPT 4o? ›

It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.

Is GPT-4o smarter than GPT-4? ›

In most cases, GPT-4o is indeed better than GPT-4. OpenAI now describes GPT-4o as its flagship model, and its improved speed, lower costs and multimodal capabilities will be appealing to many users. That said, some users may still prefer GPT-4, especially in business contexts.

How big is the difference between ChatGPT 3.5 and 4? ›

Expanded Context Length: ChatGPT 4.0 can retain up to 25,000 words of context from chats, a significant increase from the 3,000-word limit of ChatGPT 3.5. This expanded context length allows for more comprehensive conversations and analysis.

Is GPT-4o dumber than GPT-4? ›

Some users have found that GPT-4o is not as good at coding as GPT-4. They report that the 4o model repeats code multiple times and sometimes adds so much unnecessary detail that it confuses things. In at least one user's trial, the GPT-4o coding answers were "just garbage."

Does ChatGPT 4o have a limit? ›

For ChatGPT Plus users the limit is 40 prompts per three hours.

How good is ChatGPT 4o at coding? ›

For example, ChatGPT's ability to produce functional code for “easy” coding problems dropped from 89 percent to 52 percent after 2021. And its ability to generate functional code for “hard” problems dropped from 40 percent to 0.66 percent after this time as well.

What is the limit of GPT-4o? ›

Tier 4 rate limits

Model	RPM (requests per minute)	TPM (tokens per minute)
gpt-4o	10,000	2,000,000
gpt-4o-mini	10,000	10,000,000
gpt-4-turbo	10,000	800,000
gpt-4	10,000	300,000

How to bypass ChatGPT-4 limit? ›

The method includes using certain phrases to tell ChatGPT to swap to DAN mode, which lets it skip the usual restrictions. To unlock DAN and access ChatGPT without restrictions, simply tell ChatGPT to “DAN.” This sentence is a key that lets you have an open conversation with ChatGPT with no restrictions.

What are the limitations of free version of ChatGPT? ›

Free tier users can use GPT-4o only a limited number of times within a three-hour window. We'll notify you once you've reached the limit and invite you to continue your conversation using GPT-4o mini or to upgrade to ChatGPT Plus. Additionally, Free tier users can only create up to two images per day with DALL·E 3.

How accurate is ChatGPT-4? ›

The GPT-4 achieved a primary diagnostic accuracy of 38.3%, which improved to 71.6% when differential diagnoses were included.

Is ChatGPT 4o good for research? ›

Response Time: ChatGPT 4o is optimized for faster responses, making it suitable for quick edits and rapid research, whereas ChatGPT 4.0 may take longer but provides more detailed feedback.

How accurate is ChatGPT? ›

Moreover, in a recent study, ChatGPT also managed to score enough to pass the United States Medical Licensing Examination (USMLE) achieving >50% accuracy in all exams and more than 60% accuracy in most analyses [12].

Is ChatGPT reliable for information? ›

Conclusions: The ChatGPT platform offers accurate and scientifically backed answers to inquiries about third-molar surgical extraction, making it a dependable and easy-to-use resource for both patients and the general public.