Apple’s MM1: New Multimodal AI Models Unveiled

Mar 18, 2024


Apple is working on multimodal large language models. A research paper describing MM1, a family of multimodal AI models, was recently published on arXiv. The paper describes the models as state-of-the-art and reports that they perform competitively on various multimodal benchmarks after supervised fine-tuning.

Although the research paper doesn’t mention the Cupertino-based tech giant by name, most of its authors are closely associated with Apple’s machine learning division, so the model is widely believed to be Apple’s.

About Apple’s MM1

Apple’s MM1 is a family of multimodal AI models with up to 30 billion parameters that can accept both images and text as input.

The MM1 research found that the choice of image encoder, the number of image tokens, and the image resolution have a large impact on the model’s performance, whereas the design of the vision-language connector matters comparatively little. “We show that the image encoder, together with image resolution and the image token count, has substantial impact, while the vision-language connector design is of comparatively negligible importance,” the researchers wrote.
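To see why image resolution and image token count are linked, here is a minimal Python sketch. It assumes a ViT-style image encoder that splits a square image into fixed-size patches and emits one token per patch. The 14-pixel patch size and the function name `patch_token_count` are illustrative assumptions, not details from the paper as reported here, and a connector may further reduce the token count before the language model sees it.

```python
def patch_token_count(resolution: int, patch_size: int = 14) -> int:
    """Return how many patch tokens a ViT-style encoder emits for a square image.

    Assumption: the image is resolution x resolution pixels and is cut into
    non-overlapping patch_size x patch_size patches, one token per patch.
    """
    if resolution <= 0 or patch_size <= 0:
        raise ValueError("resolution and patch_size must be positive")
    if resolution % patch_size != 0:
        raise ValueError("resolution must be a multiple of patch_size")
    patches_per_side = resolution // patch_size
    return patches_per_side * patches_per_side


if __name__ == "__main__":
    # Doubling the resolution quadruples the number of image tokens.
    for resolution in (224, 336, 448):
        print(resolution, patch_token_count(resolution))
```

Under these assumptions, going from a 224-pixel to a 448-pixel image raises the token count from 256 to 1,024, which is why resolution and token budget have to be chosen together.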

The model is still at the pre-training stage, and it remains to be seen whether it will produce the desired outputs.

Apple’s AI Strategy

Apple is also rumored to be working on an LLM framework known as “Ajax” and a chatbot called “Apple GPT”. According to a report by Bloomberg, Apple is spending around $1 billion per year on AI.

Apple recently acquired Darwin AI, a Canada-based startup that is expected to help the company strengthen its on-device AI. Apple’s CEO, Tim Cook, also said during the Q1 earnings call that Apple is going to “break new ground” in GenAI this year.

Now, according to a new report by Bloomberg’s Mark Gurman, Apple is in talks with Google about a partnership to bring Gemini’s AI-powered features to iPhones.

The Worldwide Developers Conference (WWDC), which will be held in June, is the next major event for Apple, where we can expect it to announce iOS 18 and showcase the major developments it has made in AI.


MM1 is a big leap for Apple in GenAI. Although the company is late to the AI race, it is now trying to compete with tech giants like Microsoft, Google, and OpenAI. With MM1, Apple appears ready to go all in on artificial intelligence, and the models could help it bring AI to iPhones and other products.
