AIModels.fyi

AIModels.fyi

Share this post

AIModels.fyi
AIModels.fyi
Apple is working on multimodal AI. Here's what they've uncovered so far.

Apple is working on multimodal AI. Here's what they've uncovered so far.

Apple researchers reveal scaling laws and training methods for multimodal AI success.

aimodels-fyi's avatar
aimodels-fyi
Mar 16, 2024
∙ Paid
1

Share this post

AIModels.fyi
AIModels.fyi
Apple is working on multimodal AI. Here's what they've uncovered so far.
2
Share
“Fig. 1: MM1 can perform in-context predictions thanks to its large-scale multimodal pre-training. This allows MM1 to (a) count objects and follow custom formatting, (b) refer to parts of the images and perform OCR, (c) demonstrate common-sense and word knowledge about everyday objects, and (d) perform basic math functions. Images are from the COCO 2014…

Keep reading with a 7-day free trial

Subscribe to AIModels.fyi to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 AIModels.fyi
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share