Ferret

Refer and ground anything anywhere at any granularity

Ferret media 1
Ferret media 2
Ferret media 3

Description

A new type of multimodal large language model (MLLM) from Apple that excels in both image understanding and language processing, particularly demonstrating significant advantages in understanding spatial references.

Recommended Products