From Set Transformer to Perceiver Sampler
4 min read
CAT
October 8, 2024
Mengliu Zhao On multi-modal LLM Flamingo’s vision encoder Designing Multi-modal LLM is hard. The state-of-the-art multi-modal LLMs are primarily...