multimodal
Multimodal Learning
Num
Title
Field
Desc
Author
Time
read
2022
BLIP: Bootstrapping Language-Image Pre-training
Vision-language pre-training
Introduced by Li et al.
2022
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Bootstrapping language-image pre-training using frozen image encoders and large language models
2023