How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites
2024; Springer Nature; Volume: 67; Issue: 12 Linguagem: Inglês
10.1007/s11432-024-4231-5
ISSN1674-733X
AutoresZhe Chen, Weiyun Wang, Hao Tian, Sheng‐Long Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang,
Tópico(s)Natural Language Processing Techniques
Referência(s)