ShareGPT4V: Improving Large Multi-modal Models with Better Captions
2024; Springer Science+Business Media; Linguagem: Inglês
10.1007/978-3-031-72643-9_22
ISSN1611-3349
AutoresLin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin,
Tópico(s)Video Analysis and Summarization
Referência(s)