InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao ... Advances in Neural Information Processing Systems (NeurIPS), 2023 | 4272* | 2023 |
Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models J Li, D Li, S Savarese, S Hoi International conference on machine learning, 19730-19742, 2023 | 4219 | 2023 |
Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation J Li, D Li, C Xiong, S Hoi International conference on machine learning, 12888-12900, 2022 | 3803 | 2022 |
Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison D Li, C Rodriguez, X Yu, H Li Proceedings of the IEEE/CVF winter conference on applications of computer …, 2020 | 566 | 2020 |
cosFormer: Rethinking Softmax in Attention Z Qin, W Sun, H Deng, D Li, Y Wei, B Lv, J Yan, L Kong, Y Zhong International Conference on Learning Representations, 2022 | 226 | 2022 |
Align and Prompt: Video-and-Language Pre-training with Entity Prompts D Li, J Li, H Li, JC Niebles, SCH Hoi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 217 | 2021 |
Blip-diffusion: Pre-trained subject representation for controllable text-to-image generation and editing D Li, J Li, S Hoi Advances in Neural Information Processing Systems 36, 2024 | 214 | 2024 |
From images to textual prompts: Zero-shot visual question answering with frozen large language models J Guo, J Li, D Li, AMH Tiong, B Li, D Tao, S Hoi Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 165* | 2023 |
Tspnet: Hierarchical feature learning via temporal semantic pyramid for sign language translation D Li, C Xu, X Yu, K Zhang, B Swift, H Suominen, H Li Advances in Neural Information Processing Systems 33, 12034-12045, 2020 | 137 | 2020 |
Transferring cross-domain knowledge for video sign language recognition D Li, X Yu, C Xu, L Petersson, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 137 | 2020 |
LAVIS: A One-stop Library for Language-Vision Intelligence D Li, J Li, H Le, G Wang, S Savarese, SCH Hoi Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 117* | 2023 |
Dual attention-in-attention model for joint rain streak and raindrop removal K Zhang, D Li, W Luo, W Ren IEEE Transactions on Image Processing 30, 7608-7619, 2021 | 84 | 2021 |
Arvo: Learning all-range volumetric correspondence for video deblurring D Li, C Xu, K Zhang, X Yu, Y Zhong, W Ren, H Suominen, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 74 | 2021 |
Enhanced spatio-temporal interaction learning for video deraining: A faster and better framework K Zhang, D Li, W Luo, W Ren, W Liu IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022 | 58 | 2022 |
The devil in linear transformer Z Qin, X Han, W Sun, D Li, L Kong, N Barnes, Y Zhong arXiv preprint arXiv:2210.10340, 2022 | 53* | 2022 |
Benchmarking Ultra-High-Definition Image Super-resolution K Zhang, D Li, W Luo, W Ren, B Stenger, W Liu, H Li, MH Yang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 34 | 2021 |
X-instructblip: A framework for aligning x-modal instruction-aware representations to llms and emergent cross-modal reasoning A Panagopoulou, L Xue, N Yu, J Li, D Li, S Joty, R Xu, S Savarese, ... arXiv preprint arXiv:2311.18799, 2023 | 28 | 2023 |
Toeplitz Neural Network for Sequence Modeling Z Qin, X Han, W Sun, B He, D Li, D Li, Y Dai, L Kong, Y Zhong The Eleventh International Conference on Learning Representations, 2023 | 27 | 2023 |
Reachability analysis of nonlinear systems using hybridization and dynamics scaling D Li, S Bak, S Bogomolov International Conference on Formal Modeling and Analysis of Timed Systems …, 2020 | 25 | 2020 |
Moonshot: Towards controllable video generation and editing with multimodal conditions DJ Zhang, D Li, H Le, MZ Shou, C Xiong, D Sahoo arXiv preprint arXiv:2401.01827, 2024 | 23 | 2024 |