Follow
Dongxu Li
Dongxu Li
Salesforce AI Research
Verified email at salesforce.com - Homepage
Title
Cited by
Cited by
Year
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
4272*2023
Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models
J Li, D Li, S Savarese, S Hoi
International conference on machine learning, 19730-19742, 2023
42192023
Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation
J Li, D Li, C Xiong, S Hoi
International conference on machine learning, 12888-12900, 2022
38032022
Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison
D Li, C Rodriguez, X Yu, H Li
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2020
5662020
cosFormer: Rethinking Softmax in Attention
Z Qin, W Sun, H Deng, D Li, Y Wei, B Lv, J Yan, L Kong, Y Zhong
International Conference on Learning Representations, 2022
2262022
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
D Li, J Li, H Li, JC Niebles, SCH Hoi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
2172021
Blip-diffusion: Pre-trained subject representation for controllable text-to-image generation and editing
D Li, J Li, S Hoi
Advances in Neural Information Processing Systems 36, 2024
2142024
From images to textual prompts: Zero-shot visual question answering with frozen large language models
J Guo, J Li, D Li, AMH Tiong, B Li, D Tao, S Hoi
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
165*2023
Tspnet: Hierarchical feature learning via temporal semantic pyramid for sign language translation
D Li, C Xu, X Yu, K Zhang, B Swift, H Suominen, H Li
Advances in Neural Information Processing Systems 33, 12034-12045, 2020
1372020
Transferring cross-domain knowledge for video sign language recognition
D Li, X Yu, C Xu, L Petersson, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
1372020
LAVIS: A One-stop Library for Language-Vision Intelligence
D Li, J Li, H Le, G Wang, S Savarese, SCH Hoi
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
117*2023
Dual attention-in-attention model for joint rain streak and raindrop removal
K Zhang, D Li, W Luo, W Ren
IEEE Transactions on Image Processing 30, 7608-7619, 2021
842021
Arvo: Learning all-range volumetric correspondence for video deblurring
D Li, C Xu, K Zhang, X Yu, Y Zhong, W Ren, H Suominen, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
742021
Enhanced spatio-temporal interaction learning for video deraining: A faster and better framework
K Zhang, D Li, W Luo, W Ren, W Liu
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022
582022
The devil in linear transformer
Z Qin, X Han, W Sun, D Li, L Kong, N Barnes, Y Zhong
arXiv preprint arXiv:2210.10340, 2022
53*2022
Benchmarking Ultra-High-Definition Image Super-resolution
K Zhang, D Li, W Luo, W Ren, B Stenger, W Liu, H Li, MH Yang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
342021
X-instructblip: A framework for aligning x-modal instruction-aware representations to llms and emergent cross-modal reasoning
A Panagopoulou, L Xue, N Yu, J Li, D Li, S Joty, R Xu, S Savarese, ...
arXiv preprint arXiv:2311.18799, 2023
282023
Toeplitz Neural Network for Sequence Modeling
Z Qin, X Han, W Sun, B He, D Li, D Li, Y Dai, L Kong, Y Zhong
The Eleventh International Conference on Learning Representations, 2023
272023
Reachability analysis of nonlinear systems using hybridization and dynamics scaling
D Li, S Bak, S Bogomolov
International Conference on Formal Modeling and Analysis of Timed Systems …, 2020
252020
Moonshot: Towards controllable video generation and editing with multimodal conditions
DJ Zhang, D Li, H Le, MZ Shou, C Xiong, D Sahoo
arXiv preprint arXiv:2401.01827, 2024
232024
The system can't perform the operation now. Try again later.
Articles 1–20