


default search action
Fuxiao Liu
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c13]Ming Li, Pei Chen, Chenguang Wang, Hongyu Zhao, Yijun Liang, Yupeng Hou, Fuxiao Liu, Tianyi Zhou:
Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning. ACL (Findings) 2025: 25287-25318
[c12]Zongxia Li, Xiyang Wu, Hongyang Du, Fuxiao Liu, Huy Nghiem, Guangyao Shi:
A Survey of State of the Art Large Vision Language Models: Benchmark Evaluations and Challenges. CVPR Workshops 2025: 1587-1606
[c11]Min Shi, Fuxiao Liu, Shihao Wang, Shijia Liao, Subhashree Radhakrishnan, Yilin Zhao, De-An Huang, Hongxu Yin, Karan Sapra, Yaser Yacoob, Humphrey Shi, Bryan Catanzaro, Andrew Tao, Jan Kautz, Zhiding Yu, Guilin Liu:
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders. ICLR 2025
[c10]Xiyang Wu, Souradip Chakraborty, Ruiqi Xian, Jing Liang, Tianrui Guan, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi:
On the Vulnerability of LLM/VLM-Controlled Robotics. IROS 2025: 1914-1921
[c9]Xiaoyu Liu, Paiheng Xu, Junda Wu, Jiaxin Yuan, Yifan Yang, Yuhang Zhou, Fuxiao Liu, Tianrui Guan, Haoliang Wang, Tong Yu, Julian J. McAuley, Wei Ai, Furong Huang:
Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey. NAACL (Findings) 2025: 7668-7684
[i18]Ming-Chang Chiu, Fuxiao Liu, Karan Sapra, Andrew Tao, Yaser Jacoob, Xuezhe Ma, Zhiding Yu, Guilin Liu:
AIDE: Agentically Improve Visual Language Model with Domain Experts. CoRR abs/2502.09051 (2025)
[i17]Yijun Liang, Ming Li, Chenrui Fan, Ziyue Li, Dang Nguyen, Kwesi Cobbina, Shweta Bhardwaj, Jiuhai Chen, Fuxiao Liu, Tianyi Zhou
:
ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness. CoRR abs/2504.10514 (2025)
[i16]Zongxia Li, Wenhao Yu, Chengsong Huang, Rui Liu, Zhenwen Liang, Fuxiao Liu, Jingxi Che, Dian Yu, Jordan L. Boyd-Graber, Haitao Mi, Dong Yu:
Self-Rewarding Vision-Language Model via Reasoning Decomposition. CoRR abs/2508.19652 (2025)
[i15]Jingxi Chen, Zongxia Li, Zhichao Liu, Guangyao Shi, Xiyang Wu, Fuxiao Liu, Cornelia Fermüller, Brandon Y. Feng, Yiannis Aloimonos:
First Frame Is the Place to Go for Video Content Customization. CoRR abs/2511.15700 (2025)- 2024
[c8]Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Fuxiao Liu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang:
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences. ACL (1) 2024: 416-442
[c7]Tianrui Guan, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou
:
Hallusionbench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models. CVPR 2024: 14375-14385
[c6]Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang:
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning. ICLR 2024
[c5]Xijun Wang, Ruiqi Xian, Tianrui Guan, Fuxiao Liu, Dinesh Manocha:
SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition. IROS 2024: 10967-10974
[c4]Hao Fei
, Xiangtai Li
, Haotian Liu
, Fuxiao Liu
, Zhuosheng Zhang
, Hanwang Zhang
, Shuicheng Yan
:
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond. ACM Multimedia 2024: 11289-11291
[c3]Fuxiao Liu, Xiaoyang Wang, Wenlin Yao, Jianshu Chen, Kaiqiang Song, Sangwoo Cho, Yaser Yacoob, Dong Yu:
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning. NAACL-HLT 2024: 1287-1310
[i14]Xiyang Wu, Ruiqi Xian, Tianrui Guan, Jing Liang, Souradip Chakraborty, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi:
On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities. CoRR abs/2402.10340 (2024)
[i13]Xiaoyu Liu, Paiheng Xu, Junda Wu, Jiaxin Yuan, Yifan Yang, Yuhang Zhou, Fuxiao Liu, Tianrui Guan, Haoliang Wang, Tong Yu, Julian J. McAuley, Wei Ai, Furong Huang:
Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey. CoRR abs/2403.09606 (2024)
[i12]Ming Li, Pei Chen, Chenguang Wang, Hongyu Zhao, Yijun Liang, Yupeng Hou, Fuxiao Liu, Tianyi Zhou
:
Mosaic IT: Enhancing Instruction Tuning with Data Mosaics. CoRR abs/2405.13326 (2024)
[i11]Min Shi, Fuxiao Liu, Shihao Wang, Shijia Liao, Subhashree Radhakrishnan, De-An Huang, Hongxu Yin, Karan Sapra, Yaser Yacoob, Humphrey Shi, Bryan Catanzaro, Andrew Tao, Jan Kautz, Zhiding Yu, Guilin Liu:
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders. CoRR abs/2408.15998 (2024)
[i10]Condy Bao, Fuxiao Liu:
DeepFM-Crispr: Prediction of CRISPR On-Target Effects via Deep Learning. CoRR abs/2409.05938 (2024)
[i9]Xijun Wang, Pedro Sandoval Segura, Chengyuan Zhang, Junyun Huang, Tianrui Guan, Ruiqi Xian, Fuxiao Liu, Rohan Chandra, Boqing Gong, Dinesh Manocha:
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments. CoRR abs/2412.20042 (2024)- 2023
[c2]Fuxiao Liu, Yaser Yacoob, Abhinav Shrivastava:
COVID-VTS: Fact Extraction and Verification on Short Video Platforms. EACL 2023: 178-188
[i8]Fuxiao Liu, Yaser Yacoob, Abhinav Shrivastava:
COVID-VTS: Fact Extraction and Verification on Short Video Platforms. CoRR abs/2302.07919 (2023)
[i7]Fuxiao Liu, Hao Tan, Chris Tensmeyer:
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents. CoRR abs/2306.06306 (2023)
[i6]Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang:
Aligning Large Multi-Modal Model with Robust Instruction Tuning. CoRR abs/2306.14565 (2023)
[i5]Zongxia Li, Paiheng Xu, Fuxiao Liu, Hyemi Song:
Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps. CoRR abs/2307.05052 (2023)
[i4]Fuxiao Liu:
Driving Policy Prediction based on Deep Learning Models. CoRR abs/2307.11058 (2023)
[i3]Fuxiao Liu, Tianrui Guan, Zongxia Li, Lichang Chen, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou
:
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models. CoRR abs/2310.14566 (2023)
[i2]Fuxiao Liu, Xiaoyang Wang, Wenlin Yao, Jianshu Chen, Kaiqiang Song, Sangwoo Cho, Yaser Yacoob, Dong Yu:
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning. CoRR abs/2311.10774 (2023)- 2021
[c1]Fuxiao Liu, Yinghan Wang
, Tianlu Wang, Vicente Ordonez
:
Visual News: Benchmark and Challenges in News Image Captioning. EMNLP (1) 2021: 6761-6771- 2020
[i1]Fuxiao Liu, Yinghan Wang, Tianlu Wang, Vicente Ordonez:
VisualNews : Benchmark and Challenges in Entity-aware Image Captioning. CoRR abs/2010.03743 (2020)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-05 23:45 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







