


Boyi Li 0001
Person information
- affiliation: NVIDIA, Autonomous Vehicle Research Group, Santa Clara, CA, USA
- affiliation: University of California, Berkeley, CA, USA
- affiliation (PhD): Cornell University, Ithaca, NY, USA
2020 – today
- 2026
[i39]Yi Gu, Yan Wang, Yuxiao Chen, Yurong You, Wenjie Luo, Yue Wang, Wenhao Ding, Boyi Li, Heng Yang, Boris Ivanovic, Marco Pavone:
Accelerating Structured Chain-of-Thought in Autonomous Vehicles. CoRR abs/2602.02864 (2026)
- 2025
[j4]Boyi Li, Philipp Wu, Pieter Abbeel, Jitendra Malik:
Interactive Task Planning with Language Models. Trans. Mach. Learn. Res. 2025 (2025)
[j3]Boyi Li, Ligeng Zhu, Ran Tian, Shuhan Tan, Yuxiao Chen, Yao Lu, Yin Cui, Sushant Veer, Max Ehrlich, Jonah Philion, Xinshuo Weng, Fuzhao Xue, Linxi Fan, Yuke Zhu, Jan Kautz, Andrew Tao, Ming-Yu Liu, Sanja Fidler, Boris Ivanovic, Trevor Darrell, Jitendra Malik, Song Han, Marco Pavone:
Wolf: Dense Video Captioning with a World Summarization Framework. Trans. Mach. Learn. Res. 2025 (2025)
[c24]Yusuke Hirota, Boyi Li, Ryo Hachiuma, Yueh-Hua Wu, Boris Ivanovic, Marco Pavone, Yejin Choi, Yu-Chiang Frank Wang, Yuta Nakashima, Chao-Han Huck Yang:
LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences. ACL (6) 2025: 295-309
[c23]Baifeng Shi, Boyi Li, Han Cai, Yao Lu, Sifei Liu, Marco Pavone, Jan Kautz, Song Han, Trevor Darrell, Pavlo Molchanov, Hongxu Yin:
Scaling Vision Pre-Training to 4K Resolution. CVPR 2025: 9631-9640
[c22]Jiawei Yang, Jiahui Huang, Boris Ivanovic, Yuxiao Chen, Yan Wang, Boyi Li, Yurong You, Apoorva Sharma, Maximilian Igl, Péter Karkus, Danfei Xu, Yue Wang, Marco Pavone:
STORM: Spatio-TempOral Reconstruction Model For Large-Scale Outdoor Scenes. ICLR 2025
[c21]Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone:
Language-Image Models with 3D Understanding. ICLR 2025
[c20]Wei Chow, Jiageng Mao, Boyi Li, Daniel Seita, Vitor Campagnolo Guizilini, Yue Wang:
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding. ICLR 2025
[c19]Ziqi Lu, Heng Yang, Danfei Xu, Boyi Li, Boris Ivanovic, Marco Pavone, Yue Wang:
LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation models. ICLR 2025
[c18]Jiageng Mao, Boyi Li, Boris Ivanovic, Yuxiao Chen, Yan Wang, Yurong You, Chaowei Xiao, Danfei Xu, Marco Pavone, Yue Wang:
DreamDrive: Generative 4D Scene Modeling from Street View Images. ICRA 2025: 367-374
[i38]Jiageng Mao, Boyi Li, Boris Ivanovic, Yuxiao Chen, Yan Wang, Yurong You, Chaowei Xiao, Danfei Xu, Marco Pavone, Yue Wang:
DreamDrive: Generative 4D Scene Modeling from Street View Images. CoRR abs/2501.00601 (2025)
[i37]Jiawei Yang, Jiahui Huang, Yuxiao Chen, Yan Wang, Boyi Li, Yurong You, Apoorva Sharma, Maximilian Igl, Péter Karkus, Danfei Xu, Boris Ivanovic, Yue Wang, Marco Pavone:
STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes. CoRR abs/2501.00602 (2025)
[i36]Wei Chow, Jiageng Mao, Boyi Li, Daniel Seita, Vitor Guizilini, Yue Wang:
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding. CoRR abs/2501.16411 (2025)
[i35]Kumar Krishna Agrawal, Long Lian, Longchao Liu, Natalia Harguindeguy, Boyi Li, Alexander Bick, Maggie Chung, Trevor Darrell, Adam Yala:
Atlas: Multi-Scale Attention Improves Long Context Image Modeling. CoRR abs/2503.12355 (2025)
[i34]Baifeng Shi, Boyi Li, Han Cai, Yao Lu, Sifei Liu, Marco Pavone, Jan Kautz, Song Han, Trevor Darrell, Pavlo Molchanov, Hongxu Yin:
Scaling Vision Pre-Training to 4K Resolution. CoRR abs/2503.19903 (2025)
[i33]Long Lian, Yifan Ding, Yunhao Ge, Sifei Liu, Hanzi Mao, Boyi Li, Marco Pavone, Ming-Yu Liu, Trevor Darrell, Adam Yala, Yin Cui:
Describe Anything: Detailed Localized Image and Video Captioning. CoRR abs/2504.16072 (2025)
[i32]Renhao Wang, Haoran Geng, Tingle Li, Feishi Wang, Gopala Anumanchipalli, Philipp Wu, Trevor Darrell, Boyi Li, Pieter Abbeel, Jitendra Malik, Alexei A. Efros:
MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real. CoRR abs/2507.02864 (2025)
[i31]Yusuke Hirota, Boyi Li, Ryo Hachiuma, Yueh-Hua Wu, Boris Ivanovic, Yuta Nakashima, Marco Pavone, Yejin Choi, Yu-Chiang Frank Wang, Chao-Han Huck Yang:
LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences. CoRR abs/2507.19362 (2025)
[i30]Yusuke Hirota, Ryo Hachiuma, Boyi Li, Ximing Lu, Michael Ross Boone, Boris Ivanovic, Yejin Choi, Marco Pavone, Yu-Chiang Wang, Noa Garcia, Yuta Nakashima, Chao-Han Huck Yang:
Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation. CoRR abs/2509.07596 (2025)
[i29]Jay Patrikar, Apoorva Sharma, Sushant Veer, Boyi Li, Sebastian A. Scherer, Marco Pavone:
The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving. CoRR abs/2509.18626 (2025)
[i28]Yulu Gan, Ligeng Zhu, Dandan Shan, Baifeng Shi, Hongxu Yin, Boris Ivanovic, Song Han, Trevor Darrell, Jitendra Malik, Marco Pavone, Boyi Li:
FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos. CoRR abs/2512.10927 (2025)
[i27]Jiawei Yang, Ziyu Chen, Yurong You, Yan Wang, Yiming Li, Yuxiao Chen, Boyi Li, Boris Ivanovic, Marco Pavone, Yue Wang:
Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving. CoRR abs/2512.10947 (2025)
[i26]Zhenghao "Mark" Peng, Wenhao Ding, Yurong You, Yuxiao Chen, Wenjie Luo, Thomas Tian, Yulong Cao, Apoorva Sharma, Danfei Xu, Boris Ivanovic, Boyi Li, Bolei Zhou, Yan Wang, Marco Pavone:
Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning. CoRR abs/2512.24426 (2025)
- 2024
[j2]Long Lian, Boyi Li, Adam Yala, Trevor Darrell:
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models. Trans. Mach. Learn. Res. 2024 (2024)
[c17]Ran Tian, Boyi Li, Xinshuo Weng, Yuxiao Chen, Edward Schmerling, Yue Wang, Boris Ivanovic, Marco Pavone:
Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving. CoRL 2024: 3656-3673
[c16]Shuhan Tan, Boris Ivanovic, Yuxiao Chen, Boyi Li, Xinshuo Weng, Yulong Cao, Philipp Krähenbühl, Marco Pavone:
Promptable Closed-loop Traffic Simulation. CoRL 2024: 5087-5105
[c15]Tsung-Han Wu, Long Lian, Joseph E. Gonzalez, Boyi Li, Trevor Darrell:
Self-Correcting LLM-Controlled Diffusion Models. CVPR 2024: 6327-6336
[c14]Boyi Li, Yue Wang, Jiageng Mao, Boris Ivanovic, Sushant Veer, Karen Leung, Marco Pavone:
Driving Everywhere with Large Language Model Policy Adaptation. CVPR 2024: 14948-14957
[c13]Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li:
LLM-grounded Video Diffusion Models. ICLR 2024
[c12]Jiawei Yang, Boris Ivanovic, Or Litany, Xinshuo Weng, Seung Wook Kim, Boyi Li, Tong Che, Danfei Xu, Sanja Fidler, Marco Pavone, Yue Wang:
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision. ICLR 2024
[c11]Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge J. Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein:
Re-evaluating the Need for Visual Signals in Unsupervised Grammar Induction. NAACL-HLT (Findings) 2024: 1113-1123
[c10]Xiangyu Chen, Zhenzhen Liu, Katie Luo, Siddhartha Datta, Adhitya Polavaram, Yan Wang, Yurong You, Boyi Li, Marco Pavone, Wei-Lun Chao, Mark E. Campbell, Bharath Hariharan, Kilian Q. Weinberger:
DiffuBox: Refining 3D Object Detection with Point Diffusion. NeurIPS 2024
[i25]Boyi Li, Jathushan Rajasegaran, Yossi Gandelsman, Alexei A. Efros, Jitendra Malik:
Synthesizing Moving People with 3D Control. CoRR abs/2401.10889 (2024)
[i24]Boyi Li, Yue Wang, Jiageng Mao, Boris Ivanovic, Sushant Veer, Karen Leung, Marco Pavone:
Driving Everywhere with Large Language Model Policy Adaptation. CoRR abs/2402.05932 (2024)
[i23]Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone:
Language-Image Models with 3D Understanding. CoRR abs/2405.03685 (2024)
[i22]Xiangyu Chen, Zhenzhen Liu, Katie Z. Luo, Siddhartha Datta, Adhitya Polavaram, Yan Wang, Yurong You, Boyi Li, Marco Pavone, Wei-Lun Chao, Mark E. Campbell, Bharath Hariharan, Kilian Q. Weinberger:
DiffuBox: Refining 3D Object Detection with Point Diffusion. CoRR abs/2405.16034 (2024)
[i21]Ran Tian, Boyi Li, Xinshuo Weng, Yuxiao Chen, Edward Schmerling, Yue Wang, Boris Ivanovic, Marco Pavone:
Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving. CoRR abs/2407.00959 (2024)
[i20]Boyi Li, Ligeng Zhu, Ran Tian, Shuhan Tan, Yuxiao Chen, Yao Lu, Yin Cui, Sushant Veer, Max Ehrlich, Jonah Philion, Xinshuo Weng, Fuzhao Xue, Andrew Tao, Ming-Yu Liu, Sanja Fidler, Boris Ivanovic, Trevor Darrell, Jitendra Malik, Song Han, Marco Pavone:
Wolf: Captioning Everything with a World Summarization Framework. CoRR abs/2407.18908 (2024)
[i19]Shuhan Tan, Boris Ivanovic, Yuxiao Chen, Boyi Li, Xinshuo Weng, Yulong Cao, Philipp Krähenbühl, Marco Pavone:
Promptable Closed-loop Traffic Simulation. CoRR abs/2409.05863 (2024)
[i18]Xiangyu Han, Zhen Jia, Boyi Li, Yan Wang, Boris Ivanovic, Yurong You, Lingjie Liu, Yue Wang, Marco Pavone, Chen Feng, Yiming Li:
Extrapolated Urban View Synthesis Benchmark. CoRR abs/2412.05256 (2024)
[i17]Ziqi Lu, Heng Yang, Danfei Xu, Boyi Li, Boris Ivanovic, Marco Pavone, Yue Wang:
LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models. CoRR abs/2412.07746 (2024)
- 2023
[c9]Jiaxin Ge, Sanjay Subramanian, Trevor Darrell, Boyi Li:
From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation. EMNLP 2023: 1173-1185
[i16]Long Lian, Boyi Li, Adam Yala, Trevor Darrell:
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models. CoRR abs/2305.13655 (2023)
[i15]Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li:
LLM-grounded Video Diffusion Models. CoRR abs/2309.17444 (2023)
[i14]Boyi Li, Philipp Wu, Pieter Abbeel, Jitendra Malik:
Interactive Task Planning with Language Models. CoRR abs/2310.10645 (2023)
[i13]Jiawei Yang, Boris Ivanovic, Or Litany, Xinshuo Weng, Seung Wook Kim, Boyi Li, Tong Che, Danfei Xu, Sanja Fidler, Marco Pavone, Yue Wang:
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision. CoRR abs/2311.02077 (2023)
[i12]Jiaxin Ge, Sanjay Subramanian, Trevor Darrell, Boyi Li:
From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation. CoRR abs/2311.12391 (2023)
[i11]Tsung-Han Wu, Long Lian, Joseph E. Gonzalez, Boyi Li, Trevor Darrell:
Self-correcting LLM-controlled Diffusion Models. CoRR abs/2311.16090 (2023)
- 2022
[c8]Boyi Li, Serge J. Belongie, Ser-Nam Lim, Abe Davis:
Neural Image Recolorization for Creative Domains. CVPR Workshops 2022: 2225-2229
[c7]Boyi Li, Yin Cui, Tsung-Yi Lin, Serge J. Belongie:
SITTA: Single Image Texture Translation for Data Augmentation. ECCV Workshops (2) 2022: 3-20
[c6]Varsha Kishore, Xiangyu Chen, Yan Wang, Boyi Li, Kilian Q. Weinberger:
Fixed Neural Network Steganography: Train the images, not the network. ICLR 2022
[c5]Boyi Li, Kilian Q. Weinberger, Serge J. Belongie, Vladlen Koltun, René Ranftl:
Language-driven Semantic Segmentation. ICLR 2022
[i10]Boyi Li, Kilian Q. Weinberger, Serge J. Belongie, Vladlen Koltun, René Ranftl:
Language-driven Semantic Segmentation. CoRR abs/2201.03546 (2022)
[i9]Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge J. Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein:
Does unsupervised grammar induction need pixels? CoRR abs/2212.10564 (2022)
- 2021
[c4]Boyi Li, Felix Wu, Ser-Nam Lim, Serge J. Belongie, Kilian Q. Weinberger:
On Feature Normalization and Data Augmentation. CVPR 2021: 12383-12392
[i8]Boyi Li, Yin Cui, Tsung-Yi Lin, Serge J. Belongie:
Single Image Texture Translation for Data Augmentation. CoRR abs/2106.13804 (2021)
- 2020
[i7]Boyi Li, Felix Wu, Ser-Nam Lim, Serge J. Belongie, Kilian Q. Weinberger:
On Feature Normalization and Data Augmentation. CoRR abs/2002.11102 (2020)
2010 – 2019
- 2019
[j1]Boyi Li, Wenqi Ren, Dengpan Fu, Dacheng Tao, Dan Feng, Wenjun Zeng, Zhangyang Wang:
Benchmarking Single-Image Dehazing and Beyond. IEEE Trans. Image Process. 28(1): 492-505 (2019)
[c3]Boyi Li, Felix Wu, Kilian Q. Weinberger, Serge J. Belongie:
Positional Normalization. NeurIPS 2019: 1620-1632
[i6]Felix Wu, Boyi Li, Lequn Wang, Ni Lao, John Blitzer, Kilian Q. Weinberger:
FastFusionNet: New State-of-the-Art for DAWNBench SQuAD. CoRR abs/1902.11291 (2019)
[i5]Boyi Li, Felix Wu, Kilian Q. Weinberger, Serge J. Belongie:
Positional Normalization. CoRR abs/1907.04312 (2019)
[i4]Felix Wu, Boyi Li, Lequn Wang, Ni Lao, John Blitzer, Kilian Q. Weinberger:
Integrated Triaging for Fast Reading Comprehension. CoRR abs/1909.13128 (2019)
- 2018
[c2]Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng:
End-to-End United Video Dehazing and Detection. AAAI 2018: 7016-7023
- 2017
[c1]Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng:
AOD-Net: All-in-One Dehazing Network. ICCV 2017: 4780-4788
[i3]Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng:
An All-in-One Network for Dehazing and Beyond. CoRR abs/1707.06543 (2017)
[i2]Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng:
End-to-End United Video Dehazing and Detection. CoRR abs/1709.03919 (2017)
[i1]Boyi Li, Wenqi Ren, Dengpan Fu, Dacheng Tao, Dan Feng, Wenjun Zeng, Zhangyang Wang:
RESIDE: A Benchmark for Single Image Dehazing. CoRR abs/1712.04143 (2017)
last updated on 2026-03-20 23:54 CET by the dblp team
all metadata released as open data under CC0 1.0 license







