


default search action
Haoji Zhang 0001
Person information
- unicode name: 张颢继
- affiliation: Tsinghua University, Shenzhen International Graduate School, China
Other persons with the same name
- Haoji Zhang 0002 — Shanghai Maritime University, Merchant Marine College, China
- Haoji Zhang 0003
— Shaanxi University of Science and Technology, College of Electronic Information and Artificial Intelligence, Xi'an, China - Haoji Zhang 0004 — University of Electronic Science and Technology of China, School of Computer Science and Engineering, Chengdu, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[i14]Wenxun Dai, Zhiyuan Zhao, Yule Zhong, Yiji Cheng, Jianwei Zhang, Linqing Wang, Shiyi Zhang, Yunlong Lin, Runze He, Fellix Song, Wayne Zhuang, Yong Liu, Haoji Zhang, Yansong Tang, Qinglin Lu, Chunyu Wang:
ChatUMM: Robust Context Tracking for Conversational Interleaved Generation. CoRR abs/2602.06442 (2026)- 2025
[j2]Yulin Wang
, Haoji Zhang
, Yang Yue
, Shiji Song
, Chao Deng, Junlan Feng, Gao Huang
:
Uni-AdaFocus: Spatial-Temporal Dynamic Computation for Video Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 47(3): 1782-1799 (2025)
[j1]Sule Bai
, Yong Liu
, Yifei Han, Haoji Zhang
, Yansong Tang
, Jie Zhou
, Jiwen Lu
:
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation. IEEE Trans. Image Process. 34: 8271-8284 (2025)
[c2]Yiqin Wang, Haoji Zhang, Jingqi Tian, Yansong Tang:
Ponder & Press: Advancing Visual GUI Agent towards General Computer Control. ACL (Findings) 2025: 1461-1473
[i13]Sule Bai, Mingxing Li, Yong Liu, Jing Tang, Haoji Zhang, Lei Sun, Xiangxiang Chu, Yansong Tang:
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning. CoRR abs/2505.14231 (2025)
[i12]Haoji Zhang, Yiqin Wang, Yansong Tang, Yong Liu, Jiashi Feng, Xiaojie Jin:
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams. CoRR abs/2506.23825 (2025)
[i11]Haoji Zhang, Xin Gu, Jiawen Li, Chixiang Ma, Sule Bai, Chubin Zhang, Bowen Zhang, Zhichao Zhou, Dongliang He, Yansong Tang:
Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning. CoRR abs/2508.04416 (2025)
[i10]Vidi Team, Celong Liu, Chia-Wen Kuo, Chuang Huang, Dawei Du, Fan Chen, Guang Chen, Haoji Zhang, Haojun Zhao, Lingxi Zhang, Lu Guo, Lusha Li, Longyin Wen, Qihang Fan, Qingyu Chen, Rachel Deng, Sijie Zhu, Stuart Siew, Tong Jin, Weiyan Tao, Wen Zhong, Xiaohui Shen, Xin Gu, Zhenfang Chen, Zuhua Lin:
Vidi2: Large Multimodal Models for Video Understanding and Creation. CoRR abs/2511.19529 (2025)
[i9]Xin Gu, Haoji Zhang, Qihang Fan, Jingxuan Niu, Zhipeng Zhang, Libo Zhang, Guang Chen, Fan Chen, Longyin Wen, Sijie Zhu:
Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning. CoRR abs/2511.21375 (2025)
[i8]Yuji Wang, Wenlong Liu, Jingxuan Niu, Haoji Zhang, Yansong Tang:
VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning. CoRR abs/2512.06373 (2025)
[i7]Jingqi Tian, Yiheng Du, Haoji Zhang, Yuji Wang, Isaac Ning Lee, Xulong Bai, Tianrui Zhu, Jingxuan Niu, Yansong Tang:
DDAVS: Disentangled Audio Semantics and Delayed Bidirectional Alignment for Audio-Visual Segmentation. CoRR abs/2512.20117 (2025)- 2024
[i6]Haoji Zhang, Yiqin Wang, Yansong Tang, Yong Liu, Jiashi Feng, Jifeng Dai, Xiaojie Jin:
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams. CoRR abs/2406.08085 (2024)
[i5]Yiqin Wang, Haoji Zhang, Yansong Tang, Yong Liu, Jiashi Feng, Jifeng Dai, Xiaojie Jin:
Hierarchical Memory for Long Video QA. CoRR abs/2407.00603 (2024)
[i4]Sule Bai, Yong Liu, Yifei Han, Haoji Zhang, Yansong Tang:
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation. CoRR abs/2411.15869 (2024)
[i3]Yiqin Wang, Haoji Zhang, Jingqi Tian, Yansong Tang:
Ponder & Press: Advancing Visual GUI Agent towards General Computer Control. CoRR abs/2412.01268 (2024)
[i2]Yulin Wang, Haoji Zhang, Yang Yue, Shiji Song, Chao Deng, Junlan Feng, Gao Huang:
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition. CoRR abs/2412.11228 (2024)- 2023
[c1]Jianhui Li, Jianmin Li, Haoji Zhang
, Shilong Liu
, Zhengyi Wang, Zihao Xiao, Kaiwen Zheng, Jun Zhu:
PREIM3D: 3D Consistent Precise Image Attribute Editing from a Single Image. CVPR 2023: 8549-8558
[i1]Jianhui Li, Jianmin Li, Haoji Zhang, Shilong Liu
, Zhengyi Wang, Zihao Xiao, Kaiwen Zheng, Jun Zhu:
PREIM3D: 3D Consistent Precise Image Attribute Editing from a Single Image. CoRR abs/2304.10263 (2023)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-24 01:25 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







