


default search action
IEEE Transactions on Circuits and Systems for Video Technology, Volume 36
Volume 36, Number 1, January 2026
- Shan Liu

:
Message From Editor-in-Chief. 4 - Jun Liu

, Wei Ke
, Hao Sheng
:
Robust RGB-T Tracking via Multi-Feature Response Adaptive Fusion and Dynamic Selection Recovery. 5-21 - Huan Li

, Xinpeng Huang
, Yilei Chen
, Chao Yang
, Mounir Kaaniche
, Qiuwen Zhang
, Ping An
:
Low-Bitrate Light Field Video Compression Through Key Sequences Encoding and Joint Reconstruction Network. 22-36 - Qiangqiang Shen

, Hanzhang Wang
, Yin-Ping Zhao
, Yongyong Chen
, Yongsheng Liang
, Xuelong Li
:
Dual Tensor Low-Rank Representation for Subspace Clustering. 37-50 - Lve Huang, Xiaowei Yu

, Huabiao Yan, Libo Huang
, Zhulin An
, Yongjun Xu
:
AF-YOLO: Asymptotic Feature Extraction and Fusion for Aerial Object Detection. 63-78 - Longyang Tang

, Bo Zhang
, Hui Lv
, Rui Xu
, Xudong Tian
, Junsheng Zhou, Yi Chen
:
DilatedTAD: Enhancing Adaptability to Actions of Varying Durations for Temporal Action Detection. 79-92 - Wei Dong, Guodong Fan

, Fan Zhang, Min Gan, Guang-Yong Chen
, C. L. Philip Chen
:
SAFAformer: Scale-Aware Frequency-Adaptive Guidance for Nighttime Flare Removal. 93-105 - Hao Li

, Wei Wang
, Cong Wang
, Mengzhu Wang
, Xiang Zhang
, Long Lan
, Xinwang Liu
, Kenli Li
, Xiaochun Cao
:
Phrase Grounding-Based Style Transfer for Single-Domain Generalized Object Detection. 106-118 - Xiao Ke

, Wenyao Chen
:
SFCE-Det: Sub-Feature Fusion and Cross-Layer Perceptual Enhancement Detector. 119-132 - Haibo Chen

, Zhiwen Zuo
, Lei Zhao
, Jun Li
, Jian Yang
:
ConceptCraft: One-Shot Personalized Text-to-Image Generation via Object-Background Disentanglement. 133-146 - Mingfu Xiong

, Longlong Ge
, Ruimin Hu
, Khan Muhammad
, Sambit Bakshi
, Javier Del Ser
, Xiaokang Yang
, Bin Sheng
:
HPRNet: Human Parsing Reconstruction With Non-Local Multi-Scale Perception Network for Cloth-Changing Person Re-Identification. 147-160 - Yubin Wu

, Xiaojie Li
, Hao Chen, Changcai Yang
, Lifang Wei
, Riqing Chen
:
MatchMamba: Correspondence Pruning via Selective State Space Model. 161-174 - Zhen Zhang

, Qing Zhao, Xiuhe Li, Cheng Wang, Guoqiang Zhu, Yu Zhang
, Yining Huo, Hongyi Yu, Yi Zhang
:
CA-YOLO: Cross Attention Empowered YOLO for Biomimetic Localization. 175-189 - Jitao Ma

, Weiying Xie
, Ye Shi, Xueshuang Xiang
, Yunsong Li
, Leyuan Fang
:
BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection. 190-204 - Yi-Feng Zhang

, Canlong Zhang
, Junwei Tian
, Haifei Ma
, Zhixin Li
, Zhi-Wen Wang
:
CMAG: Cross-Modal Attention and Graph-Enhanced Memory for Unsupervised Visible-Infrared Person Re-Identification. 205-218 - Gui Gao

, Gang Yang
, Yajun Wang
, Libo Yao
, Xi Zhang
, Gaosheng Li
:
Oriented Decoupling Target Detection Method for SAR Image Based on Multi-Channel Localization and Soft Thresholding. 219-232 - Yimin Fu

, Runqing Yang
, Zhunga Liu
, Michael K. Ng
:
Adaptive Mixture-of-Experts Distillation for Cross-Satellite Generalizable Incremental Remote Sensing Scene Classification. 233-247 - Jianli Zhao

, Tian-Heng Zhang
, Sheng Fang
, Jian-Feng Gao
, Jin-Yu Wang
, Maoguo Gong
:
Spatial-Spectral Texture-Preserved Total Variation: A Novel Regularization for Hyperspectral Image Denoising. 248-260 - Ruoxi Zhu

, Shusong Xu
, Peiye Liu, Jiaming Liu
, Yanheng Lu, Dimin Niu
, Hongzhong Zheng
, Yen-Kuang Chen
, Ming-e Jing
, Yibo Fan
:
A Flexible Zero-Shot Approach to Tone Mapping via Structure-Preserving Diffusion Models. 261-277 - Xiao Wang

, Chao Wang, Shiao Wang
, Xixi Wang
, Zhicheng Zhao
, Lin Zhu
, Bo Jiang
:
MambaEVT: Event Stream-Based Visual Object Tracking Using State Space Model. 278-291 - Haotian Liu

, Guo Yu, Hu Cao
, Sanqing Qu, Fan Lu
, Yan Zhong
, Zhichao Lu
, Luziwei Leng
, Guang Chen
:
I2EKD: Efficient and Versatile Image-to-Event Knowledge Distillation. 292-303 - Yunxiao Qin

, Yuanhao Xiong, Jinfeng Yi
, Lihong Cao
, Cho-Jui Hsieh
:
Generalized Transferable Attack Across Datasets. 304-319 - Pinxue Guo

, Lingyi Hong
, Xinyu Zhou, Shuyong Gao
, Wanyun Li
, Jinglun Li, Zhaoyu Chen, Xiaoqiang Li
, Wei Zhang
, Wenqiang Zhang
:
ClickVOS: Click Video Object Segmentation. 320-334 - Hai Liu

, Shuang Zeng
, Liqian Deng, Tingting Liu, Xionghua Liu, Zhaoli Zhang
, You-Fu Li
:
HPCTrans: Heterogeneous Plumage Cues-Aware Texton Correlation Representation for FBIC via Transformers. 335-349 - Chuang-Wei Liu

, Mingjian Sun
, Cairong Zhao
, Hanli Wang
, Alexander V. Dvorkovich
, Rui Fan
:
Integrating Disparity Confidence Estimation Into Relative Depth Prior-Guided Unsupervised Stereo Matching. 350-362 - Kuo-Liang Chung

, Te-Wei Hou:
Error Compensation-Based Fusion Algorithm for Drone-Image Color Correction. 363-378 - Junjie Liang

, Xia Dong
, Penglei Wang
, Jin Xu
, Danyang Wu
, Feiping Nie
:
Multi-View Graph Clustering via Dual View-Cluster-Order Interactivity Mining. 379-392 - Yuang Xiao

, Chang Tang
, Xiao Zheng
, Weiqing Yan
, Yuanyuan Liu
, Xinwang Liu
:
Mutual Calibration Network for Multi-View Clustering. 393-405 - Xianmin Chen

, Longfei Han
, Peiliang Huang
, Xiaoxu Feng
, Dingwen Zhang
, Junwei Han
:
Retinex-RAWMamba: Bridging Demosaicing and Denoising for Low-Light RAW Image Enhancement. 406-420 - Long Zhuang

, Yiqing Yao
, Nuo Li
:
RC-ROSNet: Fusing 3D Radar Range-Angle Heat Maps and Camera Images for Radar Object Segmentation. 421-434 - Shanshan Wang

, Xiaozheng Shen
, Xun Yang
, Ke Xu
, Xingyi Zhang
:
Feature Responsive LoRA: Toward Parameter-Efficient Transfer Learning for Self-Supervised Visual Models. 435-448 - Wen-Jie Zheng

, Xi-Le Zhao
, Yu-Bang Zheng
, Teng-Yu Ji
, Ben-Zheng Li
:
Dynamic Low-Rank Tensor Decomposition for Video Applications. 449-462 - Ganlin Yang

, Kaidong Zhang
, Jingjing Fu
, Dong Liu
:
Drim-NeRF: Diffusion-Based Restoration for Improving Neural Radiance Fields. 463-476 - Jianing Zhang, Yuchao Zheng, Ziwei Li

, Qionghai Dai
, Xiaoyun Yuan
:
GBR: Generative Bundle Refinement for High-Fidelity Gaussian Splatting With Enhanced Mesh Reconstruction. 477-490 - Dan Song

, Sizhe Li
, Yue Zhang, Weizhi Nie
, Chao Xue, An-An Liu
:
RefreshReg: Receptive Field Reshaping and Multi-Layer Consistency Filtering for Point Cloud Registration. 491-504 - Chengzhi Ma

, Kunqian Li
, Shuaixin Liu
, Han Mei
:
Depth-Assisted Network for Indiscernible Marine Object Counting With Adaptive Motion-Differentiated Feature Encoding. 505-520 - Chunqiang Yu

, Xianquan Zhang
, Ching-Nung Yang
, Xinpeng Zhang
, Zhenjun Tang
:
Reversible Data Hiding in Shared Images Using Overlapped Coefficients in Polynomials. 521-536 - Lin Yang

, Dawen Xu
, Jiangbo Qian
, Rangding Wang
, Songhan He
:
IPM Priority-Preserving Adaptive Steganography for HEVC. 537-550 - Yucheng Zhu

, Guangtao Zhai
, Xiongkuo Min
, Yunhao Li
, Long Teng
, Huiyu Duan
, Liang Yuan
, Xiaokang Yang
:
Future Fixation Sequence Prediction for Audio-Visual 360° Videos. 551-565 - Jian Jin

, Fanxin Xia, Feng Ding, Xinfeng Zhang
, Meiqin Liu
, Yao Zhao
, Weisi Lin
, Lili Meng
:
Customizable ROI-Based Deep Image Compression. 566-578 - Mengkun Liu

, Licheng Jiao
, Xu Liu
, Lingling Li
, Fang Liu
, Shuyuan Yang
, Shuang Wang
, Biao Hou
:
KCI-Net: Knowledge-Based Contourlet Inference Network for Super-Resolution. 579-595 - Peiran Peng

, Tingfa Xu
, Liqiang Song, Mengqi Zhu, Yuqiang Fang
, Jianan Li
:
COXNet: Cross-Layer Fusion With Adaptive Alignment and Scale Integration for RGBT Tiny Object Detection. 596-608 - Yipo Huang

, Zhichao Duan
, Pengfei Chen
, Li Cai, Leida Li
, Weisi Lin
:
Learning Scene-Invariant Distribution for Generalizable Blind Image Quality Assessment. 609-621 - Yun Zhang

, Shisheng Zhang
, Na Li
, Chunling Fan, Raouf Hamzaoui
:
VP-JND: Visual Perception Assisted Deep Picture-Wise Just Noticeable Difference Prediction Model for Image Compression. 622-636 - Nian Wang

, Zhigao Cui
, Yanzhao Su
, Yunwei Lan
, Yuanliang Xue
, Cong Zhang
, Aihua Li:
Weakly Supervised Image Dehazing via Physics-Based Decomposition. 637-652 - Qimin Yang

, Kan Ren
, Qian Chen
:
AMSFusion: An Adaptive Multi-Scale Infrared and Visible Image Fusion Network Based on Attention Mechanisms. 653-668 - Chenyang Shi

, Shasha Guo
, Boyi Wei, Hanxiao Liu
, Yibo Zhang
, Ningfang Song
, Jing Jin
:
A Label-Free and Non-Monotonic Metric for Evaluating Denoising in Event Cameras. 669-684 - Le Thi Hue Dao

, An Gia Vien
, Jooyoung Lee
, Seyoon Jeong
, Naeun Yang, Chul Lee
:
Gradient-Guided Diffusion-Based Restoration of Extremely Compressed Backgrounds for Video Coding for Machines. 685-701 - Jun-Xiu Li

, Hong Liu
, Xiao Wu
, Yu-Pei Song
, Zhenhua Zeng, Shin'ichi Satoh
:
Rethinking Crowd Localization Evaluation via Optimal Transportation Cost. 702-716 - Junsong Leng

, Zeyu Zhao, Chang Tian, Zhong Chen
, Guoyou Wang
, Xiaoxuan Liu
:
Multi-Modal Few-Shot Semantic Segmentation Based on Triple Attention Mechanism and Hierarchical Decoding Transformer. 717-731 - Tongxin Liu

, Xiyu Pang
, Gangwu Jiang, Xiushan Nie
, Meifeng Zheng
, Yilong Yin
:
Internal-External Context Interaction Network for Person Re-Identification. 732-746 - Zhiying Song

, Kaixuan Chen
, Pengfei Wang
, Mingli Song
, Nenggan Zheng
:
Unsupervised Action Segmentation via Multi-Scale Temporal-Interaction Enhancement. 747-762 - Fangying Xiong, Zhaoquan Yuan

, Xiao Wu
, Changsheng Xu
:
Class-Specific Knowledge-Guided Multimodal Prompt Tuning for Few-Shot Class-Incremental Learning. 763-776 - Fulin Luo

, Xi Chen
, Chuan Fu
, Tan Guo
, Bo Du
:
HDiff-HIR: Hierarchically Conditional Diffusion Model for Hyperspectral Image Reconstruction. 777-791 - Lizhi Wang

, Feng Zhou, Bo Yu, Pu Cao, Jianqin Yin
:
OMEGAS: Object Mesh Extraction From Large Scenes Guided by Gaussian Segmentation. 792-802 - Siyuan Wang

, Yuejie Lu
, Qiang Ling
:
OCC-Exoskeleton: A Plug-and-Play Module to Enhance CNN-Based Occupancy Prediction Networks. 803-816 - Haoxin Yang

, Weihong Chen
, Xuemiao Xu
, Cheng Xu
, Peng Xiao
, Cuifeng Sun, Shaoyu Huang, Shengfeng He
:
StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion. 817-831 - Peng Zhang

, Songru Yang, Jinsheng Sun
, Weiqing Li
, Zhiyong Su
:
Open-World Point Cloud Semantic Segmentation: A Human-in-the-Loop Framework. 832-845 - Shaoqing Xu

, Fang Li
, Peixiang Huang
, Ziying Song
, Zhi-Xin Yang
:
TiGDistill-BEV: Multi-View BEV 3D Object Detection via Target Inner-Geometry Learning Distillation. 846-860 - Ji Gan

, Lei Chen
, Ping Hu, Jiaxu Leng
, Weisheng Li
, Xinbo Gao
:
HandJoKe: Joint-Guided Keypoint Denoising Transformer for Depth-Based 3D Hand Pose Estimation. 861-873 - Linlin Ge

, Tianyu Song, Lei Wang, Jieqing Feng
:
Boundary-Aware Consistent Normal Orientation for Point Clouds. 874-886 - Qijun Gan

, Zijie Zhou, Jianke Zhu
:
XHand: Real-Time Expressive Hand Avatar. 887-899 - Yihao Wang

, Meng Yang
, Rui Cao
, Guangwei Gao
:
AEA-FIRM: Adaptive Elastic Alignment With Fine-Grained Representation Mining for Text-Based Aerial Pedestrian Retrieval. 900-912 - Zongtao He

, Liuyi Wang
, Lu Chen
, Chengju Liu
, Qijun Chen
:
NavComposer: Composing Language Instructions for Navigation Trajectories Through Action-Scene-Object Modularization. 913-929 - Ling Tong

, Kun Qian
, Zhaokun Yue
, Shan Luo
:
Can Vision Feel Touch? Tactile-Aware Visual Grasping for Transparent Objects. 930-944 - Rongjun Ge

, Ruiyi Li, Chong Wang, Yuxin Liu
, Heng Zhu
, Jean-Louis Coatrieux, Daoqiang Zhang
, Jian Lu, Yang Chen
, Shuo Li
, Yuting He:
Adaptation Follow Human Attention: Gaze-Assisted Medical Segment Anything Model. 945-958 - Maoxian Wan

, Kaige Li
, Qichuan Geng
, Binyi Su
, Zhong Zhou
:
Out-of-Distribution Semantic Segmentation With Disentangled and Calibrated Representation. 971-985 - Basit Alawode

, Sajid Javed
:
Unsupervised Background Subtraction Using Generator-Discriminator Learning. 986-1002 - Chenglong Shao, Tongzhen Si

, Xiaohui Yang
, Hui Yuan
:
Dependability Feature Learning Based on Sample Generation for Unsupervised Text-to-Image Person Re-Identification. 1003-1014 - Yuedong Tan

, Wenfang Sun
, Jingyuan Li
, Shuwei Hou
, Xiaobo Li
, Zhe Wang
, Beibei Song
:
HyperTrack: A Unified Network for Hyperspectral Video Object Tracking. 1015-1028 - Bin Xue

, Yuwei Cheng
, Kun Ding
, Chunhong Pan
, Shiming Xiang
:
USVTrack: A Benchmark for Multi-Object Tracking in Complex Water Surface Scenes. 1029-1044 - Xiaochen Wang

, Dehui Kong
, Jinghua Li, Jing Wang
, Baocai Yin
:
HAhb-KG: Hierarchical Augmented Knowledge Graph for Human Behavior Assisting Cross-Modal Learning Action Detection. 1045-1060 - Muyu Li

, Henan Hu
, Yingfeng Wang
, Sen Qiu
, Xudong Zhao
:
Hierarchical Topology Meets Temporal Occupancy: A Comprehensive Model for Multi-Person Pose Tracking. 1061-1074 - Mingrui Zhu

, Jianhang Chen, Xin Wei
, Nannan Wang
, Xinbo Gao
:
Fine-Detailed Facial Sketch-to-Photo Synthesis With Detail-Enhanced Codebook Priors. 1075-1088 - Shanzhi Yin

, Bolin Chen
, Shiqi Wang
, Yan Ye
:
Generative Human Video Compression With Multi-Granularity Temporal Trajectory Factorization. 1089-1103 - Xihua Sheng, Peilin Chen

, Shiqi Wang
, Dapeng Oliver Wu
:
DRFC: An End-to-End Deep Dynamic RF Signal Compression Framework. 1104-1116 - Zeming Zhao

, Xiaohai He
, Shuhua Xiong, Meng Wang, Shiqi Wang
:
Spatial-Temporal Correlation Information-Based Rate Control for Versatile Video Coding. 1117-1129 - Wei Wei

, Chenxu Zhao, Shuyi Zhao
, Lei Zhang
, Yanning Zhang
:
Hyperspectral Image Compression With Spectral-Spatial Coupling and Group-Wise Context Modeling. 1130-1142 - Xin Li

, Shaohui Li
, Wenrui Dai
, Han Li, Nuowen Kan
, Chenglin Li
, Junni Zou
, Hongkai Xiong
:
Point Cloud Attribute Compression With Geometry-Aware Lifting-Based Multiscale Networks. 1143-1159 - Feng Xing

, Yingwen Zhang
, Meng Wang, Hengyu Man
, Yongbing Zhang
, Shiqi Wang
, Xiaopeng Fan
, Wen Gao:
Mining Temporal Priors for Template-Generated Video Compression. 1160-1172 - Weiling Chen

, Weiming Lin
, Qianxue Feng, Rongxin Zhang
, Tiesong Zhao
:
Pixel-Level Just Noticeable Difference in Sonar Images: Modeling and Applications. 1173-1184 - Nuowen Kan

, Chenglin Li
, Yuankun Jiang
, Wenrui Dai
, Junni Zou
, Hongkai Xiong
, Laura Toni
:
MERINA+: Improving Generalization for Neural Video Adaptation via Information-Theoretic Meta-Reinforcement Learning. 1185-1202 - Kaifeng Gao

, Siqi Chen, Hanwang Zhang
, Jun Xiao
, Yueting Zhuang
, Qianru Sun
:
Generalized Visual Relation Detection With Diffusion Models. 1203-1215 - Hongxi Li, Yubo Zhu, Zirui Shang

, Ziyi Wang, Xinxiao Wu
:
A Comprehensive Survey on Video Summarization: Challenges and Advances. 1216-1233 - Hengchang Wang, Li Liu

, Huaxiang Zhang
, Lei Zhu
, Xiaojun Chang
, Hao Du
:
VisualRAG: Knowledge-Guided Retrieval Augmentation for Image-Text Matching. 1234-1248 - Guowei Dai

, Duwei Dai
, Chaoyu Wang
, Qingfeng Tang
, Matthew Hamilton
, Hu Chen
, Yi Zhang
:
Multi-Task Learning Network for Medical Image Analysis Guided by Lesion Regions and Spatial Relationships of Tissues. 1249-1264 - Likun Gao

, Xinhui Xue
, Haowen Zheng:
Sparse Hyperspectral Band Selection Based on Expectation Maximization. 1265-1278 - Shiqiang Zheng

, Changsheng Chen
, Shen Chen, Taiping Yao
, Shouhong Ding, Bin Li
, Jiwu Huang
:
Generalized Document Tampering Localization via Color and Semantic Disentanglement. 1279-1292 - Meihong Yang, Ziyi Feng

, Bin Ma
, Jian Xu
, Yongjin Xian
, Linna Zhou:
An End-to-End Framework for Joint Makeup Style Transfer and Image Steganography. 1293-1308 - Shiben Liu

, Huijie Fan
, Qiang Wang
, Weihong Ren
, Yandong Tang
, Yang Cong
:
Domain Consistency Representation Learning for Lifelong Person Re-Identification. 51-62 - Hu Xue

, Hao Zhu
, Zhidan Ran
, Xianlun Tang
, Guanqiu Qi
, Zhiqin Zhu
, Sin-Chi Kuok
, Henry Leung
:
Feature Fusion and Enhancement for Lightweight Visible-Thermal Infrared Tracking via Multiple Adapters. 959-970
Volume 36, Number 2, February 2026
- Xiaoyu Chen

, Wanru Xu
, Shichao Kan
, Linna Zhang, Yi Jin
, Yigang Cen
, Yidong Li
:
Vision-Semantics-Label: A New Two-Step Paradigm for Action Recognition With Large Language Model. 1313-1327 - Jinliang Liu

, Jianwei Zhang
, Sen Yang
, Jinxi Xiang, Xiyue Wang
, Jieqiong Zhao
, Zongxin Yang
, Junhan Zhao
:
Toward General-Purpose Video Reconstruction Through Synergy of Grid-Splicing Diffusion and Large Language Models. 1328-1340 - Xuyang Liu

, Ting Liu
, Siteng Huang
, Yi Xin
, Yue Hu
, Long Qin
, Donglin Wang
, Yuanyuan Wu
, Honggang Chen
:
M2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension. 1341-1354 - Yunlong Tang

, Jing Bi
, Siting Xu, Luchuan Song
, Susan Liang
, Teng Wang
, Daoan Zhang
, Jie An
, Jingyang Lin
, Rongyi Zhu, Ali Vosoughi
, Chao Huang
, Zeliang Zhang
, Pinxin Liu, Mingqian Feng, Feng Zheng
, Jianguo Zhang
, Ping Luo
, Jiebo Luo
, Chenliang Xu
:
Video Understanding With Large Language Models: A Survey. 1355-1376 - Zhichao Chen

, Jie Yang
, Fan Li
, Zhicheng Feng
, Lifang Chen, Limin Jia
, Pan Li
:
Foreign Object Detection Method for Railway Catenary Based on a Scarce Image Generation Model and Lightweight Perception Architecture. 1377-1391 - Xin Wang

, Zirui Pan, Hong Chen
, Wenwu Zhu
:
DiViCo: Disentangled Visual Token Compression for Efficient Large Vision-Language Model. 1392-1405 - Ao Li

, Huijun Liu
, Yiqing Zhu, Yongxin Ge
:
Efficient Pre-Trained Semantics Refinement for Video Temporal Grounding. 1406-1418 - Cansu Korkmaz

, A. Murat Tekalp
, Zafer Dogan
:
Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models. 1419-1432 - Yujia Wang

, Qingyun Deng, Wei Liang:
Audio-Visual LLM for Augmenting Accessibility of 360° Video. 1433-1445 - Siyuan Wang

, Jiawei Liu
, Wei Wang
, Yeying Jin, Jinsong Du, Zhi Han
:
MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation. 1446-1459 - Wenpei Fan

, Yaonan Wang
, Licheng Liu
, Jiayi Zeng, Min Liu
:
Beijing Institute of TechnologyCMANet: A TCN-RMamba-Attention Network for Surgical Phase Online Recognition. 1460-1472 - Yu Wang

, Zhenfeng Shao
, Xiaolong Zuo
, Tao Lu
, Jiaming Wang
, Yuankun Wang
, Siyuan Wang
, Zhizheng Zhang, Xiaojin Zhao:
NSBRNet: Non-Local Spatio-Temporal Bidirectional Recurrent Network for Satellite Video Super-Resolution. 1473-1486 - Yusi Zhang

, Weiying Xie
, Tianlin Hui, Daixun Li
, Jiaqing Zhang
, Jie Lei
, Yunsong Li
, Leyuan Fang
:
LoME: LoRA-Driven Multimodal Extractor for RGB-X Vision Tasks. 1487-1500 - Zhihao Ying

, Jie Guo
, Yunsong Li
, Yu'e Gao, Chenyu Li:
Diff-Transformer: Heterogeneous Feature Fusion Network for Multisource Remote Sensing Classification. 1501-1516 - Jian Sun, Junlang Huang, Xinyu Jiang

, Yimin Zhou
, Chi-Man Vong
:
CGSI: Context-Guided and UAV's Status Informed Multimodal Framework for Generalizable Cross-View Geo-Localization. 1517-1530 - Shuo Zhang

, Yanlin Xie
, Jiaxin Chen, Youfang Lin
:
Decoupling and Aggregating: Dual-Layer Light Field Depth Estimation With Reflective and Transparent Surfaces. 1531-1543 - Deyang Liu

, Shizheng Li
, Yifan Mao
, Xiaofei Zhou
, Zeyu Xiao
, Caifeng Shan
:
Learning Implicit and Detail-Enhanced Network for Light Field Image Spatial-Angular Super-Resolution. 1544-1557 - Lizhu Liu

, Yaonan Wang
, Yurong Chen
, Hui Zhang
:
DDIP: Mutual-Regularized Dual Deep Image Prior for Self-Supervised Compressive Spectral Imaging. 1558-1570 - Zhiqiang Kou

, Haoyuan Xuan
, Jingyu Zhu, Hailin Wang
, Ming-Kun Xie
, Changwei Wang
, Jing Wang
, Yuheng Jia
, Xin Geng
:
Tail-Aware Reconstruction of Incomplete Label Distributions With Low-Rank and Sparse Modeling. 1571-1586 - Yuan Zhou

, Richang Hong
, Yanrong Guo
, Lin Liu
, Shijie Hao
, Hanwang Zhang
:
Controllable Relation Disentanglement for Few-Shot Class-Incremental Learning. 1587-1600 - Jingqi Song

, Zhiqiang He
, Yipeng Ning
, Xiaoming Xi
, Jie Guo
, Guanzhong Chen, Xiushan Nie, Lishan Qiao
, Yilong Yin
:
Prior Distribution Guided Gaussian Mixture Variational Autoencoder (PDGM-VAE) for Image Generation. 1601-1613 - Qingyang Zhou

, Yunfan Ye, Zhihuang Liu
, Chang Liu, Zhiping Cai
:
Non-Local Guided Neural Fields for 4D CT Reconstruction. 1614-1626 - Qing Tian

, Junyu Shen, Lulu Kang, Weihua Ou, Jun Wan
, Zhen Lei
:
Progressive Curriculum Learning With Teacher-Student Collaboration for Source-Free Unsupervised Domain Adaptation. 1627-1639 - Bin Dong

, Zicong Zhu
, Qianqian Bu
, Mengya Wu
, Jingen Ni
:
A Two-Stage Method With Lightweight Network and Active Contour Model for Remote Sensing Image Segmentation. 1640-1654 - Yuanliang Xue

, Guodong Jin
, Bineng Zhong
, Tao Shen
, Lining Tan
, Chaocan Xue
, Yaozong Zheng
:
FMTrack: Frequency-Aware Interaction and Multi-Expert Fusion for RGB-T Tracking. 1655-1667 - Jia Zhang

, Bo Peng
, Xi Wu
:
CDGR: Cross-Modal Dual Graph Reasoning for Weakly Supervised Semantic Segmentation. 1668-1680 - Jia Wang

, Xinfeng Zhang
, Gai Zhang
, Jun Zhu, Lv Tang
, Li Zhang
:
UAR-NVC: A Unified Autoregressive Framework for Memory-Efficient Neural Video Compression. 1681-1695 - Xiaorui Zhang

, Rui Jiang, Wei Sun
, Sunil Kr. Jha
:
FPGP: Increasing Robustness of Flow-Based Watermarking to Unknown Noise Through Feature Preservation and Gradient Perturbation. 1696-1715 - Cheng Liu

, Zheng Wang
, Xinyu Yan
, Meijun Sun
, Qinghua Hu
:
Visible-Infrared Camouflaged Object Detection. 1716-1728 - Yongxia Zhang

, Qiang Guo
, Caiming Zhang
:
Unsupervised, Untrained, and Robust Single Image Superpixel Segmentation Network. 1729-1741 - Zhen Yang

, Yanpeng Dong, Jiayu Wang, Heng Wang, Lichao Ma, Zijian Cui, Qi Liu, Haoran Pei, Kexin Zhang
, Chao Zhang:
DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction. 1742-1753 - Jiafeng Liang

, Shixin Jiang, Wei Tang, Ning Wang
, Zekun Wang, Xun Mao
, Kai Lv, Ming Liu, Bing Qin
:
APSam: An Aggregating-Then-Pruning Sampler for Question-Conditional Denoising. 1754-1765 - Hongyang Gu

, Xiaogang Yang
, Ruitao Lu
, Lei Pu
, Siming Han, Ming Wu:
Discovering Multi-Frequency Embedding for Visible-Infrared Person Re-Identification. 1766-1780 - Zeru Shi

, Zengxi Zhang
, Kemeng Cui
, Ruizhe An
, Jinyuan Liu
, Zhiying Jiang
:
SeFENet: Robust Deep Homography Estimation via Semantic-Driven Feature Enhancement. 1781-1794 - Shenghao Chen

, Chunjie Ma
, Yibo Zhao
, Meng Liu
, Yanbing Xue, Zan Gao
:
A Novel Multi-View Perception and Shrinkage Aggregation Network for Inharmonious Region Localization. 1795-1809 - Zheng Wang

, Shihao Xu, Wei Shi:
TrajSV: A Trajectory-Based Model for Sports Video Representations and Applications. 1810-1822 - Jiayi Li

, Jun Kong
, Yunde Zhang
, Ming Lu
, Min Jiang
:
SPCL: Semantic Polymorphism and Commonality Learning for Text-Based Person Retrieval. 1823-1836 - Xixia Xu

, Qi Zou
, Jiamao Li
:
Hierarchical Contrastive Consistency for Human Pose Estimation in Images and Videos. 1837-1847 - Weibo Zhang, Hao Wang

, Peng Ren
, Weidong Zhang
:
Underwater Scene Clarity Reconstruction via Multilayer Information Fusion and Self-Organized Stitching. 1848-1861 - Xi Yang

, Haoyuan Shi
, Zihan Wang, Nannan Wang
, Xinbo Gao
:
CSHNet: A Novel Information Asymmetric Image Translation Method. 1862-1875 - Junjie Zhang

, Feng Zhao
, Hanqiang Liu
, Jun Yu
:
Generative Information-Guided Heterogeneous Cross-Fusion Network With Contrastive Learning for Multimodal Remote Sensing Image Classification. 1876-1892 - Boang Li

, Hui Cao
, Badong Chen
, Tao Wang, Jie Zhang
:
EveryBrain: Generate EEG Responses From Images for Specified Individuals. 1893-1906 - Yan Xiang

, Kaiqi Zhao
, Zhenghong Yu
, Xiaochen Yuan
, Guoheng Huang
, Jinyu Tian
, Jianqing Li
:
DFFormer: Capturing Dynamic Frequency Features to Locate Image Manipulation Through Adaptive Frequency Transformer and Prototype Learning. 1907-1919 - Hao Liu

, Fengyong Li
, Chuan Qin
, Xinpeng Zhang
:
Fearless of Noise: Robust Image-in-Image Hiding Using Dual-Tree Complex Wavelet Transform and State Space Model. 1920-1934 - Ye Zhu

, Chang Ti, Gang Yan
, Yingchun Guo
, Bin Li
:
ALL-IN-ONE: Divide-and-Conquer Strategy for Multi-Manipulation Image Classification and Localization. 1935-1947 - Xintao Duan

, Sen Li
, Zhao Wang, Bingxin Wei, Haewoon Nam
, Chuan Qin
:
EctFormer: High-Imperceptibility Deep Image Steganography Based on Empirical Mode Decomposition. 1948-1961 - Jianqiao Sun

, Ziheng Cheng
, Bo Chen
, Xin Yuan
, Chunhui Qu, Hongwei Liu
:
MePAT: Meta-Prior Aided Transformer for Adverse Weather Condition Restoration. 1962-1976 - Zongyan Zhang

, C. L. Philip Chen
, Zepeng Su, Tong Zhang
:
Prompts Libra: Enhanced Image Outpainting Diffusion Model With Balanced Bimodal Guidance. 1977-1992 - Lihong Qiao

, Jinbo Li, Rongxuan Wang
, Yucheng Shu
, Weisheng Li
, Zhanchuan Cai
, Xinbo Gao
:
Parallel Trajectory Constraint Sampling for Solving Universal Medical Inverse Problems. 1993-2005 - Weidong Zhang

, Muzi Wang, Peixian Zhuang
, Dahai Liu
:
Underwater Image Enhancement via Advantage Feature Weighted Fusion. 2006-2018 - Xinrui Ju

, Yang Zou
, Xingyuan Li
, Zirui Wang
, Jun Ma
, Zhiying Jiang
, Jinyuan Liu
:
Illumination Refinement via Textual Cues: A Prompt-Driven Approach for Low-Light NeRF Enhancement. 2019-2032 - Shilong Wang

, Wenqi Ren
, Peng Gao
, Jiguo Yu
, Jianlei Liu
:
ZRID-Net: Zero-Reference Real-World Image Dehazing Framework via Deep Self-Decoupling and Reverse Knowledge Transfer. 2033-2051 - Liyan Wang

, Cong Wang
, Jinshan Pan
, Xiaofeng Liu
, Weixiang Zhou
, Xiaoran Sun, Wei Wang
, Zhixun Su
:
Ultra-High-Definition Image Restoration: New Benchmarks and a Dual Interaction Prior-Driven Solution. 2052-2068 - Yihan Yu

, Liquan Shen
, Junjie Zhu
, Zhengyong Wang
:
Ensemble Strategy for Underwater Image Quality Assessment and Dataset Construction. 2069-2082 - Chongye Guo

, Li Li
, Yanli Ren
, Xinpeng Zhang
, Guorui Feng
:
Aligning Normal Representations in Diffusion Model for Video Anomaly Detection. 2083-2094 - Ge Cao

, Qing Tang
, Xuan-Thuy Vo
, Adri Priadana
, Kang-Hyun Jo
:
Optimal Proxy Mining Contrastive Network for Unsupervised Person Re-Identification. 2095-2109 - Dan Song

, Juan Zhou
, Jianhao Zeng
, Hongshuo Tian
, Bolun Zheng
, Rongbao Kang
, An-An Liu
:
MEF-GD: Multimodal Enhancement and Fusion Network for Garment Designer. 2110-2122 - Zheng Liu

, Jinchao Zhu
, Nannan Li, Gao Huang
:
Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer. 2123-2136 - Qing Tian

, Bin Wang, Xiang Liu
, Jiashuo Shen, Keyang Cheng
, Weihua Ou, Zhen Lei
:
Part-Based Feature Complementary Denoising for Unsupervised Person Re-Identification. 2137-2150 - Jingjing Liu

, Zhiyong Wang
, Xinyu Fan
, Amirhossein Dadashzadeh
, Honghai Liu
, Majid Mirmehdi
:
Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms. 2151-2163 - Ziqi Peng

, Zhenyu Qi
, Yang Cao
, Yu Kang
, Wenjun Lv
:
Modeling Cross-Modal Semantic Transformations From Coarse to Fine in CLIP. 2164-2176 - Zhiwei Ning

, Zhaojiang Liu, Xuanang Gao
, Yifan Zuo
, Jie Yang
, Yuming Fang
, Wei Liu
:
CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection With IoU Joint Prediction. 2177-2190 - Lipeng Gu

, Xuefeng Yan, Weiming Wang
, Honghua Chen
, Dingkun Zhu
, Liangliang Nan
, Mingqiang Wei
:
CrossTracker: Robust Multi-Modal 3D Multi-Object Tracking via Cross Correction. 2191-2206 - Yuanhong Zhong

, Guangxia Yang
, Daidi Zhong
, Xun Yang
, Shanshan Wang
, Zhangling Duan
:
Local-Global Feature Fusion for Enhancing 3D Human Pose Estimation. 2207-2216 - Xiaowei Zhang

, Xinglong Li
, Mingliang Zhou
, Min Gan
, C. L. Philip Chen
:
ASCFormer: An Adaptive Structure-Aware Cascaded Transformer for 3D Object Detection. 2217-2231 - Qianyue Bao

, Fang Liu
, Licheng Jiao
, Yang Liu
, Shuo Li
, Lingling Li
, Xu Liu
, Puhua Chen
, Wenping Ma
:
ERFC: Energy-Aware Reinforcement Feedback Calibration for Zero-Shot Captioning. 2232-2246 - Yuqing Wen

, Yucheng Zhao, Yingfei Liu, Binyuan Huang, Fan Jia, Yanhui Wang, Chi Zhang, Tiancai Wang
, Xiaoyan Sun
, Xiangyu Zhang
:
Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving. 2247-2258 - Yu Gao

, Da-Wei Ding
:
FADiaFrame: Improving Fairness and Accuracy of Deep Learning-Based Diagnosis for Dermatological Lesions via a Novel Post-Processing Framework. 2259-2272 - Xinhua Jiang

, Tianpeng Liu
, Li Liu
, Zhenghui Gong, Yongxiang Liu
, Xiang Li
:
Policy Generalization Enhancement for UAV Active Object Detection via Divide-and-Conquer Sharpness-Aware Gradient Matching. 2273-2289 - Wei Liu

, Yufei Chen
, Xiaodong Yue
, Changqing Zhang
, Shaorong Xie
:
Enhancing Reliability in Medical Image Classification of Imperfect Views. 2290-2303 - Mengwei Li

, Zilei Wang
, Yixin Zhang
:
Improving Zero-Shot Generalization for CLIP With Prompt Ensemble Self-Distillation. 2304-2317 - Hao Li

, Kelin Dang, Maoguo Gong
, A. Kai Qin
, Yu Zhou
, Yue Wu
, Lining Xing
:
Sparse Unmixing Guided Adversarial Attack for Hyperspectral Image Classification. 2318-2331 - Guanlin Du, Hanzi Wang

, Xintao Xu, Yan Yan
, Xuelong Li
:
TCFF-Adapter: Text-Driven Adaption of CLIP for Few-Shot Image Classification. 2332-2343 - Yuxun Qu

, Yongqiang Tang
, Chenyang Zhang
, Wensheng Zhang
:
AdaptGCD: Multi-Expert Adapter Tuning for Generalized Category Discovery. 2344-2357 - Zhi Wang

, Zixuan Wang
, Chao Xu
, Shengze Cai
:
GOTrack+: A Deep Learning Framework With Graph Optimal Transport for Particle Tracking Velocimetry. 2358-2371 - Jiepan Li

, Wei He
, Fangxiao Lu, Hongyan Zhang
:
Toward Complex Backgrounds: A Unified Difference-Aware Decoder for Binary Segmentation. 2372-2386 - Haopeng Fang

, Fei Liu
, Wenfeng Han, He Tang
:
Toward Universal Instance Shadow Detection Based on Pairwise Grouping With Contrastive Morphological Alignment. 2387-2402 - You Wu

, Yongxin Li, Mengyuan Liu, Xucheng Wang, Xiangyang Yang
, Hengzhou Ye
, Dan Zeng, Qijun Zhao
, Shuiwang Li
:
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking. 2403-2418 - Zhenrong Zhang

, Jianan Liu
, Yuxuan Xia
, Tao Huang
, Qing-Long Han
, Hongbin Liu
:
LEGO: Learning and Graph-Optimized Modular Tracker for Online Multi-Object Tracking With Point Clouds. 2419-2432 - Yidong Song

, Shilei Wang
, Zhaochuan Zeng, Jikai Zheng
, Zhenhua Wang
, Jifeng Ning
:
Exploring Pruning-Based Efficient Object Tracking via Hybrid Knowledge Distillation. 2433-2448 - Jianbo Ma

, Hui Luo
, Shuaicheng Niu, Peilin Zhao
, Yunfeng Liu, Yuxing Wei
, Jianlin Zhang
:
Multi-Stage Cross-Modality Feature Interaction for RGB-Thermal Multi-Object Tracking. 2449-2463 - Wei Luo

, Peng Xing
, Yunkang Cao
, Haiming Yao
, Weiming Shen
, Zechao Li
:
URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection. 2464-2477 - Zimu Lu

, Ning Xu
, Hongshuo Tian
, Lanjun Wang
, An-An Liu
:
Medical VLP Model Is Vulnerable: Toward Multimodal Adversarial Attack on Large Medical Vision-Language Models. 2478-2491 - Xuan Xie, Xiang Yuan

, Gong Cheng
:
Weakly Supervised Object Detection for Aerial Images With Instance-Aware Label Assignment. 2492-2504 - Hao Wang

, Junyan Huo
, Fei Yang
, Shuai Wan
, Gaoxing Chen, Kun Yang, Luis Herranz
, Fuzheng Yang
:
Text and Non-Text Latent Feature Disentanglement for Screen Content Image Compression. 2505-2519 - Semih Esenlik

, Yaojun Wu
, Zhaobin Zhang
, Ye-Kui Wang
, Kai Zhang
, Li Zhang
, João Ascenso
, Shan Liu
:
An Overview of the JPEG AI Learning-Based Image Coding Standard. 2520-2537 - Shuai Huo

, Hewei Liu
, Jiawen Gu
, Dengchao Jin
, Meng Lei, Bo Huang, Chao Zhou:
Deep Network-Based Adaptive Quantization for Practical Video Coding. 2538-2550 - Longtao Feng

, Qian Yin
, Jiaqi Zhang
, Yuwen He, Siwei Ma
:
High Accuracy Rate Control for Neural Video Coding Based on Rate-Distortion Modeling. 2551-2567 - Zhongqing Yu, Xin Liu

, Yiu-Ming Cheung
, Lei Zhu
, Xing Xu
, Nannan Wang
:
FPAD: Fuzzy-Prototype-Guided Adversarial Attack and Defense for Deep Cross-Modal Hashing. 2568-2580 - Zhangxiang Shi

, Yunlai Ding, Junyu Dong
, Tianzhu Zhang
:
Beyond One and Two Tower: Cross-Modal Consensus Learning for Image-Text Retrieval. 2581-2593 - Jianxiang Dong

, Zhaozheng Yin
:
Annotation-Efficient Hybrid Learning for Temporal Sentence Grounding. 2594-2606 - Liuxiang Qiu

, Si Chen
, Jing-Hao Xue
, Da-Han Wang
, Shunzhi Zhu
, Yan Yan
:
HOH-Net: High-Order Hierarchical Middle-Feature Learning Network for Visible-Infrared Person Re-Identification. 2607-2622 - Karama Abdelhedi

, Faten Chaabane, Walid Wannes
, William Puech
, Chokri Ben Amar
:
Phylogeny-Based Traitor Tracing Method for Interleaving Attacks. 2623-2634 - Lang He

, Weizhao Yang
, Junnan Zhao
, Haifeng Chen
, Dongmei Jiang
:
FedDAAM: Federated Domain Adversarial Learning With Attention Mechanism for Privacy Preserving Multimodal Depression Assessment. 2635-2648 - Mingyue Niu

, Zhuhong Shao, Yongjun He, Jianhua Tao, Björn W. Schuller
:
Multimodal Local Global Interaction Networks for Automatic Depression Severity Estimation. 2649-2664

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














