


Zhihang Yuan
2020 – today
- 2026
[i55]Zhihang Yuan, Chengyu Yue, Long Huang, Litu Ou, Lei Shi:
Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning. CoRR abs/2601.13697 (2026)
- 2025
[j8]An Guo, Xi Chen, Fangyuan Dong, Jinwu Chen, Zhihang Yuan, Xing Hu, Guangyu Sun, Xiaomin Li, Arindam Basu, Jun Yang, Xin Si:
A 22-nm 64-kB lightning-like hybrid computing-in-memory macro with a compressed adder tree and analog-storage quantizers for transformer and CNNs. Sci. China Inf. Sci. 68(12) (2025)
[c34]Sifan Zhou, Shuo Wang, Zhihang Yuan, Mingjia Shi, Yuzhang Shang, Dawei Yang:
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning. ACL (Findings) 2025: 22971-22988
[c33]Kai Wang, Mingjia Shi, Yukun Zhou, Zekai Li, Zhihang Yuan, Yuzhang Shang, Xiaojiang Peng, Hanwang Zhang, Yang You:
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training. CVPR 2025: 12934-12944
[c32]Sifan Zhou, Zhihang Yuan, Dawei Yang, Xing Hu, Jian Qian, Ziyu Zhao:
PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram. CVPR 2025: 27336-27345
[c31]Songsheng Wang, Rucheng Yu, Zhihang Yuan, Chao Yu, Feng Gao, Yu Wang, Derek F. Wong:
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance. EMNLP 2025: 26928-26940
[c30]Xing Hu, Yuan Cheng, Dawei Yang, Zhixuan Chen, Zukang Xu, Jiangyong Yu, Chen Xu, Zhihang Yuan, Zhe Jiang, Sifan Zhou:
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting. ICLR 2025
[c29]Zukang Xu, Yuxuan Yue, Xing Hu, Dawei Yang, Zhihang Yuan, Zixu Jiang, Zhixuan Chen, Jiangyong Yu, Chen Xu, Sifan Zhou:
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods. ICLR 2025
[c28]Zhixuan Chen, Xing Hu, Dawei Yang, Zukang Xu, Chen Xu, Zhihang Yuan, Sifan Zhou, Jiangyong Yu:
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance. ICML 2025
[c27]Haojie Duanmu, Xiuhong Li, Zhihang Yuan, Size Zheng, Jiangfei Duan, Xingcheng Zhang, Dahua Lin:
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design. ICML 2025
[c26]Chen Xu, Yuxuan Yue, Zukang Xu, Xing Hu, Jiangyong Yu, Zhixuan Chen, Sifan Zhou, Zhihang Yuan, Dawei Yang:
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization. ICML 2025
[c25]Zhaofeng Hu, Sifan Zhou, Zhihang Yuan, Dawei Yang, Shibo Zhao, Ci-Jyun Liang:
MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues. ICRA 2025: 3745-3751
[c24]Songsheng Wang, Shijie Zhang, Zhihang Yuan, Dongkun Wang, Derek F. Wong:
Bidirectional Multitask Learning for Non-Autoregressive Machine Translation. IJCNN 2025: 1-8
[c23]Yuanpeng Zhang, Xing Hu, Xi Chen, Zhihang Yuan, Cong Li, Jingchen Zhu, Zhao Wang, Chenguang Zhang, Xin Si, Wei Gao, Qiang Wu, Runsheng Wang, Guangyu Sun:
AIM: Software and Hardware Co-design for Architecture-level IR-drop Mitigation in High-performance PIM. ISCA 2025: 849-866
[c22]Jiangyong Yu, Sifan Zhou, Dawei Yang, Shuoyu Li, Shuo Wang, Xing Hu, Chen Xu, Zukang Xu, Changyong Shu, Zhihang Yuan:
MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Static Quantization. ACM Multimedia 2025: 1783-1792
[c21]Zhihang Yuan, Siyuan Wang, Yuzhang Shang, Hanling Zhang, Tongcheng Fang, Rui Xie, Shengen Yan, Guohao Dai, Yu Wang:
DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation. ACM Multimedia 2025: 10388-10397
[i54]Zukang Xu, Yuxuan Yue, Xing Hu, Zhihang Yuan, Zixu Jiang, Zhixuan Chen, Jiangyong Yu, Chen Xu, Sifan Zhou, Dawei Yang:
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods. CoRR abs/2501.13484 (2025)
[i53]Xing Hu, Yuan Cheng, Dawei Yang, Zukang Xu, Zhihang Yuan, Jiangyong Yu, Chen Xu, Zhe Jiang, Sifan Zhou:
OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting. CoRR abs/2501.13987 (2025)
[i52]Jiangyong Yu, Sifan Zhou, Dawei Yang, Shuo Wang, Shuoyu Li, Xing Hu, Chen Xu, Zukang Xu, Changyong Shu, Zhihang Yuan:
MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization. CoRR abs/2502.00425 (2025)
[i51]Zhihang Yuan, Siyuan Wang, Rui Xie, Hanling Zhang, Tongcheng Fang, Yuzhang Shang, Shengen Yan, Guohao Dai, Yu Wang:
DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation. CoRR abs/2502.11897 (2025)
[i50]Sifan Zhou, Shuo Wang, Zhihang Yuan, Mingjia Shi, Yuzhang Shang, Dawei Yang:
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning. CoRR abs/2502.12913 (2025)
[i49]Hanling Zhang, Rundong Su, Zhihang Yuan, Pengtao Chen, Mingzhu Shen, Yibo Fan, Shengen Yan, Guohao Dai, Yu Wang:
DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers. CoRR abs/2503.22796 (2025)
[i48]Zhihang Yuan, Rui Xie, Yuzhang Shang, Hanling Zhang, Siyuan Wang, Shengen Yan, Guohao Dai, Yu Wang:
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate. CoRR abs/2504.12259 (2025)
[i47]Chen Xu, Yuxuan Yue, Zukang Xu, Xing Hu, Jiangyong Yu, Zhixuan Chen, Sifan Zhou, Zhihang Yuan, Dawei Yang:
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization. CoRR abs/2505.03803 (2025)
[i46]Xing Hu, Zhixuan Chen, Dawei Yang, Zukang Xu, Chen Xu, Zhihang Yuan, Sifan Zhou, Jiangyong Yu:
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance. CoRR abs/2505.03804 (2025)
[i45]Haojie Duanmu, Xiuhong Li, Zhihang Yuan, Size Zheng, Jiangfei Duan, Xingcheng Zhang, Dahua Lin:
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design. CoRR abs/2505.05799 (2025)
[i44]Tianyu Fu, Yi Ge, Yichen You, Enshu Liu, Zhihang Yuan, Guohao Dai, Shengen Yan, Huazhong Yang, Yu Wang:
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing. CoRR abs/2505.21600 (2025)
[i43]Yuxuan Yue, Zukang Xu, Zhihang Yuan, Dawei Yang, Jianlong Wu, Liqiang Nie:
PCDVQ: Enhancing Vector Quantization for Large Language Models via Polar Coordinate Decoupling. CoRR abs/2506.05432 (2025)
[i42]ZhengLin Lai, MengYao Liao, Dong Xu, Zebin Zhao, Zhihang Yuan, Chao Fan, Jianqiang Li, Bingzhe Wu:
SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification. CoRR abs/2506.17368 (2025)
[i41]Yi Guo, Wei Wang, Zhihang Yuan, Rong Cao, Kuan Chen, Zhengyang Chen, Yuanyuan Huo, Yang Zhang, Yuping Wang, Shouda Liu, Yuxuan Wang:
SplitMeanFlow: Interval Splitting Consistency in Few-Step Generative Modeling. CoRR abs/2507.16884 (2025)
[i40]Chen Zhu, Wangbo Zhao, Huiwen Zhang, Samir Khaki, Yuhao Zhou, Weidong Tang, Shuo Wang, Zhihang Yuan, Yuzhang Shang, Xiaojiang Peng, Kai Wang, Dawei Yang:
EA-ViT: Efficient Adaptation for Elastic Vision Transformer. CoRR abs/2507.19360 (2025)
[i39]Songsheng Wang, Rucheng Yu, Zhihang Yuan, Chao Yu, Feng Gao, Yu Wang, Derek F. Wong:
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance. CoRR abs/2507.22424 (2025)
[i38]Hanling Zhang, Yayu Zhou, Tongcheng Fang, Zhihang Yuan, Guohao Dai, Yu Wang:
VocabTailor: Dynamic Vocabulary Selection for Downstream Tasks in Small Language Models. CoRR abs/2508.15229 (2025)
[i37]Ang Li, Zhihang Yuan, Yang Zhang, Shouda Liu, Yisen Wang:
Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning. CoRR abs/2509.00125 (2025)
[i36]Ang Li, Yifei Wang, Zhihang Yuan, Stefanie Jegelka, Yisen Wang:
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs. CoRR abs/2510.16552 (2025)
[i35]Mengzhao Chen, Meng Wu, Hui Jin, Zhihang Yuan, Jing Liu, Chaoyi Zhang, Yunshui Li, Jie Huang, Jin Ma, Zeyue Xue, Zhiheng Liu, Xingyan Bin, Ping Luo:
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats. CoRR abs/2510.25602 (2025)
[i34]Yuanpeng Zhang, Xing Hu, Xi Chen, Zhihang Yuan, Cong Li, Jingchen Zhu, Zhao Wang, Chenguang Zhang, Xin Si, Wei Gao, Qiang Wu, Runsheng Wang, Guangyu Sun:
AIM: Software and Hardware Co-design for Architecture-level IR-drop Mitigation in High-performance PIM. CoRR abs/2511.04321 (2025)
[i33]Shaoyuan Chen, Zhixuan Chen, Dawei Yang, Zhihang Yuan, Qiang Wu:
OTARo: Once Tuning for All Precisions toward Robust On-Device LLMs. CoRR abs/2511.13147 (2025)
[i32]Yushi Huang, Zining Wang, Zhihang Yuan, Yifu Ding, Ruihao Gong, Jinyang Guo, Xianglong Liu, Jun Zhang:
MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping. CoRR abs/2511.15690 (2025)
- 2024
[j7]Zhenyang Hao, Xinggang Wang, Jiawei Liu, Zhihang Yuan, Dawei Yang, Wenyu Liu:
Stabilized activation scale estimation for precise Post-Training Quantization. Neurocomputing 569: 127120 (2024)
[j6]Dawei Yang, Ning He, Xing Hu, Zhihang Yuan, Jiangyong Yu, Chen Xu, Zhe Jiang:
Post-training quantization for re-parameterization via coarse & fine weight splitting. J. Syst. Archit. 147: 103065 (2024)
[j5]Yizeng Han, Zeyu Liu, Zhihang Yuan, Yifan Pu, Chaofei Wang, Shiji Song, Gao Huang:
Latency-Aware Unified Dynamic Networks for Efficient Image Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 7760-7774 (2024)
[c20]Chenguang Zhang, Zhihang Yuan, Xingchen Li, Guangyu Sun:
Algorithm-Hardware Co-Design for Energy-Efficient A/D Conversion in ReRAM-Based Accelerators. DATE 2024: 1-6
[c19]Luning Wang, Shiyao Li, Xuefei Ning, Zhihang Yuan, Shengen Yan, Guohao Dai, Yu Wang:
CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios. ENLSP 2024: 468-484
[c18]Zhihang Yuan, Yuzhang Shang, Zhen Dong:
PB-LLM: Partially Binarized Large Language Models. ICLR 2024
[c17]Chuanhao Sun, Zhihang Yuan, Kai Xu, Luo Mai, N. Siddharth, Shuo Chen, Mahesh K. Marina:
Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding. ICML 2024: 47218-47233
[c16]An Guo, Xi Chen, Fangyuan Dong, Jinwu Chen, Zhihang Yuan, Xing Hu, Yuanpeng Zhang, Jingmin Zhang, Yuchen Tang, Zhican Zhang, Gang Chen, Dawei Yang, Zhaoyang Zhang, Lizheng Ren, Tianzhu Xiong, Bo Wang, Bo Liu, Weiwei Shan, Xinning Liu, Hao Cai, Guangyu Sun, Jun Yang, Xin Si:
34.3 A 22nm 64kb Lightning-Like Hybrid Computing-in-Memory Macro with a Compressed Adder Tree and Analog-Storage Quantizers for Transformer and CNNs. ISSCC 2024: 570-572
[c15]Zhihang Yuan, Hanling Zhang, Lu Pu, Xuefei Ning, Linfeng Zhang, Tianchen Zhao, Shengen Yan, Guohao Dai, Yu Wang:
DiTFastAttn: Attention Compression for Diffusion Transformer Models. NeurIPS 2024
[c14]Zhihang Yuan, Linshuai Zhang, Tao Jiang, Shuoxin Gu, Lin Xu, Yujie Zhang, Peng Zeng, Marcin Grzegorzek:
A Force Control Method of Medical Robot Based on Impedance Control. ROBIO 2024: 1072-1076
[i31]Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, Yan Yan:
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning. CoRR abs/2402.03666 (2024)
[i30]Chenguang Zhang, Zhihang Yuan, Xingchen Li, Guangyu Sun:
Algorithm-hardware co-design for Energy-Efficient A/D conversion in ReRAM-based accelerators. CoRR abs/2402.06164 (2024)
[i29]Yuxuan Yue, Zhihang Yuan, Haojie Duanmu, Sifan Zhou, Jianlong Wu, Liqiang Nie:
WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More. CoRR abs/2402.12065 (2024)
[i28]Zhihang Yuan, Yuzhang Shang, Yang Zhou, Zhen Dong, Zhe Zhou, Chenhao Xue, Bingzhe Wu, Zhikai Li, Qingyi Gu, Yong Jae Lee, Yan Yan, Beidi Chen, Guangyu Sun, Kurt Keutzer:
LLM Inference Unveiled: Survey and Roofline Model Insights. CoRR abs/2402.16363 (2024)
[i27]Weisheng Xu, Sifan Zhou, Zhihang Yuan:
PillarTrack: Redesigning Pillar-based Transformer Network for Single Object Tracking on Point Clouds. CoRR abs/2404.07495 (2024)
[i26]Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-Ping Zhang, Yuhan Dong, Yu Wang:
A Survey on Efficient Inference for Large Language Models. CoRR abs/2404.14294 (2024)
[i25]Haojie Duanmu, Zhihang Yuan, Xiuhong Li, Jiangfei Duan, Xingcheng Zhang, Dahua Lin:
SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models. CoRR abs/2405.06219 (2024)
[i24]Kai Wang, Yukun Zhou, Mingjia Shi, Zhihang Yuan, Yuzhang Shang, Xiaojiang Peng, Hanwang Zhang, Yang You:
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training. CoRR abs/2405.17403 (2024)
[i23]Xing Hu, Yuan Cheng, Dawei Yang, Zhihang Yuan, Jiangyong Yu, Chen Xu, Sifan Zhou:
I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models. CoRR abs/2405.17849 (2024)
[i22]Zhihang Yuan, Pu Lu, Hanling Zhang, Xuefei Ning, Linfeng Zhang, Tianchen Zhao, Shengen Yan, Guohao Dai, Yu Wang:
DiTFastAttn: Attention Compression for Diffusion Transformer Models. CoRR abs/2406.08552 (2024)
[i21]Chuanhao Sun, Zhihang Yuan, Kai Xu, Luo Mai, N. Siddharth, Shuo Chen, Mahesh K. Marina:
Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding. CoRR abs/2407.09370 (2024)
[i20]Luning Wang, Shiyao Li, Xuefei Ning, Zhihang Yuan, Shengen Yan, Guohao Dai, Yu Wang:
CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios. CoRR abs/2409.10593 (2024)
[i19]Rui Xie, Tianchen Zhao, Zhihang Yuan, Rui Wan, Wenxi Gao, Zhenhua Zhu, Xuefei Ning, Yu Wang:
LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization. CoRR abs/2411.17178 (2024)
[i18]Zhaofeng Hu, Sifan Zhou, Shibo Zhao, Zhihang Yuan:
MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues. CoRR abs/2412.02734 (2024)
[i17]Zhihang Yuan, Yuzhang Shang, Hanling Zhang, Tongcheng Fang, Rui Xie, Bingxin Xu, Yan Yan, Shengen Yan, Guohao Dai, Yu Wang:
E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling. CoRR abs/2412.14170 (2024)
- 2023
[j4]Xiaoyang Wang, Zhe Zhou, Zhihang Yuan, Jingchen Zhu, Yulong Cao, Yao Zhang, Kangrui Sun, Guangyu Sun:
FD-CNN: A Frequency-Domain FPGA Acceleration Scheme for CNN-Based Image-Processing Applications. ACM Trans. Embed. Comput. Syst. 22(6): 91:1-91:30 (2023)
[c13]Yuzhang Shang, Zhihang Yuan, Bin Xie, Bingzhe Wu, Yan Yan:
Post-Training Quantization on Diffusion Models. CVPR 2023: 1972-1981
[c12]Jiawei Liu, Lin Niu, Zhihang Yuan, Dawei Yang, Xinggang Wang, Wenyu Liu:
PD-Quant: Post-Training Quantization Based on Prediction Difference Metric. CVPR 2023: 24427-24437
[c11]Yuzhang Shang, Zhihang Yuan, Yan Yan:
MIM4DD: Mutual Information Maximization for Dataset Distillation. NeurIPS 2023
[i16]Zhihang Yuan, Jiawei Liu, Jiaxiang Wu, Dawei Yang, Qiang Wu, Guangyu Sun, Wenyu Liu, Xinggang Wang, Bingzhe Wu:
Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance. CoRR abs/2303.13003 (2023)
[i15]Zhihang Yuan, Lin Niu, Jiawei Liu, Wenyu Liu, Xinggang Wang, Yuzhang Shang, Guangyu Sun, Qiang Wu, Jiaxiang Wu, Bingzhe Wu:
RPTQ: Reorder-based Post-training Quantization for Large Language Models. CoRR abs/2304.01089 (2023)
[i14]Lin Niu, Jiawei Liu, Zhihang Yuan, Dawei Yang, Xinggang Wang, Wenyu Liu:
Improving Post-Training Quantization on Object Detection with Task Loss-Guided Lp Metric. CoRR abs/2304.09785 (2023)
[i13]Yizeng Han, Zeyu Liu, Zhihang Yuan, Yifan Pu, Chaofei Wang, Shiji Song, Gao Huang:
Latency-aware Unified Dynamic Networks for Efficient Image Recognition. CoRR abs/2308.15949 (2023)
[i12]Yuzhang Shang, Zhihang Yuan, Qiang Wu, Zhen Dong:
PB-LLM: Partially Binarized Large Language Models. CoRR abs/2310.00034 (2023)
[i11]Zhihang Yuan, Yuzhang Shang, Yue Song, Qiang Wu, Yan Yan, Guangyu Sun:
ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models. CoRR abs/2312.05821 (2023)
[i10]Dawei Yang, Ning He, Xing Hu, Zhihang Yuan, Jiangyong Yu, Chen Xu, Zhe Jiang:
Post-Training Quantization for Re-parameterization via Coarse & Fine Weight Splitting. CoRR abs/2312.10588 (2023)
[i9]Yuzhang Shang, Zhihang Yuan, Yan Yan:
MIM4DD: Mutual Information Maximization for Dataset Distillation. CoRR abs/2312.16627 (2023)
- 2022
[j3]Xingchen Li, Zhihang Yuan, Yijin Guan, Guangyu Sun, Tao Zhang, Rongshan Wei, Dimin Niu:
Flatfish: A Reinforcement Learning Approach for Application-Aware Address Mapping. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(11): 4758-4770 (2022)
[c10]Xingchen Li, Zhihang Yuan, Guangyu Sun, Liang Zhao, Zhichao Lu:
Tailor: removing redundant operations in memristive analog neural network accelerators. DAC 2022: 1009-1014
[c9]Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun:
PTQ4ViT: Post-training Quantization for Vision Transformers with Twin Uniform Quantization. ECCV (12) 2022: 191-207
[c8]Xingchen Li, Bingzhe Wu, Guangyu Sun, Zhe Zhang, Zhihang Yuan, Runsheng Wang, Ru Huang, Dimin Niu, Hongzhong Zheng, Zhichao Lu, Liang Zhao, Meng-Fan Marvin Chang, Tianchan Guan, Xin Si:
Enabling High-Quality Uncertainty Quantification in a PIM Designed for Bayesian Neural Network. HPCA 2022: 1043-1055
[c7]Yizeng Han, Zhihang Yuan, Yifan Pu, Chenhao Xue, Shiji Song, Guangyu Sun, Gao Huang:
Latency-aware Spatial-wise Dynamic Networks. NeurIPS 2022
[i8]Yizeng Han, Zhihang Yuan, Yifan Pu, Chenhao Xue, Shiji Song, Guangyu Sun, Gao Huang:
Latency-aware Spatial-wise Dynamic Networks. CoRR abs/2210.06223 (2022)
[i7]Yuzhang Shang, Zhihang Yuan, Bin Xie, Bingzhe Wu, Yan Yan:
Post-training Quantization on Diffusion Models. CoRR abs/2211.15736 (2022)
[i6]Jiawei Liu, Lin Niu, Zhihang Yuan, Dawei Yang, Xinggang Wang, Wenyu Liu:
PD-Quant: Post-Training Quantization based on Prediction Difference Metric. CoRR abs/2212.07048 (2022)
- 2021
[j2]Zhihang Yuan, Jingze Liu, Xingchen Li, Longhao Yan, Haoxiang Chen, Bingzhe Wu, Yuchao Yang, Guangyu Sun:
NAS4RRAM: neural network architecture search for inference on RRAM-based accelerators. Sci. China Inf. Sci. 64(6) (2021)
[c6]Spencer Nelson, Sang Yun Kim, Jia Di, Zhe Zhou, Zhihang Yuan, Guangyu Sun:
Reconfigurable ASIC Implementation of Asynchronous Recurrent Neural Networks. ASYNC 2021: 48-54
[c5]Spencer Nelson, Wassim Khalil, SangYun Kim, Jia Di, Zhe Zhou, Zhihang Yuan, Guang-Yu Sun:
Rapid Configuration of Asynchronous Recurrent Neural Networks for ASIC Implementations. HPEC 2021: 1-6
[i5]Zhao Wang, Guangyu Sun, Jingchen Zhu, Zhe Zhou, Yijiang Guo, Zhihang Yuan:
METRO: A Software-Hardware Co-Design of Interconnections for Spatial DNN Accelerators. CoRR abs/2108.10570 (2021)
[i4]Zhihang Yuan, Yiqi Chen, Chenhao Xue, Chenguang Zhang, Qiankun Wang, Guangyu Sun:
PTQ-SL: Exploring the Sub-layerwise Post-training Quantization. CoRR abs/2110.07809 (2021)
[i3]Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun:
PTQ4ViT: Post-Training Quantization Framework for Vision Transformers. CoRR abs/2111.12293 (2021)
- 2020
[j1]Yijin Guan, Guangyu Sun, Zhihang Yuan, Xingchen Li, Ningyi Xu, Shu Chen, Jason Cong, Yuan Xie:
Crane: Mitigating Accelerator Under-utilization Caused by Sparsity Irregularities in CNNs. IEEE Trans. Computers 69(7): 931-943 (2020)
[c4]Zhihang Yuan, Bingzhe Wu, Guangyu Sun, Zheng Liang, Shiwan Zhao, Weichen Bi:
S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search. ECCV (2) 2020: 175-192
[i2]Zhihang Yuan, Xin Liu, Bingzhe Wu, Guangyu Sun:
ENAS4D: Efficient Multi-stage CNN Architecture Search for Dynamic Inference. CoRR abs/2009.09182 (2020)
2010 – 2019
- 2019
[i1]Zhihang Yuan, Bingzhe Wu, Zheng Liang, Shiwan Zhao, Weichen Bi, Guangyu Sun:
S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search. CoRR abs/1911.07033 (2019)
- 2017
[c3]Yijin Guan, Ningyi Xu, Chen Zhang, Zhihang Yuan, Jason Cong:
Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators. APPT 2017: 14-26
[c2]Yijin Guan, Zhihang Yuan, Guangyu Sun, Jason Cong:
FPGA-based accelerator for long short-term memory recurrent neural networks. ASP-DAC 2017: 629-634
[c1]Bingzhe Wu, Zhichao Liu, Zhihang Yuan, Guangyu Sun, Charles Wu:
Reducing Overfitting in Deep Convolutional Neural Networks Using Redundancy Regularizer. ICANN (2) 2017: 49-55
last updated on 2026-02-26 23:22 CET by the dblp team
all metadata released as open data under CC0 1.0 license







