


default search action
Zhilin Yang 0001
Person information
- affiliation: Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China
- affiliation: Shanghai Qi Zhi Institute, China
- affiliation (PhD 2020): Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA
Other persons with the same name
- Zhilin Yang (aka: Zhi-Lin Yang) — disambiguation page
- Zhilin Yang 0002
— City University of Hong Kong, Department of Marketing, Hong Kong - Zhilin Yang 0003
— Qingdao Technological University, Department of Mathematics, Qingdao, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
[j2]Xiao Liu
, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian
, Zhilin Yang, Jie Tang:
GPT understands, too. AI Open 5: 208-215 (2024)- 2023
[c35]Yanru Chen, Yanan Zheng, Zhilin Yang:
Prompt-Based Metric Learning for Few-Shot NER. ACL (Findings) 2023: 7199-7212
[c34]Haike Xu, Zongyu Lin, Jing Zhou, Yanan Zheng, Zhilin Yang:
A Universal Discriminator for Zero-Shot Generalization. ACL (1) 2023: 10559-10575
[c33]Nan Shao, Zefan Cai, Hanwei Xu, Chonghua Liao, Yanan Zheng, Zhilin Yang:
Compositional Task Representations for Large Language Models. ICLR 2023
[c32]Jing Zhou, Zongyu Lin, Yanan Zheng, Jian Li, Zhilin Yang:
Not All Tasks Are Born Equal: Understanding Zero-Shot Generalization. ICLR 2023
[c31]Qinkai Zheng
, Xiao Xia
, Xu Zou
, Yuxiao Dong
, Shan Wang
, Yufei Xue
, Lei Shen
, Zihan Wang
, Andi Wang
, Yang Li
, Teng Su
, Zhilin Yang
, Jie Tang
:
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X. KDD 2023: 5673-5684
[i32]Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang:
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X. CoRR abs/2303.17568 (2023)- 2022
[c30]Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Tam, Zhengxiao Du, Zhilin Yang, Jie Tang:
P-Tuning: Prompt Tuning Can Be Comparable to Fine-tuning Across Scales and Tasks. ACL (2) 2022: 61-68
[c29]Zhengxiao Du, Yujie Qian, Xiao Liu, Ming Ding, Jiezhong Qiu, Zhilin Yang, Jie Tang:
GLM: General Language Model Pretraining with Autoregressive Blank Infilling. ACL (1) 2022: 320-335
[c28]Yanan Zheng, Jing Zhou, Yujie Qian, Ming Ding, Chonghua Liao, Li Jian, Ruslan Salakhutdinov, Jie Tang, Sebastian Ruder, Zhilin Yang:
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding. ACL (1) 2022: 501-516
[c27]Jing Zhou, Yanan Zheng, Jie Tang, Li Jian, Zhilin Yang:
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning. ACL (1) 2022: 8646-8665
[c26]Zhihao Wang, Zongyu Lin, Junjie Wen, Xianxin Chen, Peiqi Liu, Guidong Zheng, Yujun Chen, Zhilin Yang:
Learning to Detect Noisy Labels Using Model-Based Features. EMNLP (Findings) 2022: 5796-5808
[c25]Xingcheng Yao, Yanan Zheng, Xiaocong Yang, Zhilin Yang:
NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework. ICML 2022: 25438-25451
[i31]Yanru Chen, Yanan Zheng, Zhilin Yang:
Prompt-Based Metric Learning for Few-Shot NER. CoRR abs/2211.04337 (2022)
[i30]Chonghua Liao, Yanan Zheng, Zhilin Yang:
Zero-Label Prompt Selection. CoRR abs/2211.04668 (2022)
[i29]Haike Xu, Zongyu Lin, Jing Zhou, Yanan Zheng, Zhilin Yang:
A Universal Discriminator for Zero-Shot Generalization. CoRR abs/2211.08099 (2022)
[i28]Zhihao Wang, Zongyu Lin, Peiqi Liu, Guidong Zheng, Junjie Wen, Xianxin Chen, Yujun Chen, Zhilin Yang:
Learning to Detect Noisy Labels Using Model-Based Features. CoRR abs/2212.13767 (2022)- 2021
[j1]Sha Yuan, Hanyu Zhao, Zhengxiao Du, Ming Ding, Xiao Liu, Yukuo Cen
, Xu Zou
, Zhilin Yang, Jie Tang:
WuDaoCorpora: A super large-scale Chinese corpora for pre-training language models. AI Open 2: 65-68 (2021)
[c24]Xu Zou, Da Yin, Qingyang Zhong, Hongxia Yang, Zhilin Yang, Jie Tang:
Controllable Generation from Pre-trained Language Models via Inverse Prompting. KDD 2021: 2450-2460
[c23]Ming Ding, Yuxiao Dong, Xiao Liu, Jiezhong Qiu, Jie Tang, Zhilin Yang:
The International Workshop on Pretraining: Algorithms, Architectures, and Applications ([email protected] 2021). KDD 2021: 4119-4120
[i27]Zhengxiao Du, Yujie Qian, Xiao Liu, Ming Ding, Jiezhong Qiu, Zhilin Yang, Jie Tang:
All NLP Tasks Are Generation Tasks: A General Pretraining Framework. CoRR abs/2103.10360 (2021)
[i26]Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian, Zhilin Yang, Jie Tang:
GPT Understands, Too. CoRR abs/2103.10385 (2021)
[i25]Xu Zou, Da Yin, Qingyang Zhong, Hongxia Yang, Zhilin Yang, Jie Tang:
Controllable Generation from Pre-trained Language Models via Inverse Prompting. CoRR abs/2103.10685 (2021)
[i24]Jiaao He
, Jiezhong Qiu, Aohan Zeng, Zhilin Yang, Jidong Zhai, Jie Tang:
FastMoE: A Fast Mixture-of-Expert Training System. CoRR abs/2103.13262 (2021)
[i23]Jing Zhou, Yanan Zheng, Jie Tang, Jian Li, Zhilin Yang:
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning. CoRR abs/2108.06332 (2021)
[i22]Yanan Zheng, Jing Zhou, Yujie Qian, Ming Ding, Jian Li, Ruslan Salakhutdinov, Jie Tang, Sebastian Ruder, Zhilin Yang:
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding. CoRR abs/2109.12742 (2021)
[i21]Xiao Liu, Kaixuan Ji, Yicheng Fu, Zhengxiao Du, Zhilin Yang, Jie Tang:
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks. CoRR abs/2110.07602 (2021)
[i20]Xingcheng Yao, Yanan Zheng, Xiaocong Yang, Zhilin Yang:
NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework. CoRR abs/2111.04130 (2021)
2010 – 2019
- 2019
[c22]Zihang Dai, Zhilin Yang, Yiming Yang, Jaime G. Carbonell, Quoc Viet Le, Ruslan Salakhutdinov:
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context. ACL (1) 2019: 2978-2988
[c21]Zhilin Yang, Zihang Dai, Yiming Yang, Jaime G. Carbonell, Ruslan Salakhutdinov, Quoc V. Le:
XLNet: Generalized Autoregressive Pretraining for Language Understanding. NeurIPS 2019: 5754-5764
[c20]Zhilin Yang, Thang Luong, Ruslan Salakhutdinov, Quoc V. Le:
Mixtape: Breaking the Softmax Bottleneck Efficiently. NeurIPS 2019: 15922-15930
[i19]Zihang Dai, Zhilin Yang, Yiming Yang, Jaime G. Carbonell, Quoc V. Le, Ruslan Salakhutdinov:
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context. CoRR abs/1901.02860 (2019)
[i18]Zhilin Yang, Zihang Dai, Yiming Yang, Jaime G. Carbonell, Ruslan Salakhutdinov, Quoc V. Le:
XLNet: Generalized Autoregressive Pretraining for Language Understanding. CoRR abs/1906.08237 (2019)- 2018
[c19]Jiateng Xie, Zhilin Yang, Graham Neubig, Noah A. Smith, Jaime G. Carbonell:
Neural Cross-lingual Named Entity Recognition with Minimal Resources. EMNLP 2018: 369-379
[c18]Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William W. Cohen, Ruslan Salakhutdinov, Christopher D. Manning:
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering. EMNLP 2018: 2369-2380
[c17]Zhilin Yang, Zihang Dai, Ruslan Salakhutdinov, William W. Cohen:
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model. ICLR 2018
[c16]Zhilin Yang, Saizheng Zhang, Jack Urbanek, Will Feng, Alexander H. Miller, Arthur Szlam, Douwe Kiela, Jason Weston:
Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent. ICLR (Poster) 2018
[c15]Bhuwan Dhingra, Qiao Jin
, Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov:
Neural Models for Reasoning over Multiple Mentions Using Coreference. NAACL-HLT (2) 2018: 42-48
[c14]Zhilin Yang, Junbo Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann LeCun:
GLoMo: Unsupervised Learning of Transferable Relational Graphs. NeurIPS 2018: 8964-8975
[i17]Bhuwan Dhingra, Qiao Jin, Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov:
Neural Models for Reasoning over Multiple Mentions using Coreference. CoRR abs/1804.05922 (2018)
[i16]Zhilin Yang, Junbo Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann LeCun:
GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations. CoRR abs/1806.05662 (2018)
[i15]Jiateng Xie, Zhilin Yang, Graham Neubig, Noah A. Smith, Jaime G. Carbonell:
Neural Cross-Lingual Named Entity Recognition with Minimal Resources. CoRR abs/1808.09861 (2018)
[i14]Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William W. Cohen, Ruslan Salakhutdinov, Christopher D. Manning:
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering. CoRR abs/1809.09600 (2018)- 2017
[c13]Zhilin Yang, Junjie Hu, Ruslan Salakhutdinov, William W. Cohen:
Semi-Supervised QA with Generative Domain-Adaptive Nets. ACL (1) 2017: 1040-1050
[c12]Bhuwan Dhingra, Hanxiao Liu, Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov:
Gated-Attention Readers for Text Comprehension. ACL (1) 2017: 1832-1846
[c11]Zhilin Yang, Bhuwan Dhingra, Ye Yuan, Junjie Hu, William W. Cohen, Ruslan Salakhutdinov:
Words or Characters? Fine-grained Gating for Reading Comprehension. ICLR (Poster) 2017
[c10]Zhilin Yang, Ruslan Salakhutdinov, William W. Cohen:
Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks. ICLR (Poster) 2017
[c9]Fan Yang, Zhilin Yang, William W. Cohen:
Differentiable Learning of Logical Rules for Knowledge Base Reasoning. NIPS 2017: 2319-2328
[c8]Zihang Dai, Zhilin Yang, Fan Yang, William W. Cohen, Ruslan Salakhutdinov:
Good Semi-supervised Learning That Requires a Bad GAN. NIPS 2017: 6510-6520
[i13]Zhilin Yang, Junjie Hu, Ruslan Salakhutdinov, William W. Cohen:
Semi-Supervised QA with Generative Domain-Adaptive Nets. CoRR abs/1702.02206 (2017)
[i12]Yujie Qian, Jie Tang, Zhilin Yang, Binxuan Huang, Wei Wei, Kathleen M. Carley:
A Probabilistic Framework for Location Inference from Social Media. CoRR abs/1702.07281 (2017)
[i11]Fan Yang, Zhilin Yang, William W. Cohen:
Differentiable Learning of Logical Rules for Knowledge Base Completion. CoRR abs/1702.08367 (2017)
[i10]Bhuwan Dhingra, Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov:
Linguistic Knowledge as Memory for Recurrent Neural Networks. CoRR abs/1703.02620 (2017)
[i9]Zhilin Yang, Ruslan Salakhutdinov, William W. Cohen:
Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks. CoRR abs/1703.06345 (2017)
[i8]Zihang Dai, Zhilin Yang, Fan Yang, William W. Cohen, Ruslan Salakhutdinov:
Good Semi-supervised Learning that Requires a Bad GAN. CoRR abs/1705.09783 (2017)
[i7]Zhilin Yang, Zihang Dai, Ruslan Salakhutdinov, William W. Cohen:
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model. CoRR abs/1711.03953 (2017)
[i6]Zhilin Yang, Saizheng Zhang, Jack Urbanek, Will Feng, Alexander H. Miller, Arthur Szlam, Douwe Kiela, Jason Weston:
Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent. CoRR abs/1711.07950 (2017)- 2016
[c7]Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov:
Revisiting Semi-Supervised Learning with Graph Embeddings. ICML 2016: 40-48
[c6]Zhilin Yang, Jie Tang, William W. Cohen:
Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs. IJCAI 2016: 2287-2293
[c5]Zhilin Yang, Ye Yuan, Yuexin Wu, William W. Cohen, Ruslan Salakhutdinov:
Review Networks for Caption Generation. NIPS 2016: 2361-2369
[i5]Zhilin Yang, Ruslan Salakhutdinov, William W. Cohen:
Multi-Task Cross-Lingual Sequence Tagging from Scratch. CoRR abs/1603.06270 (2016)
[i4]Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov:
Revisiting Semi-Supervised Learning with Graph Embeddings. CoRR abs/1603.08861 (2016)
[i3]Zhilin Yang, Ye Yuan, Yuexin Wu, Ruslan Salakhutdinov, William W. Cohen:
Encode, Review, and Decode: Reviewer Module for Caption Generation. CoRR abs/1605.07912 (2016)
[i2]Zhilin Yang, Bhuwan Dhingra, Ye Yuan, Junjie Hu, William W. Cohen, Ruslan Salakhutdinov:
Words or Characters? Fine-grained Gating for Reading Comprehension. CoRR abs/1611.01724 (2016)- 2015
[c4]Yutao Zhang, Jie Tang, Zhilin Yang, Jian Pei
, Philip S. Yu:
COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency. KDD 2015: 1485-1494
[i1]Zhilin Yang, Jie Tang:
Multi-Source Bayesian Embeddings for Learning Social Knowledge Graphs. CoRR abs/1508.00715 (2015)- 2014
[c3]Zhilin Yang, Jie Tang, Yutao Zhang:
Active Learning for Streaming Networked Data. CIKM 2014: 1129-1138
[c2]Zhilin Yang, Jie Tang, Bin Xu, Chunxiao Xing
:
Active learning for networked data based on non-progressive diffusion model. WSDM 2014: 363-372- 2013
[c1]Yang Yang, Jianfei Wang, Yutao Zhang, Wei Chen, Jing Zhang, Honglei Zhuang, Zhilin Yang, Bo Ma, Zhanpeng Fang, Sen Wu, Xiaoxiao Li, Debing Liu, Jie Tang:
SAE: social analytic engine for large networks. KDD 2013: 1502-1505
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-10 22:57 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







