Google Scholar

User profiles for Tianyi Zhou

Tianyi Zhou

- Verified email at umiacs.umd.edu - Cited by 5801

Tianyi Zhou

- Verified email at usc.edu - Cited by 314

[PDF] unimelb.edu.au

[PDF][PDF] Xgboost: extreme gradient boosting

…, K Chen, R Mitchell, I Cano, T Zhou - … version 0.4-2, 2015 - cran.ms.unimelb.edu.au

This is an introductory document of using the xgboost package in R. xgboost is short for
eXtreme Gradient Boosting package. It is an efficient and scalable implementation of gradient …

Save Cite Cited by 3601 Related articles All 18 versions View as HTML

[PDF] uts.edu.au

Godec: Randomized low-rank & sparse matrix decomposition in noisy case

T Zhou, D Tao - … of the 28th International Conference on …, 2011 - opus.lib.uts.edu.au

Low-rank and sparse structures have been profoundly studied in matrix completion and
compressed sensing. In this paper, we develop "Go Decomposition" (GoDec) to efficiently and …

Save Cite Cited by 825 Related articles All 13 versions View as HTML

[PDF] aaai.org

Disan: Directional self-attention network for rnn/cnn-free language understanding

T Shen, T Zhou, G Long, J Jiang, S Pan… - Proceedings of the AAAI …, 2018 - ojs.aaai.org

Recurrent neural nets (RNN) and convolutional neural nets (CNN) are widely used on NLP
tasks to capture the long-term and local dependencies, respectively. Attention mechanisms …

Save Cite Cited by 844 Related articles All 12 versions View as HTML

[PDF] aaai.org

Fedproto: Federated prototype learning across heterogeneous clients

Y Tan, G Long, L Liu, T Zhou, Q Lu, J Jiang… - Proceedings of the …, 2022 - ojs.aaai.org

Heterogeneity across clients in federated learning (FL) usually hinders the optimization
convergence and generalization performance when the aggregation of clients' knowledge …

Save Cite Cited by 283 Related articles All 8 versions View as HTML

[PDF] mlr.press

Deja vu: Contextual sparsity for efficient llms at inference time

Z Liu, J Wang, T Dao, T Zhou, B Yuan… - International …, 2023 - proceedings.mlr.press

Large language models (LLMs) with hundreds of billions of parameters have sparked a new
wave of exciting AI applications. However, they are computationally expensive at inference …

Save Cite Cited by 80 Related articles All 8 versions View as HTML

[PDF] neurips.cc

Federated learning from pre-trained models: A contrastive learning approach

Y Tan, G Long, J Ma, L Liu, T Zhou… - Advances in neural …, 2022 - proceedings.neurips.cc

Federated Learning (FL) is a machine learning paradigm that allows decentralized clients to
learn collaboratively without sharing their private data. However, excessive computation …

Save Cite Cited by 98 Related articles All 7 versions View as HTML

[PDF] neurips.cc

H2o: Heavy-hitter oracle for efficient generative inference of large language models

Z Zhang, Y Sheng, T Zhou, T Chen… - Advances in …, 2024 - proceedings.neurips.cc

… [100] Jan van den Brand, Zhao Song, and Tianyi Zhou. Algorithm and hardness for dynamic
attention maintenance in large language models. arXiv preprint arXiv:2304.02207, 2023. …

Save Cite Cited by 56 Related articles All 5 versions View as HTML

[PDF] arxiv.org

Structure-augmented text representation learning for efficient knowledge graph completion

B Wang, T Shen, G Long, T Zhou, Y Wang… - Proceedings of the Web …, 2021 - dl.acm.org

Human-curated knowledge graphs provide critical supportive information to various natural
language processing tasks, but these graphs are usually incomplete, urging auto-completion …

Save Cite Cited by 183 Related articles All 9 versions

[PDF] arxiv.org

Manifold elastic net: a unified framework for sparse dimension reduction

T Zhou, D Tao, X Wu - Data Mining and Knowledge Discovery, 2011 - Springer

It is difficult to find the optimal sparse solution of a manifold learning based dimensionality
reduction algorithm. The lasso or the elastic net penalized manifold learning based …

Save Cite Cited by 202 Related articles All 15 versions

[PDF] arxiv.org

Bi-directional block self-attention for fast and memory-efficient sequence modeling

T Shen, T Zhou, G Long, J Jiang, C Zhang - arXiv preprint arXiv …, 2018 - arxiv.org

Recurrent neural networks (RNN), convolutional neural networks (CNN) and self-attention
networks (SAN) are commonly used to produce context-aware representations. RNN can …

Save Cite Cited by 172 Related articles All 7 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

User profiles for Tianyi Zhou

Tianyi Zhou

Tianyi Zhou

[PDF][PDF] Xgboost: extreme gradient boosting

Godec: Randomized low-rank & sparse matrix decomposition in noisy case

Disan: Directional self-attention network for rnn/cnn-free language understanding

Fedproto: Federated prototype learning across heterogeneous clients

Deja vu: Contextual sparsity for efficient llms at inference time

Federated learning from pre-trained models: A contrastive learning approach

H2o: Heavy-hitter oracle for efficient generative inference of large language models

Structure-augmented text representation learning for efficient knowledge graph completion

Manifold elastic net: a unified framework for sparse dimension reduction

Bi-directional block self-attention for fast and memory-efficient sequence modeling

Related searches