User profiles for Difan Zou
Difan Zou - The University of Hong Kong - Verified email at cs.hku.hk - Cited by 3925
An improved analysis of training over-parameterized deep neural networks
A recent line of research has shown that gradient-based algorithms with random
initialization can converge to the global minima of the training loss for over-parameterized (ie, …
An overview of rotating machine systems with high-temperature bulk superconductors
D Zhou, M Izumi, M Miki, B Felder, T Ida… - Superconductor …, 2012 - iopscience.iop.org
The paper contains a review of recent advancements in rotating machines with bulk high-temperature
superconductors (HTS). The high critical current density of bulk HTS enables us to …
Gradient descent optimizes over-parameterized deep ReLU networks
We study the problem of training deep fully connected neural networks with Rectified Linear
Unit (ReLU) activation function and cross entropy loss function for binary classification using …
Improving adversarial robustness requires revisiting misclassified examples
Deep neural networks (DNNs) are vulnerable to adversarial examples crafted by imperceptible
perturbations. A range of defense techniques have been proposed to improve DNN …
Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States
Short-term probabilistic forecasts of the trajectory of the COVID-19 pandemic in the United
States have served as a visible and important communication channel between the scientific …
Layer-dependent importance sampling for training deep and large graph convolutional networks
Graph convolutional networks (GCNs) have recently received wide attention, due to their
successful applications in different graph tasks and different domains. Training GCNs for a …
Bulk superconductors: a roadmap to applications
JH Durrell, MD Ainslie, D Zhou… - Superconductor …, 2018 - iopscience.iop.org
Progress in superconducting bulk materials has been somewhat overshadowed by the
considerable effort required to produce practical long-length conductors. There has, however, …
Global convergence of Langevin dynamics based algorithms for nonconvex optimization
We present a unified framework to analyze the global convergence of Langevin dynamics
based algorithms for nonconvex finite-sum optimization with $n$ component functions. At the …
Epidemic model guided machine learning for COVID-19 forecasts in the United States
We propose a new epidemic model (SuEIR) for forecasting the spread of COVID-19, including
numbers of confirmed and fatality cases at national and state levels in the United States. …
The benefits of mixup for feature learning
Mixup, a simple data augmentation method that randomly mixes two data points via linear
interpolation, has been extensively applied in various deep learning applications to gain …