User profiles for Mark Rowland
Mark RowlandResearch Scientist, Google DeepMind Verified email at google.com Cited by 3792 |
Distributional reinforcement learning with quantile regression
In reinforcement learning (RL), an agent interacts with the environment by taking actions and
observing the next state and reward. When sampled probabilistically, these state transitions…
observing the next state and reward. When sampled probabilistically, these state transitions…
[HTML][HTML] Reduced efficacy of insecticide-treated nets and indoor residual spraying for malaria control in pyrethroid resistance area, Benin
…, V Corbel, M Akogbéto, M Rowland - Emerging infectious …, 2007 - ncbi.nlm.nih.gov
The pyrethroid knockdown resistance gene (kdr) has become widespread in Anopheles
gambiae in West Africa. A trial to test the continuing efficacy of insecticide-treated nets (ITN) and …
gambiae in West Africa. A trial to test the continuing efficacy of insecticide-treated nets (ITN) and …
Mastering the game of stratego with model-free multiagent reinforcement learning
We introduce DeepNash, an autonomous agent that plays the imperfect information game
Stratego at a human expert level. Stratego is one of the few iconic board games that artificial …
Stratego at a human expert level. Stratego is one of the few iconic board games that artificial …
Revisiting fundamentals of experience replay
Experience replay is central to off-policy algorithms in deep reinforcement learning (RL), but
there remain significant gaps in our understanding. We therefore present a systematic and …
there remain significant gaps in our understanding. We therefore present a systematic and …
Gaussian process behaviour in wide deep neural networks
Whilst deep neural networks have shown great empirical success, there is still much work to
be done to understand their theoretical properties. In this paper, we study the relationship …
be done to understand their theoretical properties. In this paper, we study the relationship …
The importance of mosquito behavioural adaptations to malaria control in Africa
Over the past decade the use of long-lasting insecticidal nets (LLINs), in combination with
improved drug therapies, indoor residual spraying (IRS), and better health infrastructure, has …
improved drug therapies, indoor residual spraying (IRS), and better health infrastructure, has …
[HTML][HTML] Effectiveness of a long-lasting piperonyl butoxide-treated insecticidal net and indoor residual spray interventions, separately and together, against malaria …
Background Progress in malaria control is under threat by wide-scale insecticide resistance
in malaria vectors. Two recent vector control products have been developed: a long-lasting …
in malaria vectors. Two recent vector control products have been developed: a long-lasting …
A general theoretical paradigm to understand learning from human preferences
The prevalent deployment of learning from human preferences through reinforcement
learning (RLHF) relies on two important approximations: the first assumes that pairwise …
learning (RLHF) relies on two important approximations: the first assumes that pairwise …
The Innovative Vector Control Consortium: improved control of mosquito-borne diseases
J Hemingway, BJ Beaty, M Rowland, TW Scott… - Trends in …, 2006 - cell.com
Few new insecticides have been produced for control of disease vectors for public health in
developing countries over the past three decades, owing to market constraints, and the …
developing countries over the past three decades, owing to market constraints, and the …
[HTML][HTML] Spatial repellents: from discovery and development to evidence-based validation
International public health workers are challenged by a burden of arthropod-borne disease
that remains elevated despite best efforts in control programmes. With this challenge comes …
that remains elevated despite best efforts in control programmes. With this challenge comes …