GSTDTAP

浏览/检索结果: 共92条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Real-time data from mobile platforms to evaluate sustainable transportation infrastructure 期刊论文
NATURE SUSTAINABILITY, 2020, 3 (6) : 463-+
作者:  Asensio, Omar Isaac;  Alvarez, Kevin;  Dror, Arielle;  Wenzel, Emerson;  Hollauer, Catharina;  Ha, Sooji
收藏  |  浏览/下载:8/0  |  提交时间:2020/06/09
Juvenile cleaner fish can socially learn the consequences of cheating 期刊论文
NATURE COMMUNICATIONS, 2020, 11 (1)
作者:  Truskanov, Noa;  Emery, Yasmin;  Bshary, Redouan
收藏  |  浏览/下载:2/0  |  提交时间:2020/05/13
A distributional code for value in dopamine-based reinforcement learning 期刊论文
NATURE, 2020, 577 (7792) : 671-+
作者:  House, Robert A.;  Maitra, Urmimala;  Perez-Osorio, Miguel A.;  Lozano, Juan G.;  Jin, Liyu;  Somerville, James W.;  Duda, Laurent C.;  Nag, Abhishek;  Walters, Andrew;  Zhou, Ke-Jin;  Roberts, Matthew R.;  Bruce, Peter G.
收藏  |  浏览/下载:61/0  |  提交时间:2020/07/03

Since its introduction, the reward prediction error theory of dopamine has explained a wealth of empirical phenomena, providing a unifying framework for understanding the representation of reward and value in the brain(1-3). According to the now canonical theory, reward predictions are represented as a single scalar quantity, which supports learning about the expectation, or mean, of stochastic outcomes. Here we propose an account of dopamine-based reinforcement learning inspired by recent artificial intelligence research on distributional reinforcement learning(4-6). We hypothesized that the brain represents possible future rewards not as a single mean, but instead as a probability distribution, effectively representing multiple future outcomes simultaneously and in parallel. This idea implies a set of empirical predictions, which we tested using single-unit recordings from mouse ventral tegmental area. Our findings provide strong evidence for a neural realization of distributional reinforcement learning.


Analyses of single-cell recordings from mouse ventral tegmental area are consistent with a model of reinforcement learning in which the brain represents possible future rewards not as a single mean of stochastic outcomes, as in the canonical model, but instead as a probability distribution.


  
Deep learning takes on tumours 期刊论文
NATURE, 2020, 580 (7804) : 551-553
作者:  Dance, Amber
收藏  |  浏览/下载:0/0  |  提交时间:2020/07/03

Artificial-intelligence methods are moving into cancer research.


Artificial-intelligence methods are moving into cancer research.


  
Dopamine D2 receptors in discrimination learning and spine enlargement 期刊论文
NATURE, 2020, 579 (7800) : 555-+
作者:  Luo, Zhaochu;  Hrabec, Ales;  Dao, Trong Phuong;  Sala, Giacomo;  Finizio, Simone;  Feng, Junxiao;  Mayr, Sina;  Raabe, Joerg;  Gambardella, Pietro;  Heyderman, Laura J.
收藏  |  浏览/下载:24/0  |  提交时间:2020/07/03

Detection of dopamine dips by neurons that express dopamine D2 receptors in the striatum is used to refine generalized reward conditioning mediated by dopamine D1 receptors.


Dopamine D2 receptors (D2Rs) are densely expressed in the striatum and have been linked to neuropsychiatric disorders such as schizophrenia(1,2). High-affinity binding of dopamine suggests that D2Rs detect transient reductions in dopamine concentration (the dopamine dip) during punishment learning(3-5). However, the nature and cellular basis of D2R-dependent behaviour are unclear. Here we show that tone reward conditioning induces marked stimulus generalization in a manner that depends on dopamine D1 receptors (D1Rs) in the nucleus accumbens (NAc) of mice, and that discrimination learning refines the conditioning using a dopamine dip. In NAc slices, a narrow dopamine dip (as short as 0.4 s) was detected by D2Rs to disinhibit adenosine A(2A) receptor (A(2A)R)-mediated enlargement of dendritic spines in D2R-expressing spiny projection neurons (D2-SPNs). Plasticity-related signalling by Ca2+/calmodulin-dependent protein kinase II and A(2A)Rs in the NAc was required for discrimination learning. By contrast, extinction learning did not involve dopamine dips or D2-SPNs. Treatment with methamphetamine, which dysregulates dopamine signalling, impaired discrimination learning and spine enlargement, and these impairments were reversed by a D2R antagonist. Our data show that D2Rs refine the generalized reward learning mediated by D1Rs.


  
Unraveling the Historical Economies of Scale and Learning Effects for Desalination Technologies 期刊论文
WATER RESOURCES RESEARCH, 2020, 56 (2)
作者:  Mayor, B.
收藏  |  浏览/下载:6/0  |  提交时间:2020/07/02
Desalination  Cost  Economies of scale  Learning  
Learning to teach 期刊论文
SCIENCE, 2019, 366 (6472) : 1574-1574
作者:  Khan, Firdous A.
收藏  |  浏览/下载:0/0  |  提交时间:2020/02/17
Preventing undesirable behavior of intelligent machines 期刊论文
SCIENCE, 2019, 366 (6468) : 999-+
作者:  Thomas, Philip S.;  da Silva, Bruno Castro;  Barto, Andrew G.;  Giguere, Stephen;  Brun, Yuriy;  Brunskill, Emma
收藏  |  浏览/下载:10/0  |  提交时间:2020/02/17
Response to Comment on "Cultural flies: Conformist social learning in fruitflies predicts long-lasting mate-choice traditions" 期刊论文
SCIENCE, 2019, 366 (6462)
作者:  Pocheville, Arnaud;  Nobel, Sabine;  Isabel, Guillaume;  Danchin, Etienne
收藏  |  浏览/下载:5/0  |  提交时间:2019/11/27
Comment on "Cultural flies: Conformist social learning in fruitflies predicts long-lasting mate-choice traditions" 期刊论文
SCIENCE, 2019, 366 (6462)
作者:  Thornquist, Stephen C.;  Crickmore, Michael A.
收藏  |  浏览/下载:0/0  |  提交时间:2019/11/27