Tesauro, Gerald

50 publications

AAAI 2022 Context-Specific Representation Abstraction for Deep Option Learning Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro, Jonathan P. How
NeurIPS 2022 Influencing Long-Term Behavior in Multiagent Reinforcement Learning Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P How
ICLRW 2022 Influencing Long-Term Behavior in Multiagent Reinforcement Learning Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob Nicolaus Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P How
NeurIPSW 2022 Learning in Factored Domains with Information-Constrained Visual Representations Tailia Malloy, Chris R Sims, Tim Klinger, Matthew Riemer, Miao Liu, Gerald Tesauro
ICML 2021 A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning Dong Ki Kim, Miao Liu, Matthew D Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan How
IJCAI 2021 Efficient Black-Box Planning Using Macro-Actions with Focused Effects Cameron Allen, Michael Katz, Tim Klinger, George Konidaris, Matthew Riemer, Gerald Tesauro
AAAI 2021 RL Generalization in a Theory of Mind Game Through a Sleep Metaphor (Student Abstract) Tyler Malloy, Tim Klinger, Miao Liu, Gerald Tesauro, Matthew Riemer, Chris R. Sims
AAAI 2021 Text-Based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Pushkar Shukla, Sadhana Kumaravel, Gerald Tesauro, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell
NeurIPS 2020 Decentralized TD Tracking with Linear Function Approximation and Its Finite-Time Analysis Gang Wang, Songtao Lu, Georgios Giannakis, Gerald Tesauro, Jian Sun
AAAI 2020 On the Role of Weight Sharing During Deep Option Learning Matthew Riemer, Ignacio Cases, Clemens Rosenbaum, Miao Liu, Gerald Tesauro
AAAI 2019 Hybrid Reinforcement Learning with Expert State Sequences Xiaoxiao Guo, Shiyu Chang, Mo Yu, Gerald Tesauro, Murray Campbell
AAAI 2019 Learning to Teach in Cooperative Multiagent Reinforcement Learning Shayegan Omidshafiei, Dong-Ki Kim, Miao Liu, Gerald Tesauro, Matthew Riemer, Christopher Amato, Murray Campbell, Jonathan P. How
NeurIPS 2018 Dialog-Based Interactive Image Retrieval Xiaoxiao Guo, Hui Wu, Yu Cheng, Steven Rennie, Gerald Tesauro, Rogerio Feris
ICLR 2018 Eigenoption Discovery Through the Deep Successor Representation Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell
ICLR 2018 Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering Shuohang Wang, Mo Yu, Jing Jiang, Wei Zhang, Xiaoxiao Guo, Shiyu Chang, Zhiguo Wang, Tim Klinger, Gerald Tesauro, Murray Campbell
NeurIPS 2018 Learning Abstract Options Matthew Riemer, Miao Liu, Gerald Tesauro
AAAI 2017 Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation Iulian Vlad Serban, Tim Klinger, Gerald Tesauro, Kartik Talamadupula, Bowen Zhou, Yoshua Bengio, Aaron C. Courville
AAAI 2017 Optimal Sequential Drilling for Hydrocarbon Field Development Planning Ruben Rodriguez Torrado, Jesus Rios, Gerald Tesauro
AAAI 2016 Selecting Near-Optimal Learners via Incremental Data Allocation Ashish Sabharwal, Horst Samulowitz, Gerald Tesauro
AAAI 2015 Budgeted Prediction with Expert Advice Kareem Amin, Satyen Kale, Gerald Tesauro, Deepak S. Turaga
AAAI 2015 Towards Cognitive Automation of Data Science Alain Biem, Maria Butrico, Mark Feblowitz, Tim Klinger, Yuri Malitsky, Kenney Ng, Adam Perer, Chandra Reddy, Anton Riabov, Horst Samulowitz, Daby M. Sow, Gerald Tesauro, Deepak S. Turaga
UAI 2010 Bayesian Inference in Monte-Carlo Tree Search Gerald Tesauro, V. T. Rajan, Richard B. Segal
ICML 2009 Monte-Carlo Simulation Balancing David Silver, Gerald Tesauro
NeurIPS 2007 Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey Kephart, David Levine, Freeman Rawson, Charles Lefurgy
AISTATS 2007 Metric Learning for Kernel Regression Kilian Q. Weinberger, Gerald Tesauro
ECML-PKDD 2006 Improvement of Systems Management Policies Using Hybrid Reinforcement Learning Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani
AAAI 2005 New Approaches to Optimization and Utility Elicitation in Autonomic Computing Relu Patrascu, Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh
AAAI 2005 Online Resource Allocation Using Decompositional Reinforcement Learning Gerald Tesauro
UAI 2003 Cooperative Negotiation in Autonomic Systems Using Incremental Utility Elicitation Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh
NeurIPS 2003 Extending Q-Learning to General Adaptive Multi-Agent Systems Gerald Tesauro
IJCAI 2001 Agent-Human Interactions in the Continuous Double Auction Rajarshi Das, James E. Hanson, Jeffrey O. Kephart, Gerald Tesauro
ICML 2000 Multi-Agent Q-Learning and Regression Trees for Automated Pricing Decisions Manu Sridharan, Gerald Tesauro
ICML 2000 Pseudo-Convergent Q-Learning by Competitive Pricebots Jeffrey O. Kephart, Gerald Tesauro
MLJ 1998 Comments on "Co-Evolution in the Successful Learning of Backgammon Strategy" Gerald Tesauro
NeurIPS 1996 On-Line Policy Improvement Using Monte-Carlo Search Gerald Tesauro, Gregory R. Galperin
IJCAI 1995 Biologically Inspired Defenses Against Computer Viruses Jeffrey O. Kephart, Gregory B. Sorkin, William C. Arnold, David M. Chess, Gerald Tesauro, Steve R. White
NeCo 1994 TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play Gerald Tesauro
NeCo 1992 How Tight Are the Vapnik-Chervonenkis Bounds? David A. Cohn, Gerald Tesauro
MLJ 1992 Practical Issues in Temporal Difference Learning Gerald Tesauro
ICML 1992 Temporal Difference Learning of Backgammon Strategy Gerald Tesauro
NeurIPS 1991 Practical Issues in Temporal Difference Learning Gerald Tesauro
NeurIPS 1990 Can Neural Networks Do Better than the Vapnik-Chervonenkis Bounds? David Cohn, Gerald Tesauro
NeCo 1989 Asymptotic Convergence of Backpropagation Gerald Tesauro, Yu He, Subutai Ahmad
NeurIPS 1989 Asymptotic Convergence of Backpropagation: Numerical Experiments Subutai Ahmad, Gerald Tesauro, Yu He
NeurIPS 1989 Neural Network Visualization Jakub Wejchert, Gerald Tesauro
NeCo 1989 Neurogammon Wins Computer Olympiad Gerald Tesauro
ICML 1988 Connectionist Learning of Expert Backgammon Evaluations Gerald Tesauro
NeurIPS 1988 Connectionist Learning of Expert Preferences by Comparison Training Gerald Tesauro
NeurIPS 1988 Scaling and Generalization in Neural Networks: A Case Study Subutai Ahmad, Gerald Tesauro
NeurIPS 1987 A 'Neural' Network That Learns to Play Backgammon Gerald Tesauro, Terrence J. Sejnowski