Tesauro, Gerald

50 publications

AAAI 2022 Context-Specific Representation Abstraction for Deep Option Learning Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro, Jonathan P. How

NeurIPS 2022 Influencing Long-Term Behavior in Multiagent Reinforcement Learning Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P How

ICLRW 2022 Influencing Long-Term Behavior in Multiagent Reinforcement Learning Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob Nicolaus Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P How

NeurIPSW 2022 Learning in Factored Domains with Information-Constrained Visual Representations Tailia Malloy, Chris R Sims, Tim Klinger, Matthew Riemer, Miao Liu, Gerald Tesauro

ICML 2021 A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning Dong Ki Kim, Miao Liu, Matthew D Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan How

IJCAI 2021 Efficient Black-Box Planning Using Macro-Actions with Focused Effects Cameron Allen, Michael Katz, Tim Klinger, George Konidaris, Matthew Riemer, Gerald Tesauro

AAAI 2021 RL Generalization in a Theory of Mind Game Through a Sleep Metaphor (Student Abstract) Tyler Malloy, Tim Klinger, Miao Liu, Gerald Tesauro, Matthew Riemer, Chris R. Sims

AAAI 2021 Text-Based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Pushkar Shukla, Sadhana Kumaravel, Gerald Tesauro, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell

NeurIPS 2020 Decentralized TD Tracking with Linear Function Approximation and Its Finite-Time Analysis Gang Wang, Songtao Lu, Georgios Giannakis, Gerald Tesauro, Jian Sun

AAAI 2020 On the Role of Weight Sharing During Deep Option Learning Matthew Riemer, Ignacio Cases, Clemens Rosenbaum, Miao Liu, Gerald Tesauro

AAAI 2019 Hybrid Reinforcement Learning with Expert State Sequences Xiaoxiao Guo, Shiyu Chang, Mo Yu, Gerald Tesauro, Murray Campbell

AAAI 2019 Learning to Teach in Cooperative Multiagent Reinforcement Learning Shayegan Omidshafiei, Dong-Ki Kim, Miao Liu, Gerald Tesauro, Matthew Riemer, Christopher Amato, Murray Campbell, Jonathan P. How

NeurIPS 2018 Dialog-Based Interactive Image Retrieval Xiaoxiao Guo, Hui Wu, Yu Cheng, Steven Rennie, Gerald Tesauro, Rogerio Feris

ICLR 2018 Eigenoption Discovery Through the Deep Successor Representation Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell

ICLR 2018 Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering Shuohang Wang, Mo Yu, Jing Jiang, Wei Zhang, Xiaoxiao Guo, Shiyu Chang, Zhiguo Wang, Tim Klinger, Gerald Tesauro, Murray Campbell

NeurIPS 2018 Learning Abstract Options Matthew Riemer, Miao Liu, Gerald Tesauro

AAAI 2017 Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation Iulian Vlad Serban, Tim Klinger, Gerald Tesauro, Kartik Talamadupula, Bowen Zhou, Yoshua Bengio, Aaron C. Courville

AAAI 2017 Optimal Sequential Drilling for Hydrocarbon Field Development Planning Ruben Rodriguez Torrado, Jesus Rios, Gerald Tesauro

AAAI 2016 Selecting Near-Optimal Learners via Incremental Data Allocation Ashish Sabharwal, Horst Samulowitz, Gerald Tesauro

AAAI 2015 Budgeted Prediction with Expert Advice Kareem Amin, Satyen Kale, Gerald Tesauro, Deepak S. Turaga

AAAI 2015 Towards Cognitive Automation of Data Science Alain Biem, Maria Butrico, Mark Feblowitz, Tim Klinger, Yuri Malitsky, Kenney Ng, Adam Perer, Chandra Reddy, Anton Riabov, Horst Samulowitz, Daby M. Sow, Gerald Tesauro, Deepak S. Turaga

UAI 2010 Bayesian Inference in Monte-Carlo Tree Search Gerald Tesauro, V. T. Rajan, Richard B. Segal

ICML 2009 Monte-Carlo Simulation Balancing David Silver, Gerald Tesauro

NeurIPS 2007 Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey Kephart, David Levine, Freeman Rawson, Charles Lefurgy

AISTATS 2007 Metric Learning for Kernel Regression Kilian Q. Weinberger, Gerald Tesauro

ECML-PKDD 2006 Improvement of Systems Management Policies Using Hybrid Reinforcement Learning Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani

AAAI 2005 New Approaches to Optimization and Utility Elicitation in Autonomic Computing Relu Patrascu, Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh

AAAI 2005 Online Resource Allocation Using Decompositional Reinforcement Learning Gerald Tesauro

UAI 2003 Cooperative Negotiation in Autonomic Systems Using Incremental Utility Elicitation Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh

NeurIPS 2003 Extending Q-Learning to General Adaptive Multi-Agent Systems Gerald Tesauro

IJCAI 2001 Agent-Human Interactions in the Continuous Double Auction Rajarshi Das, James E. Hanson, Jeffrey O. Kephart, Gerald Tesauro

ICML 2000 Multi-Agent Q-Learning and Regression Trees for Automated Pricing Decisions Manu Sridharan, Gerald Tesauro

ICML 2000 Pseudo-Convergent Q-Learning by Competitive Pricebots Jeffrey O. Kephart, Gerald Tesauro

MLJ 1998 Comments on "Co-Evolution in the Successful Learning of Backgammon Strategy" Gerald Tesauro

NeurIPS 1996 On-Line Policy Improvement Using Monte-Carlo Search Gerald Tesauro, Gregory R. Galperin

IJCAI 1995 Biologically Inspired Defenses Against Computer Viruses Jeffrey O. Kephart, Gregory B. Sorkin, William C. Arnold, David M. Chess, Gerald Tesauro, Steve R. White

NeCo 1994 TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play Gerald Tesauro

NeCo 1992 How Tight Are the Vapnik-Chervonenkis Bounds? David A. Cohn, Gerald Tesauro

MLJ 1992 Practical Issues in Temporal Difference Learning Gerald Tesauro

ICML 1992 Temporal Difference Learning of Backgammon Strategy Gerald Tesauro

NeurIPS 1991 Practical Issues in Temporal Difference Learning Gerald Tesauro

NeurIPS 1990 Can Neural Networks Do Better than the Vapnik-Chervonenkis Bounds? David Cohn, Gerald Tesauro

NeCo 1989 Asymptotic Convergence of Backpropagation Gerald Tesauro, Yu He, Subutai Ahmad

NeurIPS 1989 Asymptotic Convergence of Backpropagation: Numerical Experiments Subutai Ahmad, Gerald Tesauro, Yu He

NeurIPS 1989 Neural Network Visualization Jakub Wejchert, Gerald Tesauro

NeCo 1989 Neurogammon Wins Computer Olympiad Gerald Tesauro

ICML 1988 Connectionist Learning of Expert Backgammon Evaluations Gerald Tesauro

NeurIPS 1988 Connectionist Learning of Expert Preferences by Comparison Training Gerald Tesauro

NeurIPS 1988 Scaling and Generalization in Neural Networks: A Case Study Subutai Ahmad, Gerald Tesauro

NeurIPS 1987 A 'Neural' Network That Learns to Play Backgammon Gerald Tesauro, Terrence J. Sejnowski