ICMLW 2019

92 papers

A Functional Extension of Multi-Output Learning Alex Lambert, Romain Brault, Zoltan Szabo, Florence d'Alche-Buc
PDF OpenReview
A Meta Understanding of Meta-Learning Wei-Lun Chao, Han-Jia Ye, De-Chuan Zhan, Mark Campbell, Kilian Q. Weinberger
PDF OpenReview
A Modern Take on the Bias-Variance Tradeoff in Neural Networks Brady Neal, Sarthak Mittal, Aristide Baratin, Vinayak Tantia, Matthew Scicluna, Simon Lacoste-Julien, Ioannis Mitliagkas
PDF OpenReview
A Reinforcement Learning Approach for Joint Replenishment Policy in Multi-Product Inventory System Hiroshi Suetsugu, Yoshiaki Narusue, Hiroyuki Morikawa
PDF OpenReview
A Systematic Framework for Natural Perturbations from Videos Vaishaal Shankar, Achal Dave, Rebecca Roelofs, Deva Ramanan, Benjamin Recht, Ludwig Schmidt
PDF OpenReview
Active Multitask Learning with Committees Jingxi Xu, Da Tang, Tony Jebara
PDF OpenReview
Addressing Sample Complexity in Visual Tasks Using Hindsight Experience Replay and Hallucinatory GANs Himanshu Sahni, Toby Buckley, Pieter Abbeel, Ilya Kuzovkin
PDF OpenReview
Adversarial Training Can Hurt Generalization Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, Percy Liang
PDF OpenReview
Angular Visual Hardness Beidi Chen, Weiyang Liu, Animesh Garg, Zhiding Yu, Anshumali Shrivastava, Anima Anandkumar
PDF OpenReview
Are All Layers Created Equal? Chiyuan Zhang, Samy Bengio, Yoram Singer
PDF OpenReview
Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach Marc Brittain, Peng Wei
PDF OpenReview
Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking Syed Arbab Mohd Shihab, Caleb Logemann, Deepak-George Thomas, Peng Wei
PDF OpenReview
Bad Global Minima Exist and SGD Can Reach Them Shengchao Liu, Dimitris Papailiopoulos, Dimitris Achlioptas
PDF OpenReview
Batch Normalization Is a Cause of Adversarial Vulnerability Angus Galloway, Anna Golubeva, Thomas Tanay, Medhat Moussa, Graham W. Taylor
PDF OpenReview
Challenges of Real-World Reinforcement Learning Gabriel Dulac-Arnold, Daniel Mankowitz, Todd Hester
PDF OpenReview
Channel Normalization in Convolutional Neural Network Avoids Vanishing Gradients Zhenwei Dai and Reinhard Heckel
PDF OpenReview
Connections Between Optimization in Machine Learning and Adaptive Control Joseph E. Gaudio, Travis E. Gibson, Anuradha M. Annaswamy, Michael A. Bolender, Eugene Lavretsky
PDF OpenReview
Contextual Markov Decision Processes Using Generalized Linear Models Aditya Modi, Ambuj Tewari
PDF OpenReview
Continual Adaptation for Efficient Machine Communication Robert Hawkins, Minae Kwon, Dorsa Sadigh, Noah Goodman
PDF OpenReview
Crowdsourcing Reinforcement Learning to Optimize Knee Replacement Pathway Hao Lu, Mengdi Wang
PDF OpenReview
Curious iLQR: Resolving Uncertainty in Model-Based RL Sarah Bechtle, Akshara Rai, Yixin Lin, Ludovic Righetti, Franziska Meier
PDF OpenReview
Data Enrichment: Multi-Task Learning in High Dimension with Theoretical Guarantees Amir Asiaee, Samet Oymak, Kevin R. Coombes, Arindam Banerjee
PDF OpenReview
Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask Hattie Zhou, Janice Lan, Rosanne Liu, Jason Yosinski
PDF OpenReview
Deep Knowledge Based Agent: Learning to Do Tasks by Self-Thinking About Imaginary Worlds Ali Davody
PDF OpenReview
Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites Juan Jose Garau Luis, Markus Guerster, Inigo del Portillo, Edward Crawley, Bruce Cameron
PDF OpenReview
Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Reward Signals Gerrit Schoettler, Ashvin Nair, Jianlan Luo, Shikhar Bahl, Juan Aparicio Ojea, Eugen Solowjow, Sergey Levine
PDF OpenReview
Differentiable Hebbian Plasticity for Continual Learning Vithursan Thangarasa, Thomas Miconi, Graham W. Taylor
PDF OpenReview
Distribution-Dependent and Time-Uniform Bounds for Piecewise I.i.d Bandits Subhojyoti Mukherjee, Odalric Maillard
PDF OpenReview
Distributionally Robust Reinforcement Learning Elena Smirnova, Elvis Dohmatob, Jérémie Mary
PDF OpenReview
Do Deep Neural Networks Learn Shallow Learnable Examples First? Karttikeya Mangalam, Vinay Uday Prabhu
PDF OpenReview
DualDICE: Efficient Estimation of Off-Policy Stationary Distribution Corrections Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li
PDF OpenReview
Emergence of Implicit Filter Sparsity in Convolutional Neural Networks Dushyant Mehta, Kwang In Kim, Christian Theobalt
PDF OpenReview
Every Sample a Task: Pushing the Limits of Heterogeneous Models with Personalized Regression Ben Lengerich, Bryon Aragam, Eric Xing
PDF OpenReview
Fast Efficient Hyperparameter Tuning for Policy Gradients Supratik Paul, Vitaly Kurin, Shimon Whiteson
PDF OpenReview
Federated Optimization for Heterogeneous Networks Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet Talwalkar, Virginia Smith
PDF OpenReview
Goal-Conditioned Imitation Learning Yiming Ding, Carlos Florensa, Mariano Phielipp, Pieter Abbeel
PDF OpenReview
Horizon: Facebook’s Open Source Applied Reinforcement Learning Platform Jason Gauci, Edoardo Conti, Yitao Liang, Kittipat Virochsiri, Yuchen He, Zachary Kaden, Vivek Narayanan, Xiaohui Ye, Zhengxing Chen
PDF OpenReview
Identity Crisis: Memorization and Generalization Under Extreme Overparameterization Chiyuan Zhang, Samy Bengio, Moritz Hardt, Michael C. Mozer, Yoram Singer
PDF OpenReview
Improving Relevance Prediction with Transfer Learning in Large-Scale Retrieval Systems Ruoxi Wang, Zhe Zhao, Xinyang Yi, Ji Yang, Derek Zhiyuan Cheng, Lichan Hong, Steve Tjoa, Jieqi Kang, Evan Ettinger, Ed Chi
PDF OpenReview
Improving the Generalization of Visual Navigation Policies Using Invariance Regularization Michel Aractingi, Christopher Dance, Julien Perez, Tomi Silander
PDF OpenReview
In Support of Over-Parametrization in Deep Reinforcement Learning: An Empirical Study Brady Neal, Ioannis Mitliagkas
PDF OpenReview
Intelligent Pooling in Thompson Sampling for Rapid Personalization in Mobile Health Sabina Tomkins, Peng Liao, Serena Yeung, Predrag Klasnja, Susan Murphy
PDF OpenReview
Interpretable Robust Recommender Systems with Side Information Wenyu Chen, Zhechao Huang, Jason Cheuk Nam Liang, Zihao Xu
PDF OpenReview
Invariance-Inducing Regularization Using Worst-Case Transformations Suffices to Boost Accuracy and Spatial Robustness Fanny Yang, Zuowen Wang, Christina Heinze-Deml
PDF OpenReview
Layer Rotation: A Surprisingly Simple Indicator of Generalization in Deep Networks? Simon Carbonnelle, Christophe De Vleeschouwer
PDF OpenReview
Learning Cancer Outcomes from Heterogeneous Genomic Data Sources: An Adversarial Multi-Task Learning Approach Safoora Yousefi, Amirreza Shaban, Mohamed Amgad, Lee Cooper
PDF OpenReview
Learning Exploration Policies for Model-Agnostic Meta-Reinforcement Learning Swaminathan Gurumurthy, Sumit Kumar, Katia Sycara
PDF OpenReview
Learning to Learn to Communicate Ryan Lowe, Abhinav Gupta, Jakob Foerster, Douwe Kiela, Joelle Pineau
PDF OpenReview
Lessons from Contextual Bandit Learning in a Customer Support Bot Nikos Karampatziakis, Sebastian Kochman, Jade Huang, Paul Mineiro, Kathy Osborne, Weizhu Chen
PDF OpenReview
Lifelong Learning via Online Leverage Score Sampling Dan Teng, Sakyasingha Dasgupta
PDF OpenReview
Line Attractor Dynamics in Recurrent Networks for Sentiment Classification Niru Maheswaranathan, Alex H. Williams, Matthew D. Golub, Surya Ganguli, David Sussillo
PDF OpenReview
Lyapunov-Based Safe Policy Optimization for Continuous Control Yinlam Chow, Ofir Nachum, Aleksandra Faust, Edgar Duenez-Guzman, Mohammad Ghavamzadeh
PDF OpenReview
Memorization in Overparameterized Autoencoders Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler
PDF OpenReview
Meta-Reinforcement Learning for Adaptive Autonomous Driving Yesmina Jaafra, Jean Luc Laurent, Aline Deruyver, Mohamed Saber Naceur
PDF OpenReview
Multi-Task Learning via Task Multi-Clustering Andy Yan, Xin Wang, Ion Stoica, Joseph Gonzalez, Roy Fox
PDF OpenReview
Multinomial Logit Contextual Bandits Min-hwan Oh, Garud Iyengar
PDF OpenReview
Off-Policy Evaluation of Generalization for Deep Q-Learning in BinaryReward Tasks Alex Irpan, Kanishka Rao, Konstantinos Bousmalis, Chris Harris, Julian Ibarz, Sergey Levine
PDF OpenReview
Off-Policy Policy Gradient with State Distribution Correction Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill
PDF OpenReview
On the Convex Behavior of Deep Neural Networks in Relation to the Layers' Width Etai Littwin, Lior Wolf
PDF OpenReview
Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit Problem Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistuba
PDF OpenReview
Optimizing 3D Structure of H2O Molecule Using DDPG Soo Kyung Kim, Peggy Li, Joanne Taery Kim, Piyush Karande, Yong Han
PDF OpenReview
ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems Bharathan Balaji, Jordan Bell-Masterson, Enes Bilgin, Andreas Damianou, Pablo Moreno Garcia, Arpit Jain, Anna Luo, Alvaro Maggiar, Balakrishnan Narayanaswamy, Chun Ye
PDF OpenReview
P3O: Policy-on Policy-Off Policy Optimization Rasool Fakoor, Pratik Chaudhari, Alexander J. Smola
PDF OpenReview
PAGANDA: An Adaptive Task-Independent Automatic Data Augmentation Boli Fang, Miao Jiang, Jerry Shen
PDF OpenReview
Park: An Open Platform for Learning Augmented Computer Systems Hongzi Mao, Parimarjan Negi, Akshay Narayan, Hanrui Wang, Jiacheng Yang, Haonan Wang, Ryan Marcus, Ravichandra Addanki, Mehrdad Khani, Songtao He, Vikram Nathan, Frank Cangialosi, Shaileshh Bojja Venkatakrishnan, Wei-Hung Weng, Song Han, Tim Kraska, Mohammad Alizadeh
PDF OpenReview
Personalized Student Stress Prediction with Deep Multi-Task Network Abhinav Shaw, Natcha Simsiri, Iman Dezbani, Madelina Fiterau, Tauhidur Rahaman
PDF OpenReview
Predicting the Accuracy of Neural Networks from Final and Intermediate Layer Outputs Chad DeChant, Seungwook Han, Hod Lipson
PDF OpenReview
Progressive Memory Banks for Incremental Domain Adaptation Nabiha Asghar, Lili Mou, Kira A. Selby, Kevin D. Pantasdo, Pascal Poupart, Xin Jiang
PDF OpenReview
Prototypical Bregman Networks Kubra Cilingir, Brian Kulis
PDF OpenReview
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies Riley Simmons-Edler, Ben Eisner, Eric Mitchell, Sebastian Seung, Daniel Lee
PDF OpenReview
R-MADDPG for Partially Observable Environments and Limited Communication Rose E. Wang, Michael Everett, Jonathan P. How
PDF OpenReview
Real-World Autonomous Vehicle Control Trained Entirely Within Data-Driven Simulation Alexander Amini, Igor Gilitschenski, Jacob Phillips, Julia Moseyko, Sertac Karaman, Daniela Rus
PDF OpenReview
Real-World Video Adaptation with Reinforcement Learning Hongzi Mao, Shannon Chen, Drew Dimmery, Shaun Singh, Drew Blaisdell, Yuandong Tian, Mohammad Alizadeh, Eytan Bakshy
PDF OpenReview
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation Byung Hoon Ahn, Prannoy Pilligundla, Hadi Esmaeilzadeh
PDF OpenReview
Reinforcement Learning for Blood Glucose Control: Challenges and Opportunities Ian Fox, Jenna Wiens
PDF OpenReview
Reinforcement Learning for Sepsis Treatment: Baselines and Analysis Aniruddh Raghu
PDF OpenReview
Reinforcement Learning in the Maintenance of Civil Infrastructures Shiyin Wei, Xiaowei Jin, Hui Li
PDF OpenReview
RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning Xiyao Ma, Fan Lu, Xiajun Amy Pan, Yanlin Zhou, Xiaolin Andy Li
PDF OpenReview
Scaling Characteristics of Sequential Multitask Learning: Networks Naturally Learn to Learn Guy Davidson, Michael C. Mozer
PDF OpenReview
Sensitivity of Deep Convolutional Networks to Gabor Noise Kenneth T. Co, Luis Muñoz-González, Emil C. Lupu
PDF OpenReview
SmartChoices: Hybridizing Programming and Machine Learning Victor Carbune, Thierry Coppey, Alexander Daryin, Thomas Deselaers, Nikhil Sarda, Jay Yagnik
PDF OpenReview
Sparsity Emerges Naturally in Neural Language Models Naomi Saphra, Adam Lopez
PDF OpenReview
Staying up to Date with Online Content Changes Using Reinforcement Learning for Scheduling Andrey Kolobov, Yuval Peres, Cheng Lu, Eric Horvitz
PDF OpenReview
Sub-Policy Adaptation for Hierarchical Reinforcement Learning Alexander Li, Carlos Florensa, Pieter Abbeel
PDF OpenReview
SuperTML: Domain Transfer from Computer Vision to Structured Tabular Data Through Two-Dimensional Word Embedding Baohua Sun, Lin Yang, Wenhan Zhang, Michael Lin, Patrick Dong, Charles Young, Jason Dong
PDF OpenReview
Tasks Without Borders: A New Approach to Online Multi-Task Learning Alexander Zimin, Christoph H. Lampert
PDF OpenReview
The Difficulty of Training Sparse Neural Networks Utku Evci, Fabian Pedregosa, Aidan Gomez, Erich Elsen
PDF OpenReview
The Effect of Network Depth on the Optimization Landscape Behrooz Ghorbani, Ying Xiao, Shankar Krishnan
PDF OpenReview
The Role of Embedding Complexity in Domain-Invariant Representations Ching-Yao Chuang, Antonio Torralba, Stefanie Jegelka
PDF OpenReview
TuckER: Tensor Factorization for Knowledge Graph Completion Ivana Balazevic, Carl Allen, Timothy Hospedales
PDF OpenReview
Using Effective Dimension to Analyze Feature Transformations in Deep Neural Networks Kavya Ravichandran, Ajay Jain, Alexander Rakhlin
PDF OpenReview
VRKitchen: An Interactive 3D Environment for Learning Real Life Cooking Tasks Xiaofeng Gao, Ran Gong, Tianmin Shu, Xu Xie, Shu Wang, Song-Chun Zhu
PDF OpenReview