ICMLW 2019

92 papers

A Functional Extension of Multi-Output Learning Alex Lambert, Romain Brault, Zoltan Szabo, Florence d'Alche-Buc

A Meta Understanding of Meta-Learning Wei-Lun Chao, Han-Jia Ye, De-Chuan Zhan, Mark Campbell, Kilian Q. Weinberger

A Modern Take on the Bias-Variance Tradeoff in Neural Networks Brady Neal, Sarthak Mittal, Aristide Baratin, Vinayak Tantia, Matthew Scicluna, Simon Lacoste-Julien, Ioannis Mitliagkas

PDF OpenReview

A Reinforcement Learning Approach for Joint Replenishment Policy in Multi-Product Inventory System Hiroshi Suetsugu, Yoshiaki Narusue, Hiroyuki Morikawa

PDF OpenReview

A Systematic Framework for Natural Perturbations from Videos Vaishaal Shankar, Achal Dave, Rebecca Roelofs, Deva Ramanan, Benjamin Recht, Ludwig Schmidt

PDF OpenReview

Active Multitask Learning with Committees Jingxi Xu, Da Tang, Tony Jebara

PDF OpenReview

Addressing Sample Complexity in Visual Tasks Using Hindsight Experience Replay and Hallucinatory GANs Himanshu Sahni, Toby Buckley, Pieter Abbeel, Ilya Kuzovkin

PDF OpenReview

Adversarial Training Can Hurt Generalization Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, Percy Liang

PDF OpenReview

Angular Visual Hardness Beidi Chen, Weiyang Liu, Animesh Garg, Zhiding Yu, Anshumali Shrivastava, Anima Anandkumar

PDF OpenReview

Are All Layers Created Equal? Chiyuan Zhang, Samy Bengio, Yoram Singer

PDF OpenReview

Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach Marc Brittain, Peng Wei

PDF OpenReview

Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking Syed Arbab Mohd Shihab, Caleb Logemann, Deepak-George Thomas, Peng Wei

PDF OpenReview

Bad Global Minima Exist and SGD Can Reach Them Shengchao Liu, Dimitris Papailiopoulos, Dimitris Achlioptas

PDF OpenReview

Batch Normalization Is a Cause of Adversarial Vulnerability Angus Galloway, Anna Golubeva, Thomas Tanay, Medhat Moussa, Graham W. Taylor

PDF OpenReview

Challenges of Real-World Reinforcement Learning Gabriel Dulac-Arnold, Daniel Mankowitz, Todd Hester

PDF OpenReview

Channel Normalization in Convolutional Neural Network Avoids Vanishing Gradients Zhenwei Dai and Reinhard Heckel

PDF OpenReview

Connections Between Optimization in Machine Learning and Adaptive Control Joseph E. Gaudio, Travis E. Gibson, Anuradha M. Annaswamy, Michael A. Bolender, Eugene Lavretsky

PDF OpenReview

Contextual Markov Decision Processes Using Generalized Linear Models Aditya Modi, Ambuj Tewari

PDF OpenReview

Continual Adaptation for Efficient Machine Communication Robert Hawkins, Minae Kwon, Dorsa Sadigh, Noah Goodman

PDF OpenReview

Crowdsourcing Reinforcement Learning to Optimize Knee Replacement Pathway Hao Lu, Mengdi Wang

PDF OpenReview

Curious iLQR: Resolving Uncertainty in Model-Based RL Sarah Bechtle, Akshara Rai, Yixin Lin, Ludovic Righetti, Franziska Meier

PDF OpenReview

Data Enrichment: Multi-Task Learning in High Dimension with Theoretical Guarantees Amir Asiaee, Samet Oymak, Kevin R. Coombes, Arindam Banerjee

PDF OpenReview

Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask Hattie Zhou, Janice Lan, Rosanne Liu, Jason Yosinski

PDF OpenReview

Deep Knowledge Based Agent: Learning to Do Tasks by Self-Thinking About Imaginary Worlds Ali Davody

PDF OpenReview

Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites Juan Jose Garau Luis, Markus Guerster, Inigo del Portillo, Edward Crawley, Bruce Cameron

PDF OpenReview

Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Reward Signals Gerrit Schoettler, Ashvin Nair, Jianlan Luo, Shikhar Bahl, Juan Aparicio Ojea, Eugen Solowjow, Sergey Levine

PDF OpenReview

Differentiable Hebbian Plasticity for Continual Learning Vithursan Thangarasa, Thomas Miconi, Graham W. Taylor

PDF OpenReview

Distribution-Dependent and Time-Uniform Bounds for Piecewise I.i.d Bandits Subhojyoti Mukherjee, Odalric Maillard

PDF OpenReview

Distributionally Robust Reinforcement Learning Elena Smirnova, Elvis Dohmatob, Jérémie Mary

PDF OpenReview

Do Deep Neural Networks Learn Shallow Learnable Examples First? Karttikeya Mangalam, Vinay Uday Prabhu

PDF OpenReview

DualDICE: Efficient Estimation of Off-Policy Stationary Distribution Corrections Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li

PDF OpenReview

Emergence of Implicit Filter Sparsity in Convolutional Neural Networks Dushyant Mehta, Kwang In Kim, Christian Theobalt

PDF OpenReview

Every Sample a Task: Pushing the Limits of Heterogeneous Models with Personalized Regression Ben Lengerich, Bryon Aragam, Eric Xing

PDF OpenReview

Fast Efficient Hyperparameter Tuning for Policy Gradients Supratik Paul, Vitaly Kurin, Shimon Whiteson

PDF OpenReview

Federated Optimization for Heterogeneous Networks Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet Talwalkar, Virginia Smith

PDF OpenReview

Goal-Conditioned Imitation Learning Yiming Ding, Carlos Florensa, Mariano Phielipp, Pieter Abbeel

PDF OpenReview

Horizon: Facebook’s Open Source Applied Reinforcement Learning Platform Jason Gauci, Edoardo Conti, Yitao Liang, Kittipat Virochsiri, Yuchen He, Zachary Kaden, Vivek Narayanan, Xiaohui Ye, Zhengxing Chen

PDF OpenReview

Identity Crisis: Memorization and Generalization Under Extreme Overparameterization Chiyuan Zhang, Samy Bengio, Moritz Hardt, Michael C. Mozer, Yoram Singer

PDF OpenReview

Improving Relevance Prediction with Transfer Learning in Large-Scale Retrieval Systems Ruoxi Wang, Zhe Zhao, Xinyang Yi, Ji Yang, Derek Zhiyuan Cheng, Lichan Hong, Steve Tjoa, Jieqi Kang, Evan Ettinger, Ed Chi

PDF OpenReview

Improving the Generalization of Visual Navigation Policies Using Invariance Regularization Michel Aractingi, Christopher Dance, Julien Perez, Tomi Silander

PDF OpenReview

In Support of Over-Parametrization in Deep Reinforcement Learning: An Empirical Study Brady Neal, Ioannis Mitliagkas

PDF OpenReview

Intelligent Pooling in Thompson Sampling for Rapid Personalization in Mobile Health Sabina Tomkins, Peng Liao, Serena Yeung, Predrag Klasnja, Susan Murphy

PDF OpenReview

Interpretable Robust Recommender Systems with Side Information Wenyu Chen, Zhechao Huang, Jason Cheuk Nam Liang, Zihao Xu

PDF OpenReview

Invariance-Inducing Regularization Using Worst-Case Transformations Suffices to Boost Accuracy and Spatial Robustness Fanny Yang, Zuowen Wang, Christina Heinze-Deml

PDF OpenReview

Layer Rotation: A Surprisingly Simple Indicator of Generalization in Deep Networks? Simon Carbonnelle, Christophe De Vleeschouwer

PDF OpenReview

Learning Cancer Outcomes from Heterogeneous Genomic Data Sources: An Adversarial Multi-Task Learning Approach Safoora Yousefi, Amirreza Shaban, Mohamed Amgad, Lee Cooper

PDF OpenReview

Learning Exploration Policies for Model-Agnostic Meta-Reinforcement Learning Swaminathan Gurumurthy, Sumit Kumar, Katia Sycara

PDF OpenReview

Learning to Learn to Communicate Ryan Lowe, Abhinav Gupta, Jakob Foerster, Douwe Kiela, Joelle Pineau

PDF OpenReview

Lessons from Contextual Bandit Learning in a Customer Support Bot Nikos Karampatziakis, Sebastian Kochman, Jade Huang, Paul Mineiro, Kathy Osborne, Weizhu Chen

PDF OpenReview

Lifelong Learning via Online Leverage Score Sampling Dan Teng, Sakyasingha Dasgupta

PDF OpenReview

Line Attractor Dynamics in Recurrent Networks for Sentiment Classiﬁcation Niru Maheswaranathan, Alex H. Williams, Matthew D. Golub, Surya Ganguli, David Sussillo

PDF OpenReview

Lyapunov-Based Safe Policy Optimization for Continuous Control Yinlam Chow, Ofir Nachum, Aleksandra Faust, Edgar Duenez-Guzman, Mohammad Ghavamzadeh

PDF OpenReview

Memorization in Overparameterized Autoencoders Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

PDF OpenReview

Meta-Reinforcement Learning for Adaptive Autonomous Driving Yesmina Jaafra, Jean Luc Laurent, Aline Deruyver, Mohamed Saber Naceur

PDF OpenReview

Multi-Task Learning via Task Multi-Clustering Andy Yan, Xin Wang, Ion Stoica, Joseph Gonzalez, Roy Fox

PDF OpenReview

Multinomial Logit Contextual Bandits Min-hwan Oh, Garud Iyengar

PDF OpenReview

Off-Policy Evaluation of Generalization for Deep Q-Learning in BinaryReward Tasks Alex Irpan, Kanishka Rao, Konstantinos Bousmalis, Chris Harris, Julian Ibarz, Sergey Levine

PDF OpenReview

Off-Policy Policy Gradient with State Distribution Correction Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

PDF OpenReview

On the Convex Behavior of Deep Neural Networks in Relation to the Layers' Width Etai Littwin, Lior Wolf

PDF OpenReview

Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit Problem Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistuba

PDF OpenReview

Optimizing 3D Structure of H2O Molecule Using DDPG Soo Kyung Kim, Peggy Li, Joanne Taery Kim, Piyush Karande, Yong Han

PDF OpenReview

ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems Bharathan Balaji, Jordan Bell-Masterson, Enes Bilgin, Andreas Damianou, Pablo Moreno Garcia, Arpit Jain, Anna Luo, Alvaro Maggiar, Balakrishnan Narayanaswamy, Chun Ye

PDF OpenReview

P3O: Policy-on Policy-Off Policy Optimization Rasool Fakoor, Pratik Chaudhari, Alexander J. Smola

PDF OpenReview

PAGANDA: An Adaptive Task-Independent Automatic Data Augmentation Boli Fang, Miao Jiang, Jerry Shen

PDF OpenReview

Park: An Open Platform for Learning Augmented Computer Systems Hongzi Mao, Parimarjan Negi, Akshay Narayan, Hanrui Wang, Jiacheng Yang, Haonan Wang, Ryan Marcus, Ravichandra Addanki, Mehrdad Khani, Songtao He, Vikram Nathan, Frank Cangialosi, Shaileshh Bojja Venkatakrishnan, Wei-Hung Weng, Song Han, Tim Kraska, Mohammad Alizadeh

PDF OpenReview

Personalized Student Stress Prediction with Deep Multi-Task Network Abhinav Shaw, Natcha Simsiri, Iman Dezbani, Madelina Fiterau, Tauhidur Rahaman

PDF OpenReview

Predicting the Accuracy of Neural Networks from Final and Intermediate Layer Outputs Chad DeChant, Seungwook Han, Hod Lipson

PDF OpenReview

Progressive Memory Banks for Incremental Domain Adaptation Nabiha Asghar, Lili Mou, Kira A. Selby, Kevin D. Pantasdo, Pascal Poupart, Xin Jiang

PDF OpenReview

Prototypical Bregman Networks Kubra Cilingir, Brian Kulis

PDF OpenReview

Q-Learning for Continuous Actions with Cross-Entropy Guided Policies Riley Simmons-Edler, Ben Eisner, Eric Mitchell, Sebastian Seung, Daniel Lee

PDF OpenReview

R-MADDPG for Partially Observable Environments and Limited Communication Rose E. Wang, Michael Everett, Jonathan P. How

PDF OpenReview

Real-World Autonomous Vehicle Control Trained Entirely Within Data-Driven Simulation Alexander Amini, Igor Gilitschenski, Jacob Phillips, Julia Moseyko, Sertac Karaman, Daniela Rus

PDF OpenReview

Real-World Video Adaptation with Reinforcement Learning Hongzi Mao, Shannon Chen, Drew Dimmery, Shaun Singh, Drew Blaisdell, Yuandong Tian, Mohammad Alizadeh, Eytan Bakshy

PDF OpenReview

Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation Byung Hoon Ahn, Prannoy Pilligundla, Hadi Esmaeilzadeh

PDF OpenReview

Reinforcement Learning for Blood Glucose Control: Challenges and Opportunities Ian Fox, Jenna Wiens

PDF OpenReview

Reinforcement Learning for Sepsis Treatment: Baselines and Analysis Aniruddh Raghu

PDF OpenReview

Reinforcement Learning in the Maintenance of Civil Infrastructures Shiyin Wei, Xiaowei Jin, Hui Li

PDF OpenReview

RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning Xiyao Ma, Fan Lu, Xiajun Amy Pan, Yanlin Zhou, Xiaolin Andy Li

PDF OpenReview

Scaling Characteristics of Sequential Multitask Learning: Networks Naturally Learn to Learn Guy Davidson, Michael C. Mozer

PDF OpenReview

Sensitivity of Deep Convolutional Networks to Gabor Noise Kenneth T. Co, Luis Muñoz-González, Emil C. Lupu

PDF OpenReview

SmartChoices: Hybridizing Programming and Machine Learning Victor Carbune, Thierry Coppey, Alexander Daryin, Thomas Deselaers, Nikhil Sarda, Jay Yagnik

PDF OpenReview

Sparsity Emerges Naturally in Neural Language Models Naomi Saphra, Adam Lopez

PDF OpenReview

Staying up to Date with Online Content Changes Using Reinforcement Learning for Scheduling Andrey Kolobov, Yuval Peres, Cheng Lu, Eric Horvitz

PDF OpenReview

Sub-Policy Adaptation for Hierarchical Reinforcement Learning Alexander Li, Carlos Florensa, Pieter Abbeel

PDF OpenReview

SuperTML: Domain Transfer from Computer Vision to Structured Tabular Data Through Two-Dimensional Word Embedding Baohua Sun, Lin Yang, Wenhan Zhang, Michael Lin, Patrick Dong, Charles Young, Jason Dong

PDF OpenReview

Tasks Without Borders: A New Approach to Online Multi-Task Learning Alexander Zimin, Christoph H. Lampert

PDF OpenReview

The Difficulty of Training Sparse Neural Networks Utku Evci, Fabian Pedregosa, Aidan Gomez, Erich Elsen

PDF OpenReview

The Effect of Network Depth on the Optimization Landscape Behrooz Ghorbani, Ying Xiao, Shankar Krishnan

PDF OpenReview

The Role of Embedding Complexity in Domain-Invariant Representations Ching-Yao Chuang, Antonio Torralba, Stefanie Jegelka

PDF OpenReview

TuckER: Tensor Factorization for Knowledge Graph Completion Ivana Balazevic, Carl Allen, Timothy Hospedales

PDF OpenReview

Using Effective Dimension to Analyze Feature Transformations in Deep Neural Networks Kavya Ravichandran, Ajay Jain, Alexander Rakhlin

PDF OpenReview

VRKitchen: An Interactive 3D Environment for Learning Real Life Cooking Tasks Xiaofeng Gao, Ran Gong, Tianmin Shu, Xu Xie, Shu Wang, Song-Chun Zhu

PDF OpenReview