ICML 2017

434 papers

“Convex Until Proven Guilty”: Dimension-Free Acceleration of Gradient Descent on Non-Convex Functions Yair Carmon, John C. Duchi, Oliver Hinder, Aaron Sidford
PDF
A Birth-Death Process for Feature Allocation Konstantina Palla, David Knowles, Zoubin Ghahramani
PDF
A Closer Look at Memorization in Deep Networks Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron Courville, Yoshua Bengio, Simon Lacoste-Julien
PDF
A Distributional Perspective on Reinforcement Learning Marc G. Bellemare, Will Dabney, Rémi Munos
PDF
A Divergence Bound for Hybrids of MCMC and Variational Inference and an Application to Langevin Dynamics and SGVI Justin Domke
PDF
A Laplacian Framework for Option Discovery in Reinforcement Learning Marlos C. Machado, Marc G. Bellemare, Michael Bowling
PDF
A Richer Theory of Convex Constrained Optimization with Reduced Projections and Improved Rates Tianbao Yang, Qihang Lin, Lijun Zhang
PDF
A Semismooth Newton Method for Fast, Generic Convex Programming Alnur Ali, Eric Wong, J. Zico Kolter
PDF
A Simple Multi-Class Boosting Framework with Theoretical Guarantees and Empirical Proficiency Ron Appel, Pietro Perona
PDF
A Simulated Annealing Based Inexact Oracle for Wasserstein Loss Minimization Jianbo Ye, James Z. Wang, Jia Li
PDF
A Unified Maximum Likelihood Approach for Estimating Symmetric Properties of Discrete Distributions Jayadev Acharya, Hirakendu Das, Alon Orlitsky, Ananda Theertha Suresh
PDF
A Unified Variance Reduction-Based Framework for Nonconvex Low-Rank Matrix Recovery Lingxiao Wang, Xiao Zhang, Quanquan Gu
PDF
A Unified View of Multi-Label Performance Measures Xi-Zhu Wu, Zhi-Hua Zhou
PDF
Accelerating Eulerian Fluid Simulation with Convolutional Networks Jonathan Tompson, Kristofer Schlachter, Pablo Sprechmann, Ken Perlin
PDF
Active Heteroscedastic Regression Kamalika Chaudhuri, Prateek Jain, Nagarajan Natarajan
PDF
Active Learning for Accurate Estimation of Linear Models Carlos Riquelme, Mohammad Ghavamzadeh, Alessandro Lazaric
PDF
Active Learning for Cost-Sensitive Classification Akshay Krishnamurthy, Alekh Agarwal, Tzu-Kuo Huang, Hal Daumé, John Langford
PDF
Active Learning for Top-$k$ Rank Aggregation from Noisy Comparisons Soheil Mohajer, Changho Suh, Adel Elmahdy
PDF
AdaNet: Adaptive Structural Learning of Artificial Neural Networks Corinna Cortes, Xavier Gonzalvo, Vitaly Kuznetsov, Mehryar Mohri, Scott Yang
PDF
Adapting Kernel Representations Online Using Submodular Maximization Matthew Schlegel, Yangchen Pan, Jiecao Chen, Martha White
PDF
Adaptive Consensus ADMM for Distributed Optimization Zheng Xu, Gavin Taylor, Hao Li, Mário A. T. Figueiredo, Xiaoming Yuan, Tom Goldstein
PDF
Adaptive Feature Selection: Computationally Efficient Online Sparse Linear Regression Under RIP Satyen Kale, Zohar Karnin, Tengyuan Liang, Dávid Pál
PDF
Adaptive Multiple-Arm Identification Jiecao Chen, Xi Chen, Qin Zhang, Yuan Zhou
PDF
Adaptive Neural Networks for Efficient Inference Tolga Bolukbasi, Joseph Wang, Ofer Dekel, Venkatesh Saligrama
PDF
Adaptive Sampling Probabilities for Non-Smooth Optimization Hongseok Namkoong, Aman Sinha, Steve Yadlowsky, John C. Duchi
PDF
Adversarial Feature Matching for Text Generation Yizhe Zhang, Zhe Gan, Kai Fan, Zhi Chen, Ricardo Henao, Dinghan Shen, Lawrence Carin
PDF
Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks Lars Mescheder, Sebastian Nowozin, Andreas Geiger
PDF
Algebraic Variety Models for High-Rank Matrix Completion Greg Ongie, Rebecca Willett, Robert D. Nowak, Laura Balzano
PDF
Algorithmic Stability and Hypothesis Complexity Tongliang Liu, Gábor Lugosi, Gergely Neu, Dacheng Tao
PDF
Algorithms for $\ell_p$ Low-Rank Approximation Flavio Chierichetti, Sreenivas Gollapudi, Ravi Kumar, Silvio Lattanzi, Rina Panigrahy, David P. Woodruff
PDF
An Adaptive Test of Independence with Analytic Kernel Embeddings Wittawat Jitkrittum, Zoltán Szabó, Arthur Gretton
PDF
An Alternative SoftMax Operator for Reinforcement Learning Kavosh Asadi, Michael L. Littman
PDF
An Analytical Formula of Population Gradient for Two-Layered ReLU Network and Its Applications in Convergence and Critical Point Analysis Yuandong Tian
PDF
An Efficient, Sparsity-Preserving, Online Algorithm for Low-Rank Approximation David Anderson, Ming Gu
PDF
An Infinite Hidden Markov Model with Similarity-Biased Transitions Colin Reimer Dawson, Chaofan Huang, Clayton T. Morrison
PDF
Analogical Inference for Multi-Relational Embeddings Hanxiao Liu, Yuexin Wu, Yiming Yang
PDF
Analysis and Optimization of Graph Decompositions by Lifted Multicuts Andrea Horňáková, Jan-Hendrik Lange, Bjoern Andres
PDF
Analytical Guarantees on Numerical Precision of Deep Neural Networks Charbel Sakr, Yongjune Kim, Naresh Shanbhag
PDF
Approximate Newton Methods and Their Local Convergence Haishan Ye, Luo Luo, Zhihua Zhang
PDF
Approximate Steepest Coordinate Descent Sebastian U. Stich, Anant Raj, Martin Jaggi
PDF
Asymmetric Tri-Training for Unsupervised Domain Adaptation Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada
PDF
Asynchronous Distributed Variational Gaussian Process for Regression Hao Peng, Shandian Zhe, Xiao Zhang, Yuan Qi
PDF
Asynchronous Stochastic Gradient Descent with Delay Compensation Shuxin Zheng, Qi Meng, Taifeng Wang, Wei Chen, Nenghai Yu, Zhi-Ming Ma, Tie-Yan Liu
PDF
Attentive Recurrent Comparators Pranav Shyam, Shubham Gupta, Ambedkar Dukkipati
PDF
Automated Curriculum Learning for Neural Networks Alex Graves, Marc G. Bellemare, Jacob Menick, Rémi Munos, Koray Kavukcuoglu
PDF
Automatic Discovery of the Statistical Types of Variables in a Dataset Isabel Valera, Zoubin Ghahramani
PDF
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning Oron Anschel, Nir Baram, Nahum Shimkin
PDF
Axiomatic Attribution for Deep Networks Mukund Sundararajan, Ankur Taly, Qiqi Yan
PDF
Batched High-Dimensional Bayesian Optimization via Structural Kernel Learning Zi Wang, Chengtao Li, Stefanie Jegelka, Pushmeet Kohli
PDF
Bayesian Boolean Matrix Factorisation Tammo Rukat, Chris C. Holmes, Michalis K. Titsias, Christopher Yau
PDF
Bayesian Inference on Random Simple Graphs with Power Law Degree Distributions Juho Lee, Creighton Heaukulani, Zoubin Ghahramani, Lancelot F. James, Seungjin Choi
PDF
Bayesian Models of Data Streams with Hierarchical Power Priors Andrés Masegosa, Thomas D. Nielsen, Helge Langseth, Darı́o Ramos-López, Antonio Salmerón, Anders L. Madsen
PDF
Bayesian Optimization with Tree-Structured Dependencies Rodolphe Jenatton, Cedric Archambeau, Javier González, Matthias Seeger
PDF
Being Robust (in High Dimensions) Can Be Practical Ilias Diakonikolas, Gautam Kamath, Daniel M. Kane, Jerry Li, Ankur Moitra, Alistair Stewart
PDF
Beyond Filters: Compact Feature mAP for Portable Deep Model Yunhe Wang, Chang Xu, Chao Xu, Dacheng Tao
PDF
Bidirectional Learning for Time-Series Models with Hidden Units Takayuki Osogami, Hiroshi Kajino, Taro Sekiyama
PDF
Boosted Fitted Q-Iteration Samuele Tosatto, Matteo Pirotta, Carlo D’Eramo, Marcello Restelli
PDF
Bottleneck Conditional Density Estimation Rui Shu, Hung H. Bui, Mohammad Ghavamzadeh
PDF
Breaking Locality Accelerates Block Gauss-Seidel Stephen Tu, Shivaram Venkataraman, Ashia C. Wilson, Alex Gittens, Michael I. Jordan, Benjamin Recht
PDF
Canopy Fast Sampling with Cover Trees Manzil Zaheer, Satwik Kottur, Amr Ahmed, José Moura, Alex Smola
PDF
Capacity Releasing Diffusion for Speed and Locality Di Wang, Kimon Fountoulakis, Monika Henzinger, Michael W. Mahoney, Satish Rao
PDF
ChoiceRank: Identifying Preferences from Node Traffic in Networks Lucas Maystre, Matthias Grossglauser
PDF
Clustering by Sum of Norms: Stochastic Incremental Algorithm, Convergence and Cluster Recovery Ashkan Panahi, Devdatt Dubhashi, Fredrik D. Johansson, Chiranjib Bhattacharyya
PDF
Clustering High Dimensional Dynamic Data Streams Vladimir Braverman, Gereon Frahling, Harry Lang, Christian Sohler, Lin F. Yang
PDF
Co-Clustering Through Optimal Transport Charlotte Laclau, Ievgen Redko, Basarab Matei, Younès Bennani, Vincent Brault
PDF
Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study Samuel Ritter, David G. T. Barrett, Adam Santoro, Matt M. Botvinick
PDF
Coherence Pursuit: Fast, Simple, and Robust Subspace Recovery Mostafa Rahmani, George Atia
PDF
Coherent Probabilistic Forecasts for Hierarchical Time Series Souhaib Ben Taieb, James W. Taylor, Rob J. Hyndman
PDF
Collect at Once, Use Effectively: Making Non-Interactive Locally Private Learning Possible Kai Zheng, Wenlong Mou, Liwei Wang
PDF
Combined Group and Exclusive Sparsity for Deep Neural Networks Jaehong Yoon, Sung Ju Hwang
PDF
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning Yevgen Chebotar, Karol Hausman, Marvin Zhang, Gaurav Sukhatme, Stefan Schaal, Sergey Levine
PDF
Communication-Efficient Algorithms for Distributed Stochastic Principal Component Analysis Dan Garber, Ohad Shamir, Nathan Srebro
PDF
Composing Tree Graphical Models with Persistent Homology Features for Clustering Mixed-Type Data Xiuyan Ni, Novi Quadrianto, Yusu Wang, Chao Chen
PDF
Compressed Sensing Using Generative Models Ashish Bora, Ajil Jalal, Eric Price, Alexandros G. Dimakis
PDF
Conditional Accelerated Lazy Stochastic Gradient Descent Guanghui Lan, Sebastian Pokutta, Yi Zhou, Daniel Zink
PDF
Conditional Image Synthesis with Auxiliary Classifier GANs Augustus Odena, Christopher Olah, Jonathon Shlens
PDF
Confident Multiple Choice Learning Kimin Lee, Changho Hwang, KyoungSoo Park, Jinwoo Shin
PDF
Connected Subgraph Detection with Mirror Descent on SDPs Cem Aksoylar, Lorenzo Orecchia, Venkatesh Saligrama
PDF
Consistency Analysis for Binary Classification Revisited Krzysztof Dembczyński, Wojciech Kotłowski, Oluwasanmi Koyejo, Nagarajan Natarajan
PDF
Consistent K-Clustering Silvio Lattanzi, Sergei Vassilvitskii
PDF
Consistent On-Line Off-Policy Evaluation Assaf Hallak, Shie Mannor
PDF
Constrained Policy Optimization Joshua Achiam, David Held, Aviv Tamar, Pieter Abbeel
PDF
Contextual Decision Processes with Low Bellman Rank Are PAC-Learnable Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert E. Schapire
PDF
Continual Learning Through Synaptic Intelligence Friedemann Zenke, Ben Poole, Surya Ganguli
PDF
Convergence Analysis of Proximal Gradient with Momentum for Nonconvex Optimization Qunwei Li, Yi Zhou, Yingbin Liang, Pramod K. Varshney
PDF
Convex Phase Retrieval Without Lifting via PhaseMax Tom Goldstein, Christoph Studer
PDF
Convexified Convolutional Neural Networks Yuchen Zhang, Percy Liang, Martin J. Wainwright
PDF
Convolutional Sequence to Sequence Learning Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin
PDF
Coordinated Multi-Agent Imitation Learning Hoang M. Le, Yisong Yue, Peter Carr, Patrick Lucey
PDF
Coresets for Vector Summarization with Applications to Network Graphs Dan Feldman, Sedat Ozer, Daniela Rus
PDF
Cost-Optimal Learning of Causal Graphs Murat Kocaoglu, Alex Dimakis, Sriram Vishwanath
PDF
Count-Based Exploration with Neural Density Models Georg Ostrovski, Marc G. Bellemare, Aäron Oord, Rémi Munos
PDF
Counterfactual Data-Fusion for Online Reinforcement Learners Andrew Forney, Judea Pearl, Elias Bareinboim
PDF
Coupling Distributed and Symbolic Execution for Natural Language Queries Lili Mou, Zhengdong Lu, Hang Li, Zhi Jin
PDF
Curiosity-Driven Exploration by Self-Supervised Prediction Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell
PDF
Dance Dance Convolution Chris Donahue, Zachary C. Lipton, Julian McAuley
PDF
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning Irina Higgins, Arka Pal, Andrei Rusu, Loic Matthey, Christopher Burgess, Alexander Pritzel, Matthew Botvinick, Charles Blundell, Alexander Lerchner
PDF
Data-Efficient Policy Evaluation Through Behavior Policy Search Josiah P. Hanna, Philip S. Thomas, Peter Stone, Scott Niekum
PDF
Deciding How to Decide: Dynamic Routing in Artificial Neural Networks Mason McGill, Pietro Perona
PDF
Decoupled Neural Interfaces Using Synthetic Gradients Max Jaderberg, Wojciech Marian Czarnecki, Simon Osindero, Oriol Vinyals, Alex Graves, David Silver, Koray Kavukcuoglu
PDF
Deep Bayesian Active Learning with Image Data Yarin Gal, Riashat Islam, Zoubin Ghahramani
PDF
Deep Decentralized Multi-Task Multi-Agent Reinforcement Learning Under Partial Observability Shayegan Omidshafiei, Jason Pazis, Christopher Amato, Jonathan P. How, John Vian
PDF
Deep Generative Models for Relational Data with Side Information Changwei Hu, Piyush Rai, Lawrence Carin
PDF
Deep IV: A Flexible Approach for Counterfactual Prediction Jason Hartford, Greg Lewis, Kevin Leyton-Brown, Matt Taddy
PDF
Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC Yulai Cong, Bo Chen, Hongwei Liu, Mingyuan Zhou
PDF
Deep Spectral Clustering Learning Marc T. Law, Raquel Urtasun, Richard S. Zemel
PDF
Deep Tensor Convolution on Multicores David Budden, Alexander Matveev, Shibani Santurkar, Shraman Ray Chaudhuri, Nir Shavit
PDF
Deep Transfer Learning with Joint Adaptation Networks Mingsheng Long, Han Zhu, Jianmin Wang, Michael I. Jordan
PDF
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs Michael Gygli, Mohammad Norouzi, Anelia Angelova
PDF
Deep Voice: Real-Time Neural Text-to-Speech Sercan Ö. Arık, Mike Chrzanowski, Adam Coates, Gregory Diamos, Andrew Gibiansky, Yongguo Kang, Xian Li, John Miller, Andrew Ng, Jonathan Raiman, Shubho Sengupta, Mohammad Shoeybi
PDF
DeepBach: A Steerable Model for Bach Chorales Generation Gaëtan Hadjeres, François Pachet, Frank Nielsen
PDF
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction Wen Sun, Arun Venkatraman, Geoffrey J. Gordon, Byron Boots, J. Andrew Bagnell
PDF
Deletion-Robust Submodular Maximization: Data Summarization with “the Right to Be Forgotten” Baharan Mirzasoleiman, Amin Karbasi, Andreas Krause
PDF
Delta Networks for Optimized Recurrent Network Computation Daniel Neil, Jun Haeng Lee, Tobi Delbruck, Shih-Chii Liu
PDF
Density Level Set Estimation on Manifolds with DBSCAN Heinrich Jiang
PDF
Depth-Width Tradeoffs in Approximating Natural Functions with Neural Networks Itay Safran, Ohad Shamir
PDF
Deriving Neural Architectures from Sequence and Graph Kernels Tao Lei, Wengong Jin, Regina Barzilay, Tommi Jaakkola
PDF
Developing Bug-Free Machine Learning Systems with Formal Mathematics Daniel Selsam, Percy Liang, David L. Dill
PDF
Device Placement Optimization with Reinforcement Learning Azalia Mirhoseini, Hieu Pham, Quoc V. Le, Benoit Steiner, Rasmus Larsen, Yuefeng Zhou, Naveen Kumar, Mohammad Norouzi, Samy Bengio, Jeff Dean
PDF
Diameter-Based Active Learning Christopher Tosh, Sanjoy Dasgupta
PDF
Dictionary Learning Based on Sparse Distribution Tomography Pedram Pad, Farnood Salehi, Elisa Celis, Patrick Thiran, Michael Unser
PDF
Differentiable Programs with Neural Libraries Alexander L. Gaunt, Marc Brockschmidt, Nate Kushman, Daniel Tarlow
PDF
Differentially Private Chi-Squared Test by Unit Circle Mechanism Kazuya Kakizaki, Kazuto Fukuchi, Jun Sakuma
PDF
Differentially Private Clustering in High-Dimensional Euclidean Spaces Maria-Florina Balcan, Travis Dick, Yingyu Liang, Wenlong Mou, Hongyang Zhang
PDF
Differentially Private Learning of Undirected Graphical Models Using Collective Graphical Models Garrett Bernstein, Ryan McKenna, Tao Sun, Daniel Sheldon, Michael Hay, Gerome Miklau
PDF
Differentially Private Ordinary Least Squares Or Sheffet
PDF
Differentially Private Submodular Maximization: Data Summarization in Disguise Marko Mitrovic, Mark Bun, Andreas Krause, Amin Karbasi
PDF
Discovering Discrete Latent Topics with Neural Variational Inference Yishu Miao, Edward Grefenstette, Phil Blunsom
PDF
Dissipativity Theory for Nesterov’s Accelerated Method Bin Hu, Laurent Lessard
PDF
Distributed and Provably Good Seedings for K-Means in Constant Rounds Olivier Bachem, Mario Lucic, Andreas Krause
PDF
Distributed Batch Gaussian Process Optimization Erik A. Daxberger, Bryan Kian Hsiang Low
PDF
Distributed Mean Estimation with Limited Communication Ananda Theertha Suresh, Felix X. Yu, Sanjiv Kumar, H. Brendan McMahan
PDF
Doubly Accelerated Methods for Faster CCA and Generalized Eigendecomposition Zeyuan Allen-Zhu, Yuanzhi Li
PDF
Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization Qi Lei, Ian En-Hsu Yen, Chao-yuan Wu, Inderjit S. Dhillon, Pradeep Ravikumar
PDF
Dropout Inference in Bayesian Neural Networks with Alpha-Divergences Yingzhen Li, Yarin Gal
PDF
Dual Iterative Hard Thresholding: From Non-Convex Sparse Minimization to Non-Smooth Concave Maximization Bo Liu, Xiao-Tong Yuan, Lezi Wang, Qingshan Liu, Dimitris N. Metaxas
PDF
Dual Supervised Learning Yingce Xia, Tao Qin, Wei Chen, Jiang Bian, Nenghai Yu, Tie-Yan Liu
PDF
Dueling Bandits with Weak Regret Bangrui Chen, Peter I. Frazier
PDF
Dynamic Word Embeddings Robert Bamler, Stephan Mandt
PDF
Efficient Distributed Learning with Sparsity Jialei Wang, Mladen Kolar, Nathan Srebro, Tong Zhang
PDF
Efficient Nonmyopic Active Search Shali Jiang, Gustavo Malkomes, Geoff Converse, Alyssa Shofner, Benjamin Moseley, Roman Garnett
PDF
Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret Alina Beygelzimer, Francesco Orabona, Chicheng Zhang
PDF
Efficient Orthogonal Parametrisation of Recurrent Neural Networks Using Householder Reflections Zakaria Mhammedi, Andrew Hellicar, Ashfaqur Rahman, James Bailey
PDF
Efficient Regret Minimization in Non-Convex Games Elad Hazan, Karan Singh, Cyril Zhang
PDF
Efficient SoftMax Approximation for GPUs Grave, Armand Joulin, Moustapha Cissé, David Grangier, Hervé Jégou
PDF
Emulating the Expert: Inverse Optimization Through Online Learning Andreas Bärmann, Sebastian Pokutta, Oskar Schneider
PDF
End-to-End Differentiable Adversarial Imitation Learning Nir Baram, Oron Anschel, Itai Caspi, Shie Mannor
PDF
End-to-End Learning for Structured Prediction Energy Networks David Belanger, Bishan Yang, Andrew McCallum
PDF
Enumerating Distinct Decision Trees Salvatore Ruggieri
PDF
Equivariance Through Parameter-Sharing Siamak Ravanbakhsh, Jeff Schneider, Barnabás Póczos
PDF
Estimating Individual Treatment Effect: Generalization Bounds and Algorithms Uri Shalit, Fredrik D. Johansson, David Sontag
PDF
Estimating the Unseen from Multiple Populations Aditi Raghunathan, Gregory Valiant, James Zou
PDF
Evaluating Bayesian Models with Posterior Dispersion Indices Alp Kucukelbir, Yixin Wang, David M. Blei
PDF
Evaluating the Variance of Likelihood-Ratio Gradient Estimators Seiya Tokui, Issei Sato
PDF
Exact Inference for Integer Latent-Variable Models Kevin Winner, Debora Sujono, Dan Sheldon
PDF
Exact MAP Inference by Avoiding Fractional Vertices Erik M. Lindgren, Alexandros G. Dimakis, Adam Klivans
PDF
Exploiting Strong Convexity from Data with Primal-Dual First-Order Algorithms Jialei Wang, Lin Xiao
PDF
Failures of Gradient-Based Deep Learning Shai Shalev-Shwartz, Ohad Shamir, Shaked Shammah
PDF
Fairness in Reinforcement Learning Shahin Jabbari, Matthew Joseph, Michael Kearns, Jamie Morgenstern, Aaron Roth
PDF
Fake News Mitigation via Point Process Based Intervention Mehrdad Farajtabar, Jiachen Yang, Xiaojing Ye, Huan Xu, Rakshit Trivedi, Elias Khalil, Shuang Li, Le Song, Hongyuan Zha
PDF
Fast Bayesian Intensity Estimation for the Permanental Process Christian J. Walder, Adrian N. Bishop
PDF
Fast K-Nearest Neighbour Search via Prioritized DCI Ke Li, Jitendra Malik
PDF
Faster Greedy MAP Inference for Determinantal Point Processes Insu Han, Prabhanjan Kambadur, Kyoungsoo Park, Jinwoo Shin
PDF
Faster Principal Component Regression and Stable Matrix Chebyshev Approximation Zeyuan Allen-Zhu, Yuanzhi Li
PDF
FeUdal Networks for Hierarchical Reinforcement Learning Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu
PDF
Follow the Compressed Leader: Faster Online Learning of Eigenvectors and Faster MMWU Zeyuan Allen-Zhu, Yuanzhi Li
PDF
Follow the Moving Leader in Deep Learning Shuai Zheng, James T. Kwok
PDF
Forest-Type Regression with General Losses and Robust Forest Alexander Hanbo Li, Andrew Martin
PDF
Forward and Reverse Gradient-Based Hyperparameter Optimization Luca Franceschi, Michele Donini, Paolo Frasconi, Massimiliano Pontil
PDF
Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for Markov Chain Monte Carlo Umut Şimşekli
PDF
Frame-Based Data Factorizations Sebastian Mair, Ahcène Boubekki, Ulf Brefeld
PDF
From Patches to Images: A Nonparametric Generative Model Geng Ji, Michael C. Hughes, Erik B. Sudderth
PDF
Generalization and Equilibrium in Generative Adversarial Nets (GANs) Sanjeev Arora, Rong Ge, Yingyu Liang, Tengyu Ma, Yi Zhang
PDF
Geometry of Neural Network Loss Surfaces via Random Matrix Theory Jeffrey Pennington, Yasaman Bahri
PDF
Global Optimization of Lipschitz Functions Cédric Malherbe, Nicolas Vayatis
PDF
Globally Induced Forest: A Prepruning Compression Scheme Jean-Michel Begon, Arnaud Joly, Pierre Geurts
PDF
Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs Alon Brutzkus, Amir Globerson
PDF
Gradient Boosted Decision Trees for High Dimensional Sparse Output Si Si, Huan Zhang, S. Sathiya Keerthi, Dhruv Mahajan, Inderjit S. Dhillon, Cho-Jui Hsieh
PDF
Gradient Coding: Avoiding Stragglers in Distributed Learning Rashish Tandon, Qi Lei, Alexandros G. Dimakis, Nikos Karampatziakis
PDF
Gradient Projection Iterative Sketch for Large-Scale Constrained Least-Squares Junqi Tang, Mohammad Golbabaee, Mike E. Davies
PDF
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling Hairong Liu, Zhenyao Zhu, Xiangang Li, Sanjeev Satheesh
PDF
Grammar Variational Autoencoder Matt J. Kusner, Brooks Paige, José Miguel Hernández-Lobato
PDF
Graph-Based Isometry Invariant Representation Learning Renata Khasanova, Pascal Frossard
PDF
GSOS: Gauss-Seidel Operator Splitting Algorithm for Multi-Term Nonsmooth Convex Composite Optimization Li Shen, Wei Liu, Ganzhao Yuan, Shiqian Ma
PDF
Guarantees for Greedy Maximization of Non-Submodular Functions with Applications Andrew An Bian, Joachim M. Buhmann, Andreas Krause, Sebastian Tschiatschek
PDF
Hierarchy Through Composition with Multitask LMDPs Andrew M. Saxe, Adam C. Earle, Benjamin Rosman
PDF
High Dimensional Bayesian Optimization with Elastic Gaussian Process Santu Rana, Cheng Li, Sunil Gupta, Vu Nguyen, Svetha Venkatesh
PDF
High-Dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation Zhuoran Yang, Krishnakumar Balasubramanian, Han Liu
PDF
High-Dimensional Structured Quantile Regression Vidyashankar Sivakumar, Arindam Banerjee
PDF
High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm Rongda Zhu, Lingxiao Wang, Chengxiang Zhai, Quanquan Gu
PDF
How Close Are the Eigenvectors of the Sample and Actual Covariance Matrices? Andreas Loukas
PDF
How to Escape Saddle Points Efficiently Chi Jin, Rong Ge, Praneeth Netrapalli, Sham M. Kakade, Michael I. Jordan
PDF
Hyperplane Clustering via Dual Principal Component Pursuit Manolis C. Tsakiris, René Vidal
PDF
Identification and Model Testing in Linear Structural Equation Models Using Auxiliary Variables Bryant Chen, Daniel Kumor, Elias Bareinboim
PDF
Identify the Nash Equilibrium in Static Games with Random Payoffs Yichi Zhou, Jialian Li, Jun Zhu
PDF
Identifying Best Interventions Through Online Importance Sampling Rajat Sen, Karthikeyan Shanmugam, Alexandros G. Dimakis, Sanjay Shakkottai
PDF
Image-to-Markup Generation with Coarse-to-Fine Attention Yuntian Deng, Anssi Kanervisto, Jeffrey Ling, Alexander M. Rush
PDF
Improved Variational Autoencoders for Text Modeling Using Dilated Convolutions Zichao Yang, Zhiting Hu, Ruslan Salakhutdinov, Taylor Berg-Kirkpatrick
PDF
Improving Gibbs Sampler Scan Quality with DoGS Ioannis Mitliagkas, Lester Mackey
PDF
Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning Using the Beta Distribution Po-Wei Chou, Daniel Maturana, Sebastian Scherer
PDF
Improving Viterbi Is Hard: Better Runtimes Imply Faster Clique Algorithms Arturs Backurs, Christos Tzamos
PDF
Innovation Pursuit: A New Approach to the Subspace Clustering Problem Mostafa Rahmani, George Atia
PDF
Input Convex Neural Networks Brandon Amos, Lei Xu, J. Zico Kolter
PDF
Input Switched Affine Networks: An RNN Architecture Designed for Interpretability Jakob N. Foerster, Justin Gilmer, Jascha Sohl-Dickstein, Jan Chorowski, David Sussillo
PDF
Interactive Learning from Policy-Dependent Human Feedback James MacGlashan, Mark K. Ho, Robert Loftin, Bei Peng, Guan Wang, David L. Roberts, Matthew E. Taylor, Michael L. Littman
PDF
iSurvive: An Interpretable, Event-Time Prediction Model for mHealth Walter H. Dempsey, Alexander Moreno, Christy K. Scott, Michael L. Dennis, David H. Gustafson, Susan A. Murphy, James M. Rehg
PDF
Iterative Machine Teaching Weiyang Liu, Bo Dai, Ahmad Humayun, Charlene Tay, Chen Yu, Linda B. Smith, James M. Rehg, Le Song
PDF
Joint Dimensionality Reduction and Metric Learning: A Geometric Take Mehrtash Harandi, Mathieu Salzmann, Richard Hartley
PDF
Just Sort It! a Simple and Effective Approach to Active Preference Learning Lucas Maystre, Matthias Grossglauser
PDF
Kernelized Support Tensor Machines Lifang He, Chun-Ta Lu, Guixiang Ma, Shen Wang, Linlin Shen, Philip S. Yu, Ann B. Ragin
PDF
Know-Evolve: Deep Temporal Reasoning for Dynamic Knowledge Graphs Rakshit Trivedi, Hanjun Dai, Yichen Wang, Le Song
PDF
Language Modeling with Gated Convolutional Networks Yann N. Dauphin, Angela Fan, Michael Auli, David Grangier
PDF
Large-Scale Evolution of Image Classifiers Esteban Real, Sherry Moore, Andrew Selle, Saurabh Saxena, Yutaka Leon Suematsu, Jie Tan, Quoc V. Le, Alexey Kurakin
PDF
Latent Feature Lasso Ian En-Hsu Yen, Wei-Cheng Lee, Sung-En Chang, Arun Sai Suggala, Shou-De Lin, Pradeep Ravikumar
PDF
Latent Intention Dialogue Models Tsung-Hsien Wen, Yishu Miao, Phil Blunsom, Steve Young
PDF
Latent LSTM Allocation: Joint Clustering and Non-Linear Dynamic Modeling of Sequence Data Manzil Zaheer, Amr Ahmed, Alexander J. Smola
PDF
Lazifying Conditional Gradient Algorithms Gábor Braun, Sebastian Pokutta, Daniel Zink
PDF
Learned Optimizers That Scale and Generalize Olga Wichrowska, Niru Maheswaranathan, Matthew W. Hoffman, Sergio Gómez Colmenarejo, Misha Denil, Nando Freitas, Jascha Sohl-Dickstein
PDF
Learning Algorithms for Active Learning Philip Bachman, Alessandro Sordoni, Adam Trischler
PDF
Learning Continuous Semantic Representations of Symbolic Expressions Miltiadis Allamanis, Pankajan Chanthirasegaran, Pushmeet Kohli, Charles Sutton
PDF
Learning Deep Architectures via Generalized Whitened Neural Networks Ping Luo
PDF
Learning Deep Latent Gaussian Models with Markov Chain Monte Carlo Matthew D. Hoffman
PDF
Learning Determinantal Point Processes with Moments and Cycles John Urschel, Victor-Emmanuel Brunel, Ankur Moitra, Philippe Rigollet
PDF
Learning Discrete Representations via Information Maximizing Self-Augmented Training Weihua Hu, Takeru Miyato, Seiya Tokui, Eiichi Matsumoto, Masashi Sugiyama
PDF
Learning from Clinical Judgments: Semi-Markov-Modulated Marked Hawkes Processes for Risk Prognosis Ahmed M. Alaa, Scott Hu, Mihaela Schaar
PDF
Learning Gradient Descent: Better Generalization and Longer Horizons Kaifeng Lv, Shunhua Jiang, Jian Li
PDF
Learning Hawkes Processes from Short Doubly-Censored Event Sequences Hongteng Xu, Dixin Luo, Hongyuan Zha
PDF
Learning Hierarchical Features from Deep Generative Models Shengjia Zhao, Jiaming Song, Stefano Ermon
PDF
Learning Important Features Through Propagating Activation Differences Avanti Shrikumar, Peyton Greenside, Anshul Kundaje
PDF
Learning in POMDPs with Monte Carlo Tree Search Sammie Katt, Frans A. Oliehoek, Christopher Amato
PDF
Learning Infinite Layer Networks Without the Kernel Trick Roi Livni, Daniel Carmon, Amir Globerson
PDF
Learning Latent Space Models with Angular Constraints Pengtao Xie, Yuntian Deng, Yi Zhou, Abhimanu Kumar, Yaoliang Yu, James Zou, Eric P. Xing
PDF
Learning Sleep Stages from Radio Signals: A Conditional Adversarial Architecture Mingmin Zhao, Shichao Yue, Dina Katabi, Tommi S. Jaakkola, Matt T. Bianchi
PDF
Learning Stable Stochastic Nonlinear Dynamical Systems Jonas Umlauft, Sandra Hirche
PDF
Learning Texture Manifolds with the Periodic Spatial GAN Urs Bergmann, Nikolay Jetchev, Roland Vollgraf
PDF
Learning the Structure of Generative Models Without Labeled Data Stephen H. Bach, Bryan He, Alexander Ratner, Christopher Ré
PDF
Learning to Aggregate Ordinal Labels by Maximizing Separating Width Guangyong Chen, Shengyu Zhang, Di Lin, Hui Huang, Pheng Ann Heng
PDF
Learning to Align the Source Code to the Compiled Object Code Dor Levy, Lior Wolf
PDF
Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier Joseph Futoma, Sanjay Hariharan, Katherine Heller
PDF
Learning to Discover Cross-Domain Relations with Generative Adversarial Networks Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, Jiwon Kim
PDF
Learning to Discover Sparse Graphical Models Eugene Belilovsky, Kyle Kastner, Gael Varoquaux, Matthew B. Blaschko
PDF
Learning to Generate Long-Term Future via Hierarchical Prediction Ruben Villegas, Jimei Yang, Yuliang Zou, Sungryull Sohn, Xunyu Lin, Honglak Lee
PDF
Learning to Learn Without Gradient Descent by Gradient Descent Yutian Chen, Matthew W. Hoffman, Sergio Gómez Colmenarejo, Misha Denil, Timothy P. Lillicrap, Matt Botvinick, Nando Freitas
PDF
Leveraging Node Attributes for Incomplete Relational Data He Zhao, Lan Du, Wray Buntine
PDF
Leveraging Union of Subspace Structure to Improve Constrained Clustering John Lipor, Laura Balzano
PDF
Local Bayesian Optimization of Motor Skills Riad Akrour, Dmitry Sorokin, Jan Peters, Gerhard Neumann
PDF
Local-to-Global Bayesian Network Structure Learning Tian Gao, Kshitij Fadnis, Murray Campbell
PDF
Logarithmic Time One-Against-Some Hal Daumé, Nikos Karampatziakis, John Langford, Paul Mineiro
PDF
Lost Relatives of the Gumbel Trick Matej Balog, Nilesh Tripuraneni, Zoubin Ghahramani, Adrian Weller
PDF
Magnetic Hamiltonian Monte Carlo Nilesh Tripuraneni, Mark Rowland, Zoubin Ghahramani, Richard Turner
PDF
Max-Value Entropy Search for Efficient Bayesian Optimization Zi Wang, Stefanie Jegelka
PDF
Maximum Selection and Ranking Under Noisy Comparisons Moein Falahatgar, Alon Orlitsky, Venkatadheeraj Pichapati, Ananda Theertha Suresh
PDF
McGan: Mean and Covariance Feature Matching GAN Youssef Mroueh, Tom Sercu, Vaibhava Goel
PDF
Measuring Sample Quality with Kernels Jackson Gorham, Lester Mackey
PDF
MEC: Memory-Efficient Convolution for Deep Neural Network Minsik Cho, Daniel Brand
PDF
meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting Xu Sun, Xuancheng Ren, Shuming Ma, Houfeng Wang
PDF
Meritocratic Fairness for Cross-Population Selection Michael Kearns, Aaron Roth, Zhiwei Steven Wu
PDF
Meta Networks Tsendsuren Munkhdalai, Hong Yu
PDF
Minimax Regret Bounds for Reinforcement Learning Mohammad Gheshlaghi Azar, Ian Osband, Rémi Munos
PDF
Minimizing Trust Leaks for Robust Sybil Detection János Höner, Shinichi Nakajima, Alexander Bauer, Klaus-Robert Müller, Nico Görnitz
PDF
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks Chelsea Finn, Pieter Abbeel, Sergey Levine
PDF
Model-Independent Online Learning for Influence Maximization Sharan Vaswani, Branislav Kveton, Zheng Wen, Mohammad Ghavamzadeh, Laks V. S. Lakshmanan, Mark Schmidt
PDF
Modular Multitask Reinforcement Learning with Policy Sketches Jacob Andreas, Dan Klein, Sergey Levine
PDF
Multi-Class Optimal Margin Distribution Machine Teng Zhang, Zhi-Hua Zhou
PDF
Multi-Fidelity Bayesian Optimisation with Continuous Approximations Kirthevasan Kandasamy, Gautam Dasarathy, Jeff Schneider, Barnabás Póczos
PDF
Multi-Objective Bandits: Optimizing the Generalized Gini Index Róbert Busa-Fekete, Balázs Szörényi, Paul Weng, Shie Mannor
PDF
Multi-Task Learning with Labeled and Unlabeled Tasks Anastasia Pentina, Christoph H. Lampert
PDF
Multichannel End-to-End Speech Recognition Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey
PDF
Multilabel Classification with Group Testing and Codes Shashanka Ubaru, Arya Mazumdar
PDF
Multilevel Clustering via Wasserstein Means Nhat Ho, XuanLong Nguyen, Mikhail Yurochkin, Hung Hai Bui, Viet Huynh, Dinh Phung
PDF
Multiple Clustering Views from Multiple Uncertain Experts Yale Chang, Junxiang Chen, Michael H. Cho, Peter J. Castaldi, Edwin K. Silverman, Jennifer G. Dy
PDF
Multiplicative Normalizing Flows for Variational Bayesian Neural Networks Christos Louizos, Max Welling
PDF
Natasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter Zeyuan Allen-Zhu
PDF
Near-Optimal Design of Experiments via Regret Minimization Zeyuan Allen-Zhu, Yuanzhi Li, Aarti Singh, Yining Wang
PDF
Nearly Optimal Robust Matrix Completion Yeshwanth Cherapanamjeri, Kartik Gupta, Prateek Jain
PDF
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders Jesse Engel, Cinjon Resnick, Adam Roberts, Sander Dieleman, Mohammad Norouzi, Douglas Eck, Karen Simonyan
PDF
Neural Episodic Control Alexander Pritzel, Benigno Uria, Sriram Srinivasan, Adrià Puigdomènech Badia, Oriol Vinyals, Demis Hassabis, Daan Wierstra, Charles Blundell
PDF
Neural Message Passing for Quantum Chemistry Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, George E. Dahl
PDF
Neural Networks and Rational Functions Matus Telgarsky
PDF
Neural Optimizer Search with Reinforcement Learning Irwan Bello, Barret Zoph, Vijay Vasudevan, Quoc V. Le
PDF
Neural Taylor Approximations: Convergence and Exploration in Rectifier Networks David Balduzzi, Brian McWilliams, Tony Butler-Yeoman
PDF
No Spurious Local Minima in Nonconvex Low Rank Problems: A Unified Geometric Analysis Rong Ge, Chi Jin, Yi Zheng
PDF
Nonnegative Matrix Factorization for Time Series Recovery from a Few Temporal Aggregates Jiali Mei, Yohann De Castro, Yannig Goude, Georges Hébrail
PDF
Nonparanormal Information Estimation Shashank Singh, Barnabás Póczos
PDF
Nyström Method with Kernel K-Means++ Samples as Landmarks Dino Oglic, Thomas Gärtner
PDF
On Approximation Guarantees for Greedy Low Rank Optimization Rajiv Khanna, Ethan R. Elenberg, Alexandros G. Dimakis, Joydeep Ghosh, Sahand Negahban
PDF
On Calibration of Modern Neural Networks Chuan Guo, Geoff Pleiss, Yu Sun, Kilian Q. Weinberger
PDF
On Context-Dependent Clustering of Bandits Claudio Gentile, Shuai Li, Purushottam Kar, Alexandros Karatzoglou, Giovanni Zappella, Evans Etrue
PDF
On Kernelized Multi-Armed Bandits Sayak Ray Chowdhury, Aditya Gopalan
PDF
On Mixed Memberships and Symmetric Nonnegative Matrix Factorizations Xueyu Mao, Purnamrita Sarkar, Deepayan Chakrabarti
PDF
On Orthogonality and Learning Recurrent Networks with Long Term Dependencies Eugene Vorontsov, Chiheb Trabelsi, Samuel Kadoury, Chris Pal
PDF
On Relaxing Determinism in Arithmetic Circuits Arthur Choi, Adnan Darwiche
PDF
On the Expressive Power of Deep Neural Networks Maithra Raghu, Ben Poole, Jon Kleinberg, Surya Ganguli, Jascha Sohl-Dickstein
PDF
On the Iteration Complexity of Support Recovery via Hard Thresholding Pursuit Jie Shen, Ping Li
PDF
On the Projection Operator to a Three-View Cardinality Constrained Set Haichuan Yang, Shupeng Gui, Chuyang Ke, Daniel Stefankovic, Ryohei Fujimaki, Ji Liu
PDF
On the Sampling Problem for Kernel Quadrature François-Xavier Briol, Chris J. Oates, Jon Cockayne, Wilson Ye Chen, Mark Girolami
PDF
Online and Linear-Time Attention by Enforcing Monotonic Alignments Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck
PDF
Online Learning to Rank in Stochastic Click Models Masrour Zoghi, Tomas Tunys, Mohammad Ghavamzadeh, Branislav Kveton, Csaba Szepesvari, Zheng Wen
PDF
Online Learning with Local Permutations and Delayed Feedback Ohad Shamir, Liran Szlak
PDF
Online Partial Least Square Optimization: Dropping Convexity for Better Efficiency and Scalability Zhehui Chen, Lin F. Yang, Chris Junchi Li, Tuo Zhao
PDF
Optimal Algorithms for Smooth and Strongly Convex Distributed Optimization in Networks Kevin Scaman, Francis Bach, Sébastien Bubeck, Yin Tat Lee, Laurent Massoulié
PDF
Optimal and Adaptive Off-Policy Evaluation in Contextual Bandits Yu-Xiang Wang, Alekh Agarwal, Miroslav Dudı́k
PDF
Optimal Densification for Fast and Accurate Minwise Hashing Anshumali Shrivastava
PDF
OptNet: Differentiable Optimization as a Layer in Neural Networks Brandon Amos, J. Zico Kolter
PDF
Oracle Complexity of Second-Order Methods for Finite-Sum Problems Yossi Arjevani, Ohad Shamir
PDF
Ordinal Graphical Models: A Tale of Two Approaches Arun Sai Suggala, Eunho Yang, Pradeep Ravikumar
PDF
Orthogonalized ALS: A Theoretically Principled Tensor Decomposition Algorithm for Practical Use Vatsal Sharan, Gregory Valiant
PDF
Pain-Free Random Differential Privacy with Sensitivity Sampling Benjamin I. P. Rubinstein, Francesco Aldà
PDF
Parallel and Distributed Thompson Sampling for Large-Scale Accelerated Exploration of Chemical Space José Miguel Hernández-Lobato, James Requeima, Edward O. Pyzer-Knapp, Alán Aspuru-Guzik
PDF
Parallel Multiscale Autoregressive Density Estimation Scott Reed, Aäron Oord, Nal Kalchbrenner, Sergio Gómez Colmenarejo, Ziyu Wang, Yutian Chen, Dan Belov, Nando Freitas
PDF
Parseval Networks: Improving Robustness to Adversarial Examples Moustapha Cisse, Piotr Bojanowski, Edouard Grave, Yann Dauphin, Nicolas Usunier
PDF
Partitioned Tensor Factorizations for Learning Mixed Membership Models Zilong Tan, Sayan Mukherjee
PDF
PixelCNN Models with Auxiliary Variables for Natural Image Modeling Alexander Kolesnikov, Christoph H. Lampert
PDF
Post-Inference Prior Swapping Willie Neiswanger, Eric Xing
PDF
Practical Gauss-Newton Optimisation for Deep Learning Aleksandar Botev, Hippolyt Ritter, David Barber
PDF
Prediction and Control with Temporal Segment Models Nikhil Mishra, Pieter Abbeel, Igor Mordatch
PDF
Prediction Under Uncertainty in Sparse Spectrum Gaussian Processes with Applications to Filtering and Control Yunpeng Pan, Xinyan Yan, Evangelos A. Theodorou, Byron Boots
PDF
Preferential Bayesian Optimization Javier González, Zhenwen Dai, Andreas Damianou, Neil D. Lawrence
PDF
Priv’IT: Private and Sample Efficient Identity Testing Bryan Cai, Constantinos Daskalakis, Gautam Kamath
PDF
Probabilistic Path Hamiltonian Monte Carlo Vu Dinh, Arman Bilge, Cheng Zhang, Frederick A. Matsen IV
PDF
Probabilistic Submodular Maximization in Sub-Linear Time Serban Stan, Morteza Zadimoghaddam, Andreas Krause, Amin Karbasi
PDF
Programming with a Differentiable Forth Interpreter Matko Bošnjak, Tim Rocktäschel, Jason Naradowsky, Sebastian Riedel
PDF
Projection-Free Distributed Online Learning in Networks Wenpeng Zhang, Peilin Zhao, Wenwu Zhu, Steven C. H. Hoi, Tong Zhang
PDF
ProtoNN: Compressed and Accurate kNN for Resource-Scarce Devices Chirag Gupta, Arun Sai Suggala, Ankit Goyal, Harsha Vardhan Simhadri, Bhargavi Paranjape, Ashish Kumar, Saurabh Goyal, Raghavendra Udupa, Manik Varma, Prateek Jain
PDF
Provable Alternating Gradient Descent for Non-Negative Matrix Factorization with Strong Correlations Yuanzhi Li, Yingyu Liang
PDF
Provably Optimal Algorithms for Generalized Linear Contextual Bandits Lihong Li, Yu Lu, Dengyong Zhou
PDF
Prox-PDA: The Proximal Primal-Dual Algorithm for Fast Distributed Nonconvex Optimization and Learning over Networks Mingyi Hong, Davood Hajinezhad, Ming-Min Zhao
PDF
Random Feature Expansions for Deep Gaussian Processes Kurt Cutajar, Edwin V. Bonilla, Pietro Michiardi, Maurizio Filippone
PDF
Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees Haim Avron, Michael Kapralov, Cameron Musco, Christopher Musco, Ameya Velingker, Amir Zandieh
PDF
Re-Revisiting Learning on Hypergraphs: Confidence Interval and Subgradient Method Chenzi Zhang, Shuguang Hu, Zhihao Gavin Tang, T-H. Hubert Chan
PDF
Real-Time Adaptive Image Compression Oren Rippel, Lubomir Bourdev
PDF
Recovery Guarantees for One-Hidden-Layer Neural Networks Kai Zhong, Zhao Song, Prateek Jain, Peter L. Bartlett, Inderjit S. Dhillon
PDF
Recurrent Highway Networks Julian Georg Zilly, Rupesh Kumar Srivastava, Jan Koutnı́k, Jürgen Schmidhuber
PDF
Recursive Partitioning for Personalization Using Observational Data Nathan Kallus
PDF
Reduced Space and Faster Convergence in Imperfect-Information Games via Pruning Noam Brown, Tuomas Sandholm
PDF
Regret Minimization in Behaviorally-Constrained Zero-Sum Games Gabriele Farina, Christian Kroer, Tuomas Sandholm
PDF
Regularising Non-Linear Models Using Feature Side-Information Amina Mollaysa, Pablo Strasser, Alexandros Kalousis
PDF
Reinforcement Learning with Deep Energy-Based Policies Tuomas Haarnoja, Haoran Tang, Pieter Abbeel, Sergey Levine
PDF
Relative Fisher Information and Natural Gradient for Learning Large Modular Models Ke Sun, Frank Nielsen
PDF
Resource-Efficient Machine Learning in 2 KB RAM for the Internet of Things Ashish Kumar, Saurabh Goyal, Manik Varma
PDF
Risk Bounds for Transferring Representations with and Without Fine-Tuning Daniel McNamara, Maria-Florina Balcan
PDF
Robust Adversarial Reinforcement Learning Lerrel Pinto, James Davidson, Rahul Sukthankar, Abhinav Gupta
PDF
Robust Budget Allocation via Continuous Submodular Functions Matthew Staib, Stefanie Jegelka
PDF
Robust Gaussian Graphical Model Estimation with Arbitrary Corruption Lingxiao Wang, Quanquan Gu
PDF
Robust Guarantees of Stochastic Greedy Algorithms Avinatan Hassidim, Yaron Singer
PDF
Robust Probabilistic Modeling with Bayesian Data Reweighting Yixin Wang, Alp Kucukelbir, David M. Blei
PDF
Robust Structured Estimation with Single-Index Models Sheng Chen, Arindam Banerjee
PDF
Robust Submodular Maximization: A Non-Uniform Partitioning Approach Ilija Bogunovic, Slobodan Mitrović, Jonathan Scarlett, Volkan Cevher
PDF
RobustFill: Neural Program Learning Under Noisy I/O Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli
PDF
Rule-Enhanced Penalized Regression by Column Generation Using Rectangular Maximum Agreement Jonathan Eckstein, Noam Goldberg, Ai Kagawa
PDF
Safety-Aware Algorithms for Adversarial Contextual Bandit Wen Sun, Debadeepta Dey, Ashish Kapoor
PDF
SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient Lam M. Nguyen, Jie Liu, Katya Scheinberg, Martin Takáč
PDF
Scalable Bayesian Rule Lists Hongyu Yang, Cynthia Rudin, Margo Seltzer
PDF
Scalable Generative Models for Multi-Label Learning with Missing Labels Vikas Jain, Nirbhay Modhe, Piyush Rai
PDF
Scalable Multi-Class Gaussian Process Classification Using Expectation Propagation Carlos Villacampa-Calvo, Daniel Hernández-Lobato
PDF
Scaling up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction Weizhong Zhang, Bin Hong, Wei Liu, Jieping Ye, Deng Cai, Xiaofei He, Jie Wang
PDF
Schema Networks: Zero-Shot Transfer with a Generative Causal Model of Intuitive Physics Ken Kansky, Tom Silver, David A. Mély, Mohamed Eldawy, Miguel Lázaro-Gredilla, Xinghua Lou, Nimrod Dorfman, Szymon Sidor, Scott Phoenix, Dileep George
PDF
Second-Order Kernel Online Convex Optimization with Adaptive Sketching Daniele Calandriello, Alessandro Lazaric, Michal Valko
PDF
Selective Inference for Sparse High-Order Interaction Models Shinya Suzumura, Kazuya Nakagawa, Yuta Umezu, Koji Tsuda, Ichiro Takeuchi
PDF
Self-Paced Co-Training Fan Ma, Deyu Meng, Qi Xie, Zina Li, Xuanyi Dong
PDF
Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data Tomoya Sakai, Marthinus Christoffel Plessis, Gang Niu, Masashi Sugiyama
PDF
Sequence Modeling via Segmentations Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng
PDF
Sequence to Better Sequence: Continuous Revision of Combinatorial Structures Jonas Mueller, David Gifford, Tommi Jaakkola
PDF
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-Control Natasha Jaques, Shixiang Gu, Dzmitry Bahdanau, José Miguel Hernández-Lobato, Richard E. Turner, Douglas Eck
PDF
Sharp Minima Can Generalize for Deep Nets Laurent Dinh, Razvan Pascanu, Samy Bengio, Yoshua Bengio
PDF
Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation Yacine Jernite, Anna Choromanska, David Sontag
PDF
Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging Shusen Wang, Alex Gittens, Michael W. Mahoney
PDF
Sliced Wasserstein Kernel for Persistence Diagrams Mathieu Carrière, Marco Cuturi, Steve Oudot
PDF
Soft-DTW: A Differentiable Loss Function for Time-Series Marco Cuturi, Mathieu Blondel
PDF
Source-Target Similarity Modelings for Multi-Source Transfer Gaussian Process Regression Pengfei Wei, Ramon Sagarna, Yiping Ke, Yew-Soon Ong, Chi-Keong Goh
PDF
Sparse + Group-Sparse Dirty Models: Statistical Guarantees Without Unreasonable Conditions and a Case for Non-Convexity Eunho Yang, Aurélie C. Lozano
PDF
Spectral Learning from a Single Trajectory Under Finite-State Policies Borja Balle, Odalric-Ambrym Maillard
PDF
Spherical Structured Feature Maps for Kernel Approximation Yueming Lyu
PDF
SPLICE: Fully Tractable Hierarchical Extension of ICA with Pooling Jun-ichiro Hirayama, Aapo Hyvärinen, Motoaki Kawanabe
PDF
SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization Juyong Kim, Yookoon Park, Gunhee Kim, Sung Ju Hwang
PDF
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning Jakob Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson
PDF
State-Frequency Memory Recurrent Neural Networks Hao Hu, Guo-Jun Qi
PDF
Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening Mohsen Ahmadi Fahandar, Eyke Hüllermeier, Inés Couso
PDF
StingyCD: Safely Avoiding Wasteful Updates in Coordinate Descent Tyler B. Johnson, Carlos Guestrin
PDF
Stochastic Adaptive Quasi-Newton Methods for Minimizing Expected Values Chaoxu Zhou, Wenbo Gao, Donald Goldfarb
PDF
Stochastic Bouncy Particle Sampler Ari Pakman, Dar Gilboa, David Carlson, Liam Paninski
PDF
Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence Yi Xu, Qihang Lin, Tianbao Yang
PDF
Stochastic DCA for the Large-Sum of Non-Convex Functions Problem and Its Application to Group Variable Selection in Classification Hoai An Le Thi, Hoai Minh Le, Duy Nhat Phan, Bach Tran
PDF
Stochastic Generative Hashing Bo Dai, Ruiqi Guo, Sanjiv Kumar, Niao He, Le Song
PDF
Stochastic Gradient MCMC Methods for Hidden Markov Models Yi-An Ma, Nicholas J. Foti, Emily B. Fox
PDF
Stochastic Gradient Monomial Gamma Sampler Yizhe Zhang, Changyou Chen, Zhe Gan, Ricardo Henao, Lawrence Carin
PDF
Stochastic Modified Equations and Adaptive Stochastic Gradient Algorithms Qianxiao Li, Cheng Tai, Weinan E
PDF
Stochastic Variance Reduction Methods for Policy Evaluation Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou
PDF
Strong NP-Hardness for Sparse Optimization with Concave Penalty Functions Yichen Chen, Dongdong Ge, Mengdi Wang, Zizhuo Wang, Yinyu Ye, Hao Yin
PDF
Strongly-Typed Agents Are Guaranteed to Interact Safely David Balduzzi
PDF
Sub-Sampled Cubic Regularization for Non-Convex Optimization Jonas Moritz Kohler, Aurelien Lucchi
PDF
Tensor Balancing on Statistical Manifold Mahito Sugiyama, Hiroyuki Nakahara, Koji Tsuda
PDF
Tensor Belief Propagation Andrew Wrigley, Wee Sun Lee, Nan Ye
PDF
Tensor Decomposition via Simultaneous Power Iteration Po-An Wang, Chi-Jen Lu
PDF
Tensor Decomposition with Smoothness Masaaki Imaizumi, Kohei Hayashi
PDF
Tensor-Train Recurrent Neural Networks for Video Classification Yinchong Yang, Denis Krompass, Volker Tresp
PDF
The Loss Surface of Deep and Wide Neural Networks Quynh Nguyen, Matthias Hein
PDF
The Predictron: End-to-End Learning and Planning David Silver, Hado Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris
PDF
The Price of Differential Privacy for Online Learning Naman Agarwal, Karan Singh
PDF
The Sample Complexity of Online One-Class Collaborative Filtering Reinhard Heckel, Kannan Ramchandran
PDF
The Shattered Gradients Problem: If Resnets Are the Answer, Then What Is the Question? David Balduzzi, Marcus Frean, Lennox Leary, J. P. Lewis, Kurt Wan-Duo Ma, Brian McWilliams
PDF
The Statistical Recurrent Unit Junier B. Oliva, Barnabás Póczos, Jeff Schneider
PDF
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank Liang Zhao, Siyu Liao, Yanzhi Wang, Zhe Li, Jian Tang, Bo Yuan
PDF
Tight Bounds for Approximate Carathéodory and Beyond Vahab Mirrokni, Renato Paes Leme, Adrian Vladu, Sam Chiu-wai Wong
PDF
Toward Controlled Generation of Text Zhiting Hu, Zichao Yang, Xiaodan Liang, Ruslan Salakhutdinov, Eric P. Xing
PDF
Toward Efficient and Accurate Covariance Matrix Estimation on Compressed Data Xixian Chen, Michael R. Lyu, Irwin King
PDF
Towards K-Means-Friendly Spaces: Simultaneous Deep Learning and Clustering Bo Yang, Xiao Fu, Nicholas D. Sidiropoulos, Mingyi Hong
PDF
Tunable Efficient Unitary Neural Networks (EUNN) and Their Application to RNNs Li Jing, Yichen Shen, Tena Dubcek, John Peurifoy, Scott Skirlo, Yann LeCun, Max Tegmark, Marin Soljačić
PDF
Uncertainty Assessment and False Discovery Rate Control in High-Dimensional Granger Causal Inference Aditya Chaudhry, Pan Xu, Quanquan Gu
PDF
Uncorrelation and Evenness: A New Diversity-Promoting Regularizer Pengtao Xie, Aarti Singh, Eric P. Xing
PDF
Uncovering Causality from Multivariate Hawkes Integrated Cumulants Massil Achab, Emmanuel Bacry, Stéphane Gaı̈ffas, Iacopo Mastromatteo, Jean-François Muzy
PDF
Understanding Black-Box Predictions via Influence Functions Pang Wei Koh, Percy Liang
PDF
Understanding Synthetic Gradients and Decoupled Neural Interfaces Wojciech Marian Czarnecki, Grzegorz Świrszcz, Max Jaderberg, Simon Osindero, Oriol Vinyals, Koray Kavukcuoglu
PDF
Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition Tasha Nagamine, Nima Mesgarani
PDF
Uniform Convergence Rates for Kernel Density Estimation Heinrich Jiang
PDF
Uniform Deviation Bounds for K-Means Clustering Olivier Bachem, Mario Lucic, S. Hamed Hassani, Andreas Krause
PDF
Unifying Task Specification in Reinforcement Learning Martha White
PDF
Unimodal Probability Distributions for Deep Ordinal Classification Christopher Beckham, Christopher Pal
PDF
Unsupervised Learning by Predicting Noise Piotr Bojanowski, Armand Joulin
PDF
Variants of RMSProp and AdaGrad with Logarithmic Regret Bounds Mahesh Chandra Mukkamala, Matthias Hein
PDF
Variational Boosting: Iteratively Refining Posterior Approximations Andrew C. Miller, Nicholas J. Foti, Ryan P. Adams
PDF
Variational Dropout Sparsifies Deep Neural Networks Dmitry Molchanov, Arsenii Ashukha, Dmitry Vetrov
PDF
Variational Inference for Sparse and Undirected Models John Ingraham, Debora Marks
PDF
Variational Policy for Guiding Point Processes Yichen Wang, Grady Williams, Evangelos Theodorou, Le Song
PDF
Video Pixel Networks Nal Kalchbrenner, Aäron Oord, Karen Simonyan, Ivo Danihelka, Oriol Vinyals, Alex Graves, Koray Kavukcuoglu
PDF
Warped Convolutions: Efficient Invariance to Spatial Transformations João F. Henriques, Andrea Vedaldi
PDF
Wasserstein Generative Adversarial Networks Martin Arjovsky, Soumith Chintala, Léon Bottou
PDF
When Can Multi-Site Datasets Be Pooled for Regression? Hypothesis Tests, $\ell_2$-Consistency and Neuroscience Applications Hao Henry Zhou, Yilin Zhang, Vamsi K. Ithapu, Sterling C. Johnson, Grace Wahba, Vikas Singh
PDF
Why Is Posterior Sampling Better than Optimism for Reinforcement Learning? Ian Osband, Benjamin Van Roy
PDF
World of Bits: An Open-Domain Platform for Web-Based Agents Tianlin Shi, Andrej Karpathy, Linxi Fan, Jonathan Hernandez, Percy Liang
PDF
Zero-Inflated Exponential Family Embeddings Li-Ping Liu, David M. Blei
PDF
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning Junhyuk Oh, Satinder Singh, Honglak Lee, Pushmeet Kohli
PDF
ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning Hantian Zhang, Jerry Li, Kaan Kara, Dan Alistarh, Ji Liu, Ce Zhang
PDF
Zonotope Hit-and-Run for Efficient Sampling from Projection DPPs Guillaume Gautier, Rémi Bardenet, Michal Valko
PDF