ICML 2017

434 papers

“Convex Until Proven Guilty”: Dimension-Free Acceleration of Gradient Descent on Non-Convex Functions Yair Carmon, John C. Duchi, Oliver Hinder, Aaron Sidford

PDF

A Birth-Death Process for Feature Allocation Konstantina Palla, David Knowles, Zoubin Ghahramani

PDF

A Closer Look at Memorization in Deep Networks Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron Courville, Yoshua Bengio, Simon Lacoste-Julien

PDF

A Distributional Perspective on Reinforcement Learning Marc G. Bellemare, Will Dabney, Rémi Munos

PDF

A Divergence Bound for Hybrids of MCMC and Variational Inference and an Application to Langevin Dynamics and SGVI Justin Domke

PDF

A Laplacian Framework for Option Discovery in Reinforcement Learning Marlos C. Machado, Marc G. Bellemare, Michael Bowling

PDF

A Richer Theory of Convex Constrained Optimization with Reduced Projections and Improved Rates Tianbao Yang, Qihang Lin, Lijun Zhang

PDF

A Semismooth Newton Method for Fast, Generic Convex Programming Alnur Ali, Eric Wong, J. Zico Kolter

PDF

A Simple Multi-Class Boosting Framework with Theoretical Guarantees and Empirical Proficiency Ron Appel, Pietro Perona

PDF

A Simulated Annealing Based Inexact Oracle for Wasserstein Loss Minimization Jianbo Ye, James Z. Wang, Jia Li

PDF

A Unified Maximum Likelihood Approach for Estimating Symmetric Properties of Discrete Distributions Jayadev Acharya, Hirakendu Das, Alon Orlitsky, Ananda Theertha Suresh

PDF

A Unified Variance Reduction-Based Framework for Nonconvex Low-Rank Matrix Recovery Lingxiao Wang, Xiao Zhang, Quanquan Gu

PDF

A Unified View of Multi-Label Performance Measures Xi-Zhu Wu, Zhi-Hua Zhou

PDF

Accelerating Eulerian Fluid Simulation with Convolutional Networks Jonathan Tompson, Kristofer Schlachter, Pablo Sprechmann, Ken Perlin

PDF

Active Heteroscedastic Regression Kamalika Chaudhuri, Prateek Jain, Nagarajan Natarajan

PDF

Active Learning for Accurate Estimation of Linear Models Carlos Riquelme, Mohammad Ghavamzadeh, Alessandro Lazaric

PDF

Active Learning for Cost-Sensitive Classification Akshay Krishnamurthy, Alekh Agarwal, Tzu-Kuo Huang, Hal Daumé III, John Langford

PDF

Active Learning for Top-$k$ Rank Aggregation from Noisy Comparisons Soheil Mohajer, Changho Suh, Adel Elmahdy

PDF

AdaNet: Adaptive Structural Learning of Artificial Neural Networks Corinna Cortes, Xavier Gonzalvo, Vitaly Kuznetsov, Mehryar Mohri, Scott Yang

PDF

Adapting Kernel Representations Online Using Submodular Maximization Matthew Schlegel, Yangchen Pan, Jiecao Chen, Martha White

PDF

Adaptive Consensus ADMM for Distributed Optimization Zheng Xu, Gavin Taylor, Hao Li, Mário A. T. Figueiredo, Xiaoming Yuan, Tom Goldstein

PDF

Adaptive Feature Selection: Computationally Efficient Online Sparse Linear Regression Under RIP Satyen Kale, Zohar Karnin, Tengyuan Liang, Dávid Pál

PDF

Adaptive Multiple-Arm Identification Jiecao Chen, Xi Chen, Qin Zhang, Yuan Zhou

PDF

Adaptive Neural Networks for Efficient Inference Tolga Bolukbasi, Joseph Wang, Ofer Dekel, Venkatesh Saligrama

PDF

Adaptive Sampling Probabilities for Non-Smooth Optimization Hongseok Namkoong, Aman Sinha, Steve Yadlowsky, John C. Duchi

PDF

Adversarial Feature Matching for Text Generation Yizhe Zhang, Zhe Gan, Kai Fan, Zhi Chen, Ricardo Henao, Dinghan Shen, Lawrence Carin

PDF

Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks Lars Mescheder, Sebastian Nowozin, Andreas Geiger

PDF

Algebraic Variety Models for High-Rank Matrix Completion Greg Ongie, Rebecca Willett, Robert D. Nowak, Laura Balzano

PDF

Algorithmic Stability and Hypothesis Complexity Tongliang Liu, Gábor Lugosi, Gergely Neu, Dacheng Tao

PDF

Algorithms for $\ell_p$ Low-Rank Approximation Flavio Chierichetti, Sreenivas Gollapudi, Ravi Kumar, Silvio Lattanzi, Rina Panigrahy, David P. Woodruff

PDF

An Adaptive Test of Independence with Analytic Kernel Embeddings Wittawat Jitkrittum, Zoltán Szabó, Arthur Gretton

PDF

An Alternative SoftMax Operator for Reinforcement Learning Kavosh Asadi, Michael L. Littman

PDF

An Analytical Formula of Population Gradient for Two-Layered ReLU Network and Its Applications in Convergence and Critical Point Analysis Yuandong Tian

PDF

An Efficient, Sparsity-Preserving, Online Algorithm for Low-Rank Approximation David Anderson, Ming Gu

PDF

An Infinite Hidden Markov Model with Similarity-Biased Transitions Colin Reimer Dawson, Chaofan Huang, Clayton T. Morrison

PDF

Analogical Inference for Multi-Relational Embeddings Hanxiao Liu, Yuexin Wu, Yiming Yang

PDF

Analysis and Optimization of Graph Decompositions by Lifted Multicuts Andrea Horňáková, Jan-Hendrik Lange, Bjoern Andres

PDF

Analytical Guarantees on Numerical Precision of Deep Neural Networks Charbel Sakr, Yongjune Kim, Naresh Shanbhag

PDF

Approximate Newton Methods and Their Local Convergence Haishan Ye, Luo Luo, Zhihua Zhang

PDF

Approximate Steepest Coordinate Descent Sebastian U. Stich, Anant Raj, Martin Jaggi

PDF

Asymmetric Tri-Training for Unsupervised Domain Adaptation Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada

PDF

Asynchronous Distributed Variational Gaussian Process for Regression Hao Peng, Shandian Zhe, Xiao Zhang, Yuan Qi

PDF

Asynchronous Stochastic Gradient Descent with Delay Compensation Shuxin Zheng, Qi Meng, Taifeng Wang, Wei Chen, Nenghai Yu, Zhi-Ming Ma, Tie-Yan Liu

PDF

Attentive Recurrent Comparators Pranav Shyam, Shubham Gupta, Ambedkar Dukkipati

PDF

Automated Curriculum Learning for Neural Networks Alex Graves, Marc G. Bellemare, Jacob Menick, Rémi Munos, Koray Kavukcuoglu

PDF

Automatic Discovery of the Statistical Types of Variables in a Dataset Isabel Valera, Zoubin Ghahramani

PDF

Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning Oron Anschel, Nir Baram, Nahum Shimkin

PDF

Axiomatic Attribution for Deep Networks Mukund Sundararajan, Ankur Taly, Qiqi Yan

PDF

Batched High-Dimensional Bayesian Optimization via Structural Kernel Learning Zi Wang, Chengtao Li, Stefanie Jegelka, Pushmeet Kohli

PDF

Bayesian Boolean Matrix Factorisation Tammo Rukat, Chris C. Holmes, Michalis K. Titsias, Christopher Yau

PDF

Bayesian Inference on Random Simple Graphs with Power Law Degree Distributions Juho Lee, Creighton Heaukulani, Zoubin Ghahramani, Lancelot F. James, Seungjin Choi

PDF

Bayesian Models of Data Streams with Hierarchical Power Priors Andrés Masegosa, Thomas D. Nielsen, Helge Langseth, Darío Ramos-López, Antonio Salmerón, Anders L. Madsen

PDF

Bayesian Optimization with Tree-Structured Dependencies Rodolphe Jenatton, Cedric Archambeau, Javier González, Matthias Seeger

PDF

Being Robust (in High Dimensions) Can Be Practical Ilias Diakonikolas, Gautam Kamath, Daniel M. Kane, Jerry Li, Ankur Moitra, Alistair Stewart

PDF

Beyond Filters: Compact Feature Map for Portable Deep Model Yunhe Wang, Chang Xu, Chao Xu, Dacheng Tao

PDF

Bidirectional Learning for Time-Series Models with Hidden Units Takayuki Osogami, Hiroshi Kajino, Taro Sekiyama

PDF

Boosted Fitted Q-Iteration Samuele Tosatto, Matteo Pirotta, Carlo D’Eramo, Marcello Restelli

PDF

Bottleneck Conditional Density Estimation Rui Shu, Hung H. Bui, Mohammad Ghavamzadeh

PDF

Breaking Locality Accelerates Block Gauss-Seidel Stephen Tu, Shivaram Venkataraman, Ashia C. Wilson, Alex Gittens, Michael I. Jordan, Benjamin Recht

PDF

Canopy Fast Sampling with Cover Trees Manzil Zaheer, Satwik Kottur, Amr Ahmed, José Moura, Alex Smola

PDF

Capacity Releasing Diffusion for Speed and Locality Di Wang, Kimon Fountoulakis, Monika Henzinger, Michael W. Mahoney, Satish Rao

PDF

ChoiceRank: Identifying Preferences from Node Traffic in Networks Lucas Maystre, Matthias Grossglauser

PDF

Clustering by Sum of Norms: Stochastic Incremental Algorithm, Convergence and Cluster Recovery Ashkan Panahi, Devdatt Dubhashi, Fredrik D. Johansson, Chiranjib Bhattacharyya

PDF

Clustering High Dimensional Dynamic Data Streams Vladimir Braverman, Gereon Frahling, Harry Lang, Christian Sohler, Lin F. Yang

PDF

Co-Clustering Through Optimal Transport Charlotte Laclau, Ievgen Redko, Basarab Matei, Younès Bennani, Vincent Brault

PDF

Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study Samuel Ritter, David G. T. Barrett, Adam Santoro, Matt M. Botvinick

PDF

Coherence Pursuit: Fast, Simple, and Robust Subspace Recovery Mostafa Rahmani, George Atia

PDF

Coherent Probabilistic Forecasts for Hierarchical Time Series Souhaib Ben Taieb, James W. Taylor, Rob J. Hyndman

PDF

Collect at Once, Use Effectively: Making Non-Interactive Locally Private Learning Possible Kai Zheng, Wenlong Mou, Liwei Wang

PDF

Combined Group and Exclusive Sparsity for Deep Neural Networks Jaehong Yoon, Sung Ju Hwang

PDF

Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning Yevgen Chebotar, Karol Hausman, Marvin Zhang, Gaurav Sukhatme, Stefan Schaal, Sergey Levine

PDF

Communication-Efficient Algorithms for Distributed Stochastic Principal Component Analysis Dan Garber, Ohad Shamir, Nathan Srebro

PDF

Composing Tree Graphical Models with Persistent Homology Features for Clustering Mixed-Type Data Xiuyan Ni, Novi Quadrianto, Yusu Wang, Chao Chen

PDF

Compressed Sensing Using Generative Models Ashish Bora, Ajil Jalal, Eric Price, Alexandros G. Dimakis

PDF

Conditional Accelerated Lazy Stochastic Gradient Descent Guanghui Lan, Sebastian Pokutta, Yi Zhou, Daniel Zink

PDF

Conditional Image Synthesis with Auxiliary Classifier GANs Augustus Odena, Christopher Olah, Jonathon Shlens

PDF

Confident Multiple Choice Learning Kimin Lee, Changho Hwang, KyoungSoo Park, Jinwoo Shin

PDF

Connected Subgraph Detection with Mirror Descent on SDPs Cem Aksoylar, Lorenzo Orecchia, Venkatesh Saligrama

PDF

Consistency Analysis for Binary Classification Revisited Krzysztof Dembczyński, Wojciech Kotłowski, Oluwasanmi Koyejo, Nagarajan Natarajan

PDF

Consistent K-Clustering Silvio Lattanzi, Sergei Vassilvitskii

PDF

Consistent On-Line Off-Policy Evaluation Assaf Hallak, Shie Mannor

PDF

Constrained Policy Optimization Joshua Achiam, David Held, Aviv Tamar, Pieter Abbeel

PDF

Contextual Decision Processes with Low Bellman Rank Are PAC-Learnable Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert E. Schapire

PDF

Continual Learning Through Synaptic Intelligence Friedemann Zenke, Ben Poole, Surya Ganguli

PDF

Convergence Analysis of Proximal Gradient with Momentum for Nonconvex Optimization Qunwei Li, Yi Zhou, Yingbin Liang, Pramod K. Varshney

PDF

Convex Phase Retrieval Without Lifting via PhaseMax Tom Goldstein, Christoph Studer

PDF

Convexified Convolutional Neural Networks Yuchen Zhang, Percy Liang, Martin J. Wainwright

PDF

Convolutional Sequence to Sequence Learning Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin

PDF

Coordinated Multi-Agent Imitation Learning Hoang M. Le, Yisong Yue, Peter Carr, Patrick Lucey

PDF

Coresets for Vector Summarization with Applications to Network Graphs Dan Feldman, Sedat Ozer, Daniela Rus

PDF

Cost-Optimal Learning of Causal Graphs Murat Kocaoglu, Alex Dimakis, Sriram Vishwanath

PDF

Count-Based Exploration with Neural Density Models Georg Ostrovski, Marc G. Bellemare, Aäron van den Oord, Rémi Munos

PDF

Counterfactual Data-Fusion for Online Reinforcement Learners Andrew Forney, Judea Pearl, Elias Bareinboim

PDF

Coupling Distributed and Symbolic Execution for Natural Language Queries Lili Mou, Zhengdong Lu, Hang Li, Zhi Jin

PDF

Curiosity-Driven Exploration by Self-Supervised Prediction Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell

PDF

Dance Dance Convolution Chris Donahue, Zachary C. Lipton, Julian McAuley

PDF

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning Irina Higgins, Arka Pal, Andrei Rusu, Loic Matthey, Christopher Burgess, Alexander Pritzel, Matthew Botvinick, Charles Blundell, Alexander Lerchner

PDF

Data-Efficient Policy Evaluation Through Behavior Policy Search Josiah P. Hanna, Philip S. Thomas, Peter Stone, Scott Niekum

PDF

Deciding How to Decide: Dynamic Routing in Artificial Neural Networks Mason McGill, Pietro Perona

PDF

Decoupled Neural Interfaces Using Synthetic Gradients Max Jaderberg, Wojciech Marian Czarnecki, Simon Osindero, Oriol Vinyals, Alex Graves, David Silver, Koray Kavukcuoglu

PDF

Deep Bayesian Active Learning with Image Data Yarin Gal, Riashat Islam, Zoubin Ghahramani

PDF

Deep Decentralized Multi-Task Multi-Agent Reinforcement Learning Under Partial Observability Shayegan Omidshafiei, Jason Pazis, Christopher Amato, Jonathan P. How, John Vian

PDF

Deep Generative Models for Relational Data with Side Information Changwei Hu, Piyush Rai, Lawrence Carin

PDF

Deep IV: A Flexible Approach for Counterfactual Prediction Jason Hartford, Greg Lewis, Kevin Leyton-Brown, Matt Taddy

PDF

Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC Yulai Cong, Bo Chen, Hongwei Liu, Mingyuan Zhou

PDF

Deep Spectral Clustering Learning Marc T. Law, Raquel Urtasun, Richard S. Zemel

PDF

Deep Tensor Convolution on Multicores David Budden, Alexander Matveev, Shibani Santurkar, Shraman Ray Chaudhuri, Nir Shavit

PDF

Deep Transfer Learning with Joint Adaptation Networks Mingsheng Long, Han Zhu, Jianmin Wang, Michael I. Jordan

PDF

Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs Michael Gygli, Mohammad Norouzi, Anelia Angelova

PDF

Deep Voice: Real-Time Neural Text-to-Speech Sercan Ö. Arık, Mike Chrzanowski, Adam Coates, Gregory Diamos, Andrew Gibiansky, Yongguo Kang, Xian Li, John Miller, Andrew Ng, Jonathan Raiman, Shubho Sengupta, Mohammad Shoeybi

PDF

DeepBach: A Steerable Model for Bach Chorales Generation Gaëtan Hadjeres, François Pachet, Frank Nielsen

PDF

Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction Wen Sun, Arun Venkatraman, Geoffrey J. Gordon, Byron Boots, J. Andrew Bagnell

PDF

Deletion-Robust Submodular Maximization: Data Summarization with “the Right to Be Forgotten” Baharan Mirzasoleiman, Amin Karbasi, Andreas Krause

PDF

Delta Networks for Optimized Recurrent Network Computation Daniel Neil, Jun Haeng Lee, Tobi Delbruck, Shih-Chii Liu

PDF

Density Level Set Estimation on Manifolds with DBSCAN Heinrich Jiang

PDF

Depth-Width Tradeoffs in Approximating Natural Functions with Neural Networks Itay Safran, Ohad Shamir

PDF

Deriving Neural Architectures from Sequence and Graph Kernels Tao Lei, Wengong Jin, Regina Barzilay, Tommi Jaakkola

PDF

Developing Bug-Free Machine Learning Systems with Formal Mathematics Daniel Selsam, Percy Liang, David L. Dill

PDF

Device Placement Optimization with Reinforcement Learning Azalia Mirhoseini, Hieu Pham, Quoc V. Le, Benoit Steiner, Rasmus Larsen, Yuefeng Zhou, Naveen Kumar, Mohammad Norouzi, Samy Bengio, Jeff Dean

PDF

Diameter-Based Active Learning Christopher Tosh, Sanjoy Dasgupta

PDF

Dictionary Learning Based on Sparse Distribution Tomography Pedram Pad, Farnood Salehi, Elisa Celis, Patrick Thiran, Michael Unser

PDF

Differentiable Programs with Neural Libraries Alexander L. Gaunt, Marc Brockschmidt, Nate Kushman, Daniel Tarlow

PDF

Differentially Private Chi-Squared Test by Unit Circle Mechanism Kazuya Kakizaki, Kazuto Fukuchi, Jun Sakuma

PDF

Differentially Private Clustering in High-Dimensional Euclidean Spaces Maria-Florina Balcan, Travis Dick, Yingyu Liang, Wenlong Mou, Hongyang Zhang

PDF

Differentially Private Learning of Undirected Graphical Models Using Collective Graphical Models Garrett Bernstein, Ryan McKenna, Tao Sun, Daniel Sheldon, Michael Hay, Gerome Miklau

PDF

Differentially Private Ordinary Least Squares Or Sheffet

PDF

Differentially Private Submodular Maximization: Data Summarization in Disguise Marko Mitrovic, Mark Bun, Andreas Krause, Amin Karbasi

PDF

Discovering Discrete Latent Topics with Neural Variational Inference Yishu Miao, Edward Grefenstette, Phil Blunsom

PDF

Dissipativity Theory for Nesterov’s Accelerated Method Bin Hu, Laurent Lessard

PDF

Distributed and Provably Good Seedings for K-Means in Constant Rounds Olivier Bachem, Mario Lucic, Andreas Krause

PDF

Distributed Batch Gaussian Process Optimization Erik A. Daxberger, Bryan Kian Hsiang Low

PDF

Distributed Mean Estimation with Limited Communication Ananda Theertha Suresh, Felix X. Yu, Sanjiv Kumar, H. Brendan McMahan

PDF

Doubly Accelerated Methods for Faster CCA and Generalized Eigendecomposition Zeyuan Allen-Zhu, Yuanzhi Li

PDF

Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization Qi Lei, Ian En-Hsu Yen, Chao-yuan Wu, Inderjit S. Dhillon, Pradeep Ravikumar

PDF

Dropout Inference in Bayesian Neural Networks with Alpha-Divergences Yingzhen Li, Yarin Gal

PDF

Dual Iterative Hard Thresholding: From Non-Convex Sparse Minimization to Non-Smooth Concave Maximization Bo Liu, Xiao-Tong Yuan, Lezi Wang, Qingshan Liu, Dimitris N. Metaxas

PDF

Dual Supervised Learning Yingce Xia, Tao Qin, Wei Chen, Jiang Bian, Nenghai Yu, Tie-Yan Liu

PDF

Dueling Bandits with Weak Regret Bangrui Chen, Peter I. Frazier

PDF

Dynamic Word Embeddings Robert Bamler, Stephan Mandt

PDF

Efficient Distributed Learning with Sparsity Jialei Wang, Mladen Kolar, Nathan Srebro, Tong Zhang

PDF

Efficient Nonmyopic Active Search Shali Jiang, Gustavo Malkomes, Geoff Converse, Alyssa Shofner, Benjamin Moseley, Roman Garnett

PDF

Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret Alina Beygelzimer, Francesco Orabona, Chicheng Zhang

PDF

Efficient Orthogonal Parametrisation of Recurrent Neural Networks Using Householder Reflections Zakaria Mhammedi, Andrew Hellicar, Ashfaqur Rahman, James Bailey

PDF

Efficient Regret Minimization in Non-Convex Games Elad Hazan, Karan Singh, Cyril Zhang

PDF

Efficient SoftMax Approximation for GPUs Édouard Grave, Armand Joulin, Moustapha Cissé, David Grangier, Hervé Jégou

PDF

Emulating the Expert: Inverse Optimization Through Online Learning Andreas Bärmann, Sebastian Pokutta, Oskar Schneider

PDF

End-to-End Differentiable Adversarial Imitation Learning Nir Baram, Oron Anschel, Itai Caspi, Shie Mannor

PDF

End-to-End Learning for Structured Prediction Energy Networks David Belanger, Bishan Yang, Andrew McCallum

PDF

Enumerating Distinct Decision Trees Salvatore Ruggieri

PDF

Equivariance Through Parameter-Sharing Siamak Ravanbakhsh, Jeff Schneider, Barnabás Póczos

PDF

Estimating Individual Treatment Effect: Generalization Bounds and Algorithms Uri Shalit, Fredrik D. Johansson, David Sontag

PDF

Estimating the Unseen from Multiple Populations Aditi Raghunathan, Gregory Valiant, James Zou

PDF

Evaluating Bayesian Models with Posterior Dispersion Indices Alp Kucukelbir, Yixin Wang, David M. Blei

PDF

Evaluating the Variance of Likelihood-Ratio Gradient Estimators Seiya Tokui, Issei Sato

PDF

Exact Inference for Integer Latent-Variable Models Kevin Winner, Debora Sujono, Dan Sheldon

PDF

Exact MAP Inference by Avoiding Fractional Vertices Erik M. Lindgren, Alexandros G. Dimakis, Adam Klivans

PDF

Exploiting Strong Convexity from Data with Primal-Dual First-Order Algorithms Jialei Wang, Lin Xiao

PDF

Failures of Gradient-Based Deep Learning Shai Shalev-Shwartz, Ohad Shamir, Shaked Shammah

PDF

Fairness in Reinforcement Learning Shahin Jabbari, Matthew Joseph, Michael Kearns, Jamie Morgenstern, Aaron Roth

PDF

Fake News Mitigation via Point Process Based Intervention Mehrdad Farajtabar, Jiachen Yang, Xiaojing Ye, Huan Xu, Rakshit Trivedi, Elias Khalil, Shuang Li, Le Song, Hongyuan Zha

PDF

Fast Bayesian Intensity Estimation for the Permanental Process Christian J. Walder, Adrian N. Bishop

PDF

Fast K-Nearest Neighbour Search via Prioritized DCI Ke Li, Jitendra Malik

PDF

Faster Greedy MAP Inference for Determinantal Point Processes Insu Han, Prabhanjan Kambadur, Kyoungsoo Park, Jinwoo Shin

PDF

Faster Principal Component Regression and Stable Matrix Chebyshev Approximation Zeyuan Allen-Zhu, Yuanzhi Li

PDF

FeUdal Networks for Hierarchical Reinforcement Learning Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu

PDF

Follow the Compressed Leader: Faster Online Learning of Eigenvectors and Faster MMWU Zeyuan Allen-Zhu, Yuanzhi Li

PDF

Follow the Moving Leader in Deep Learning Shuai Zheng, James T. Kwok

PDF

Forest-Type Regression with General Losses and Robust Forest Alexander Hanbo Li, Andrew Martin

PDF

Forward and Reverse Gradient-Based Hyperparameter Optimization Luca Franceschi, Michele Donini, Paolo Frasconi, Massimiliano Pontil

PDF

Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for Markov Chain Monte Carlo Umut Şimşekli

PDF

Frame-Based Data Factorizations Sebastian Mair, Ahcène Boubekki, Ulf Brefeld

PDF

From Patches to Images: A Nonparametric Generative Model Geng Ji, Michael C. Hughes, Erik B. Sudderth

PDF

Generalization and Equilibrium in Generative Adversarial Nets (GANs) Sanjeev Arora, Rong Ge, Yingyu Liang, Tengyu Ma, Yi Zhang

PDF

Geometry of Neural Network Loss Surfaces via Random Matrix Theory Jeffrey Pennington, Yasaman Bahri

PDF

Global Optimization of Lipschitz Functions Cédric Malherbe, Nicolas Vayatis

PDF

Globally Induced Forest: A Prepruning Compression Scheme Jean-Michel Begon, Arnaud Joly, Pierre Geurts

PDF

Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs Alon Brutzkus, Amir Globerson

PDF

Gradient Boosted Decision Trees for High Dimensional Sparse Output Si Si, Huan Zhang, S. Sathiya Keerthi, Dhruv Mahajan, Inderjit S. Dhillon, Cho-Jui Hsieh

PDF

Gradient Coding: Avoiding Stragglers in Distributed Learning Rashish Tandon, Qi Lei, Alexandros G. Dimakis, Nikos Karampatziakis

PDF

Gradient Projection Iterative Sketch for Large-Scale Constrained Least-Squares Junqi Tang, Mohammad Golbabaee, Mike E. Davies

PDF

Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling Hairong Liu, Zhenyao Zhu, Xiangang Li, Sanjeev Satheesh

PDF

Grammar Variational Autoencoder Matt J. Kusner, Brooks Paige, José Miguel Hernández-Lobato

PDF

Graph-Based Isometry Invariant Representation Learning Renata Khasanova, Pascal Frossard

PDF

GSOS: Gauss-Seidel Operator Splitting Algorithm for Multi-Term Nonsmooth Convex Composite Optimization Li Shen, Wei Liu, Ganzhao Yuan, Shiqian Ma

PDF

Guarantees for Greedy Maximization of Non-Submodular Functions with Applications Andrew An Bian, Joachim M. Buhmann, Andreas Krause, Sebastian Tschiatschek

PDF

Hierarchy Through Composition with Multitask LMDPs Andrew M. Saxe, Adam C. Earle, Benjamin Rosman

PDF

High Dimensional Bayesian Optimization with Elastic Gaussian Process Santu Rana, Cheng Li, Sunil Gupta, Vu Nguyen, Svetha Venkatesh

PDF

High-Dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation Zhuoran Yang, Krishnakumar Balasubramanian, Han Liu

PDF

High-Dimensional Structured Quantile Regression Vidyashankar Sivakumar, Arindam Banerjee

PDF

High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm Rongda Zhu, Lingxiao Wang, Chengxiang Zhai, Quanquan Gu

PDF

How Close Are the Eigenvectors of the Sample and Actual Covariance Matrices? Andreas Loukas

PDF

How to Escape Saddle Points Efficiently Chi Jin, Rong Ge, Praneeth Netrapalli, Sham M. Kakade, Michael I. Jordan

PDF

Hyperplane Clustering via Dual Principal Component Pursuit Manolis C. Tsakiris, René Vidal

PDF

Identification and Model Testing in Linear Structural Equation Models Using Auxiliary Variables Bryant Chen, Daniel Kumor, Elias Bareinboim

PDF

Identify the Nash Equilibrium in Static Games with Random Payoffs Yichi Zhou, Jialian Li, Jun Zhu

PDF

Identifying Best Interventions Through Online Importance Sampling Rajat Sen, Karthikeyan Shanmugam, Alexandros G. Dimakis, Sanjay Shakkottai

PDF

Image-to-Markup Generation with Coarse-to-Fine Attention Yuntian Deng, Anssi Kanervisto, Jeffrey Ling, Alexander M. Rush

PDF

Improved Variational Autoencoders for Text Modeling Using Dilated Convolutions Zichao Yang, Zhiting Hu, Ruslan Salakhutdinov, Taylor Berg-Kirkpatrick

PDF

Improving Gibbs Sampler Scan Quality with DoGS Ioannis Mitliagkas, Lester Mackey

PDF

Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning Using the Beta Distribution Po-Wei Chou, Daniel Maturana, Sebastian Scherer

PDF

Improving Viterbi Is Hard: Better Runtimes Imply Faster Clique Algorithms Arturs Backurs, Christos Tzamos

PDF

Innovation Pursuit: A New Approach to the Subspace Clustering Problem Mostafa Rahmani, George Atia

PDF

Input Convex Neural Networks Brandon Amos, Lei Xu, J. Zico Kolter

PDF

Input Switched Affine Networks: An RNN Architecture Designed for Interpretability Jakob N. Foerster, Justin Gilmer, Jascha Sohl-Dickstein, Jan Chorowski, David Sussillo

PDF

Interactive Learning from Policy-Dependent Human Feedback James MacGlashan, Mark K. Ho, Robert Loftin, Bei Peng, Guan Wang, David L. Roberts, Matthew E. Taylor, Michael L. Littman

PDF

iSurvive: An Interpretable, Event-Time Prediction Model for mHealth Walter H. Dempsey, Alexander Moreno, Christy K. Scott, Michael L. Dennis, David H. Gustafson, Susan A. Murphy, James M. Rehg

PDF

Iterative Machine Teaching Weiyang Liu, Bo Dai, Ahmad Humayun, Charlene Tay, Chen Yu, Linda B. Smith, James M. Rehg, Le Song

PDF

Joint Dimensionality Reduction and Metric Learning: A Geometric Take Mehrtash Harandi, Mathieu Salzmann, Richard Hartley

PDF

Just Sort It! A Simple and Effective Approach to Active Preference Learning Lucas Maystre, Matthias Grossglauser

PDF

Kernelized Support Tensor Machines Lifang He, Chun-Ta Lu, Guixiang Ma, Shen Wang, Linlin Shen, Philip S. Yu, Ann B. Ragin

PDF

Know-Evolve: Deep Temporal Reasoning for Dynamic Knowledge Graphs Rakshit Trivedi, Hanjun Dai, Yichen Wang, Le Song

PDF

Language Modeling with Gated Convolutional Networks Yann N. Dauphin, Angela Fan, Michael Auli, David Grangier

PDF

Large-Scale Evolution of Image Classifiers Esteban Real, Sherry Moore, Andrew Selle, Saurabh Saxena, Yutaka Leon Suematsu, Jie Tan, Quoc V. Le, Alexey Kurakin

PDF

Latent Feature Lasso Ian En-Hsu Yen, Wei-Cheng Lee, Sung-En Chang, Arun Sai Suggala, Shou-De Lin, Pradeep Ravikumar

PDF

Latent Intention Dialogue Models Tsung-Hsien Wen, Yishu Miao, Phil Blunsom, Steve Young

PDF

Latent LSTM Allocation: Joint Clustering and Non-Linear Dynamic Modeling of Sequence Data Manzil Zaheer, Amr Ahmed, Alexander J. Smola

PDF

Lazifying Conditional Gradient Algorithms Gábor Braun, Sebastian Pokutta, Daniel Zink

PDF

Learned Optimizers That Scale and Generalize Olga Wichrowska, Niru Maheswaranathan, Matthew W. Hoffman, Sergio Gómez Colmenarejo, Misha Denil, Nando de Freitas, Jascha Sohl-Dickstein

PDF

Learning Algorithms for Active Learning Philip Bachman, Alessandro Sordoni, Adam Trischler

PDF

Learning Continuous Semantic Representations of Symbolic Expressions Miltiadis Allamanis, Pankajan Chanthirasegaran, Pushmeet Kohli, Charles Sutton

PDF

Learning Deep Architectures via Generalized Whitened Neural Networks Ping Luo

PDF

Learning Deep Latent Gaussian Models with Markov Chain Monte Carlo Matthew D. Hoffman

PDF

Learning Determinantal Point Processes with Moments and Cycles John Urschel, Victor-Emmanuel Brunel, Ankur Moitra, Philippe Rigollet

PDF

Learning Discrete Representations via Information Maximizing Self-Augmented Training Weihua Hu, Takeru Miyato, Seiya Tokui, Eiichi Matsumoto, Masashi Sugiyama

PDF

Learning from Clinical Judgments: Semi-Markov-Modulated Marked Hawkes Processes for Risk Prognosis Ahmed M. Alaa, Scott Hu, Mihaela van der Schaar

PDF

Learning Gradient Descent: Better Generalization and Longer Horizons Kaifeng Lv, Shunhua Jiang, Jian Li

PDF

Learning Hawkes Processes from Short Doubly-Censored Event Sequences Hongteng Xu, Dixin Luo, Hongyuan Zha

PDF

Learning Hierarchical Features from Deep Generative Models Shengjia Zhao, Jiaming Song, Stefano Ermon

PDF

Learning Important Features Through Propagating Activation Differences Avanti Shrikumar, Peyton Greenside, Anshul Kundaje

PDF

Learning in POMDPs with Monte Carlo Tree Search Sammie Katt, Frans A. Oliehoek, Christopher Amato

PDF

Learning Infinite Layer Networks Without the Kernel Trick Roi Livni, Daniel Carmon, Amir Globerson

PDF

Learning Latent Space Models with Angular Constraints Pengtao Xie, Yuntian Deng, Yi Zhou, Abhimanu Kumar, Yaoliang Yu, James Zou, Eric P. Xing

PDF

Learning Sleep Stages from Radio Signals: A Conditional Adversarial Architecture Mingmin Zhao, Shichao Yue, Dina Katabi, Tommi S. Jaakkola, Matt T. Bianchi

PDF

Learning Stable Stochastic Nonlinear Dynamical Systems Jonas Umlauft, Sandra Hirche

PDF

Learning Texture Manifolds with the Periodic Spatial GAN Urs Bergmann, Nikolay Jetchev, Roland Vollgraf

PDF

Learning the Structure of Generative Models Without Labeled Data Stephen H. Bach, Bryan He, Alexander Ratner, Christopher Ré

PDF

Learning to Aggregate Ordinal Labels by Maximizing Separating Width Guangyong Chen, Shengyu Zhang, Di Lin, Hui Huang, Pheng Ann Heng

PDF

Learning to Align the Source Code to the Compiled Object Code Dor Levy, Lior Wolf

PDF

Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier Joseph Futoma, Sanjay Hariharan, Katherine Heller

PDF

Learning to Discover Cross-Domain Relations with Generative Adversarial Networks Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, Jiwon Kim

PDF

Learning to Discover Sparse Graphical Models Eugene Belilovsky, Kyle Kastner, Gael Varoquaux, Matthew B. Blaschko

PDF

Learning to Generate Long-Term Future via Hierarchical Prediction Ruben Villegas, Jimei Yang, Yuliang Zou, Sungryull Sohn, Xunyu Lin, Honglak Lee

PDF

Learning to Learn Without Gradient Descent by Gradient Descent Yutian Chen, Matthew W. Hoffman, Sergio Gómez Colmenarejo, Misha Denil, Timothy P. Lillicrap, Matt Botvinick, Nando de Freitas

PDF

Leveraging Node Attributes for Incomplete Relational Data He Zhao, Lan Du, Wray Buntine

PDF

Leveraging Union of Subspace Structure to Improve Constrained Clustering John Lipor, Laura Balzano

PDF

Local Bayesian Optimization of Motor Skills Riad Akrour, Dmitry Sorokin, Jan Peters, Gerhard Neumann

PDF

Local-to-Global Bayesian Network Structure Learning Tian Gao, Kshitij Fadnis, Murray Campbell

PDF

Logarithmic Time One-Against-Some Hal Daumé III, Nikos Karampatziakis, John Langford, Paul Mineiro

PDF

Lost Relatives of the Gumbel Trick Matej Balog, Nilesh Tripuraneni, Zoubin Ghahramani, Adrian Weller

PDF

Magnetic Hamiltonian Monte Carlo Nilesh Tripuraneni, Mark Rowland, Zoubin Ghahramani, Richard Turner

PDF

Max-Value Entropy Search for Efficient Bayesian Optimization Zi Wang, Stefanie Jegelka

PDF

Maximum Selection and Ranking Under Noisy Comparisons Moein Falahatgar, Alon Orlitsky, Venkatadheeraj Pichapati, Ananda Theertha Suresh

PDF

McGan: Mean and Covariance Feature Matching GAN Youssef Mroueh, Tom Sercu, Vaibhava Goel

PDF

Measuring Sample Quality with Kernels Jackson Gorham, Lester Mackey

PDF

MEC: Memory-Efficient Convolution for Deep Neural Network Minsik Cho, Daniel Brand

PDF

meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting Xu Sun, Xuancheng Ren, Shuming Ma, Houfeng Wang

PDF

Meritocratic Fairness for Cross-Population Selection Michael Kearns, Aaron Roth, Zhiwei Steven Wu

PDF

Meta Networks Tsendsuren Munkhdalai, Hong Yu

PDF

Minimax Regret Bounds for Reinforcement Learning Mohammad Gheshlaghi Azar, Ian Osband, Rémi Munos

PDF

Minimizing Trust Leaks for Robust Sybil Detection János Höner, Shinichi Nakajima, Alexander Bauer, Klaus-Robert Müller, Nico Görnitz

PDF

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks Chelsea Finn, Pieter Abbeel, Sergey Levine

PDF

Model-Independent Online Learning for Influence Maximization Sharan Vaswani, Branislav Kveton, Zheng Wen, Mohammad Ghavamzadeh, Laks V. S. Lakshmanan, Mark Schmidt

PDF

Modular Multitask Reinforcement Learning with Policy Sketches Jacob Andreas, Dan Klein, Sergey Levine

PDF

Multi-Class Optimal Margin Distribution Machine Teng Zhang, Zhi-Hua Zhou

PDF

Multi-Fidelity Bayesian Optimisation with Continuous Approximations Kirthevasan Kandasamy, Gautam Dasarathy, Jeff Schneider, Barnabás Póczos

PDF

Multi-Objective Bandits: Optimizing the Generalized Gini Index Róbert Busa-Fekete, Balázs Szörényi, Paul Weng, Shie Mannor

PDF

Multi-Task Learning with Labeled and Unlabeled Tasks Anastasia Pentina, Christoph H. Lampert

PDF

Multichannel End-to-End Speech Recognition Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey

PDF

Multilabel Classification with Group Testing and Codes Shashanka Ubaru, Arya Mazumdar

PDF

Multilevel Clustering via Wasserstein Means Nhat Ho, XuanLong Nguyen, Mikhail Yurochkin, Hung Hai Bui, Viet Huynh, Dinh Phung

PDF

Multiple Clustering Views from Multiple Uncertain Experts Yale Chang, Junxiang Chen, Michael H. Cho, Peter J. Castaldi, Edwin K. Silverman, Jennifer G. Dy

PDF

Multiplicative Normalizing Flows for Variational Bayesian Neural Networks Christos Louizos, Max Welling

PDF

Natasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter Zeyuan Allen-Zhu

PDF

Near-Optimal Design of Experiments via Regret Minimization Zeyuan Allen-Zhu, Yuanzhi Li, Aarti Singh, Yining Wang

PDF

Nearly Optimal Robust Matrix Completion Yeshwanth Cherapanamjeri, Kartik Gupta, Prateek Jain

PDF

Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders Jesse Engel, Cinjon Resnick, Adam Roberts, Sander Dieleman, Mohammad Norouzi, Douglas Eck, Karen Simonyan

PDF

Neural Episodic Control Alexander Pritzel, Benigno Uria, Sriram Srinivasan, Adrià Puigdomènech Badia, Oriol Vinyals, Demis Hassabis, Daan Wierstra, Charles Blundell

PDF

Neural Message Passing for Quantum Chemistry Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, George E. Dahl

PDF

Neural Networks and Rational Functions Matus Telgarsky

PDF

Neural Optimizer Search with Reinforcement Learning Irwan Bello, Barret Zoph, Vijay Vasudevan, Quoc V. Le

PDF

Neural Taylor Approximations: Convergence and Exploration in Rectifier Networks David Balduzzi, Brian McWilliams, Tony Butler-Yeoman

PDF

No Spurious Local Minima in Nonconvex Low Rank Problems: A Unified Geometric Analysis Rong Ge, Chi Jin, Yi Zheng

PDF

Nonnegative Matrix Factorization for Time Series Recovery from a Few Temporal Aggregates Jiali Mei, Yohann De Castro, Yannig Goude, Georges Hébrail

PDF

Nonparanormal Information Estimation Shashank Singh, Barnabás Póczos

PDF

Nyström Method with Kernel K-Means++ Samples as Landmarks Dino Oglic, Thomas Gärtner

PDF

On Approximation Guarantees for Greedy Low Rank Optimization Rajiv Khanna, Ethan R. Elenberg, Alexandros G. Dimakis, Joydeep Ghosh, Sahand Negahban

PDF

On Calibration of Modern Neural Networks Chuan Guo, Geoff Pleiss, Yu Sun, Kilian Q. Weinberger

PDF

On Context-Dependent Clustering of Bandits Claudio Gentile, Shuai Li, Purushottam Kar, Alexandros Karatzoglou, Giovanni Zappella, Evans Etrue

PDF

On Kernelized Multi-Armed Bandits Sayak Ray Chowdhury, Aditya Gopalan

PDF

On Mixed Memberships and Symmetric Nonnegative Matrix Factorizations Xueyu Mao, Purnamrita Sarkar, Deepayan Chakrabarti

PDF

On Orthogonality and Learning Recurrent Networks with Long Term Dependencies Eugene Vorontsov, Chiheb Trabelsi, Samuel Kadoury, Chris Pal

PDF

On Relaxing Determinism in Arithmetic Circuits Arthur Choi, Adnan Darwiche

PDF

On the Expressive Power of Deep Neural Networks Maithra Raghu, Ben Poole, Jon Kleinberg, Surya Ganguli, Jascha Sohl-Dickstein

PDF

On the Iteration Complexity of Support Recovery via Hard Thresholding Pursuit Jie Shen, Ping Li

PDF

On the Projection Operator to a Three-View Cardinality Constrained Set Haichuan Yang, Shupeng Gui, Chuyang Ke, Daniel Stefankovic, Ryohei Fujimaki, Ji Liu

PDF

On the Sampling Problem for Kernel Quadrature François-Xavier Briol, Chris J. Oates, Jon Cockayne, Wilson Ye Chen, Mark Girolami

PDF

Online and Linear-Time Attention by Enforcing Monotonic Alignments Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck

PDF

Online Learning to Rank in Stochastic Click Models Masrour Zoghi, Tomas Tunys, Mohammad Ghavamzadeh, Branislav Kveton, Csaba Szepesvari, Zheng Wen

PDF

Online Learning with Local Permutations and Delayed Feedback Ohad Shamir, Liran Szlak

PDF

Online Partial Least Square Optimization: Dropping Convexity for Better Efficiency and Scalability Zhehui Chen, Lin F. Yang, Chris Junchi Li, Tuo Zhao

PDF

Optimal Algorithms for Smooth and Strongly Convex Distributed Optimization in Networks Kevin Scaman, Francis Bach, Sébastien Bubeck, Yin Tat Lee, Laurent Massoulié

PDF

Optimal and Adaptive Off-Policy Evaluation in Contextual Bandits Yu-Xiang Wang, Alekh Agarwal, Miroslav Dudík

PDF

Optimal Densification for Fast and Accurate Minwise Hashing Anshumali Shrivastava

PDF

OptNet: Differentiable Optimization as a Layer in Neural Networks Brandon Amos, J. Zico Kolter

PDF

Oracle Complexity of Second-Order Methods for Finite-Sum Problems Yossi Arjevani, Ohad Shamir

PDF

Ordinal Graphical Models: A Tale of Two Approaches Arun Sai Suggala, Eunho Yang, Pradeep Ravikumar

PDF

Orthogonalized ALS: A Theoretically Principled Tensor Decomposition Algorithm for Practical Use Vatsal Sharan, Gregory Valiant

PDF

Pain-Free Random Differential Privacy with Sensitivity Sampling Benjamin I. P. Rubinstein, Francesco Aldà

PDF

Parallel and Distributed Thompson Sampling for Large-Scale Accelerated Exploration of Chemical Space José Miguel Hernández-Lobato, James Requeima, Edward O. Pyzer-Knapp, Alán Aspuru-Guzik

PDF

Parallel Multiscale Autoregressive Density Estimation Scott Reed, Aäron van den Oord, Nal Kalchbrenner, Sergio Gómez Colmenarejo, Ziyu Wang, Yutian Chen, Dan Belov, Nando de Freitas

PDF

Parseval Networks: Improving Robustness to Adversarial Examples Moustapha Cisse, Piotr Bojanowski, Edouard Grave, Yann Dauphin, Nicolas Usunier

PDF

Partitioned Tensor Factorizations for Learning Mixed Membership Models Zilong Tan, Sayan Mukherjee

PDF

PixelCNN Models with Auxiliary Variables for Natural Image Modeling Alexander Kolesnikov, Christoph H. Lampert

PDF

Post-Inference Prior Swapping Willie Neiswanger, Eric Xing

PDF

Practical Gauss-Newton Optimisation for Deep Learning Aleksandar Botev, Hippolyt Ritter, David Barber

PDF

Prediction and Control with Temporal Segment Models Nikhil Mishra, Pieter Abbeel, Igor Mordatch

PDF

Prediction Under Uncertainty in Sparse Spectrum Gaussian Processes with Applications to Filtering and Control Yunpeng Pan, Xinyan Yan, Evangelos A. Theodorou, Byron Boots

PDF

Preferential Bayesian Optimization Javier González, Zhenwen Dai, Andreas Damianou, Neil D. Lawrence

PDF

Priv’IT: Private and Sample Efficient Identity Testing Bryan Cai, Constantinos Daskalakis, Gautam Kamath

PDF

Probabilistic Path Hamiltonian Monte Carlo Vu Dinh, Arman Bilge, Cheng Zhang, Frederick A. Matsen IV

PDF

Probabilistic Submodular Maximization in Sub-Linear Time Serban Stan, Morteza Zadimoghaddam, Andreas Krause, Amin Karbasi

PDF

Programming with a Differentiable Forth Interpreter Matko Bošnjak, Tim Rocktäschel, Jason Naradowsky, Sebastian Riedel

PDF

Projection-Free Distributed Online Learning in Networks Wenpeng Zhang, Peilin Zhao, Wenwu Zhu, Steven C. H. Hoi, Tong Zhang

PDF

ProtoNN: Compressed and Accurate kNN for Resource-Scarce Devices Chirag Gupta, Arun Sai Suggala, Ankit Goyal, Harsha Vardhan Simhadri, Bhargavi Paranjape, Ashish Kumar, Saurabh Goyal, Raghavendra Udupa, Manik Varma, Prateek Jain

PDF

Provable Alternating Gradient Descent for Non-Negative Matrix Factorization with Strong Correlations Yuanzhi Li, Yingyu Liang

PDF

Provably Optimal Algorithms for Generalized Linear Contextual Bandits Lihong Li, Yu Lu, Dengyong Zhou

PDF

Prox-PDA: The Proximal Primal-Dual Algorithm for Fast Distributed Nonconvex Optimization and Learning over Networks Mingyi Hong, Davood Hajinezhad, Ming-Min Zhao

PDF

Random Feature Expansions for Deep Gaussian Processes Kurt Cutajar, Edwin V. Bonilla, Pietro Michiardi, Maurizio Filippone

PDF

Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees Haim Avron, Michael Kapralov, Cameron Musco, Christopher Musco, Ameya Velingker, Amir Zandieh

PDF

Re-Revisiting Learning on Hypergraphs: Confidence Interval and Subgradient Method Chenzi Zhang, Shuguang Hu, Zhihao Gavin Tang, T-H. Hubert Chan

PDF

Real-Time Adaptive Image Compression Oren Rippel, Lubomir Bourdev

PDF

Recovery Guarantees for One-Hidden-Layer Neural Networks Kai Zhong, Zhao Song, Prateek Jain, Peter L. Bartlett, Inderjit S. Dhillon

PDF

Recurrent Highway Networks Julian Georg Zilly, Rupesh Kumar Srivastava, Jan Koutník, Jürgen Schmidhuber

PDF

Recursive Partitioning for Personalization Using Observational Data Nathan Kallus

PDF

Reduced Space and Faster Convergence in Imperfect-Information Games via Pruning Noam Brown, Tuomas Sandholm

PDF

Regret Minimization in Behaviorally-Constrained Zero-Sum Games Gabriele Farina, Christian Kroer, Tuomas Sandholm

PDF

Regularising Non-Linear Models Using Feature Side-Information Amina Mollaysa, Pablo Strasser, Alexandros Kalousis

PDF

Reinforcement Learning with Deep Energy-Based Policies Tuomas Haarnoja, Haoran Tang, Pieter Abbeel, Sergey Levine

PDF

Relative Fisher Information and Natural Gradient for Learning Large Modular Models Ke Sun, Frank Nielsen

PDF

Resource-Efficient Machine Learning in 2 KB RAM for the Internet of Things Ashish Kumar, Saurabh Goyal, Manik Varma

PDF

Risk Bounds for Transferring Representations with and Without Fine-Tuning Daniel McNamara, Maria-Florina Balcan

PDF

Robust Adversarial Reinforcement Learning Lerrel Pinto, James Davidson, Rahul Sukthankar, Abhinav Gupta

PDF

Robust Budget Allocation via Continuous Submodular Functions Matthew Staib, Stefanie Jegelka

PDF

Robust Gaussian Graphical Model Estimation with Arbitrary Corruption Lingxiao Wang, Quanquan Gu

PDF

Robust Guarantees of Stochastic Greedy Algorithms Avinatan Hassidim, Yaron Singer

PDF

Robust Probabilistic Modeling with Bayesian Data Reweighting Yixin Wang, Alp Kucukelbir, David M. Blei

PDF

Robust Structured Estimation with Single-Index Models Sheng Chen, Arindam Banerjee

PDF

Robust Submodular Maximization: A Non-Uniform Partitioning Approach Ilija Bogunovic, Slobodan Mitrović, Jonathan Scarlett, Volkan Cevher

PDF

RobustFill: Neural Program Learning Under Noisy I/O Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli

PDF

Rule-Enhanced Penalized Regression by Column Generation Using Rectangular Maximum Agreement Jonathan Eckstein, Noam Goldberg, Ai Kagawa

PDF

Safety-Aware Algorithms for Adversarial Contextual Bandit Wen Sun, Debadeepta Dey, Ashish Kapoor

PDF

SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient Lam M. Nguyen, Jie Liu, Katya Scheinberg, Martin Takáč

PDF

Scalable Bayesian Rule Lists Hongyu Yang, Cynthia Rudin, Margo Seltzer

PDF

Scalable Generative Models for Multi-Label Learning with Missing Labels Vikas Jain, Nirbhay Modhe, Piyush Rai

PDF

Scalable Multi-Class Gaussian Process Classification Using Expectation Propagation Carlos Villacampa-Calvo, Daniel Hernández-Lobato

PDF

Scaling up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction Weizhong Zhang, Bin Hong, Wei Liu, Jieping Ye, Deng Cai, Xiaofei He, Jie Wang

PDF

Schema Networks: Zero-Shot Transfer with a Generative Causal Model of Intuitive Physics Ken Kansky, Tom Silver, David A. Mély, Mohamed Eldawy, Miguel Lázaro-Gredilla, Xinghua Lou, Nimrod Dorfman, Szymon Sidor, Scott Phoenix, Dileep George

PDF

Second-Order Kernel Online Convex Optimization with Adaptive Sketching Daniele Calandriello, Alessandro Lazaric, Michal Valko

PDF

Selective Inference for Sparse High-Order Interaction Models Shinya Suzumura, Kazuya Nakagawa, Yuta Umezu, Koji Tsuda, Ichiro Takeuchi

PDF

Self-Paced Co-Training Fan Ma, Deyu Meng, Qi Xie, Zina Li, Xuanyi Dong

PDF

Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data Tomoya Sakai, Marthinus Christoffel du Plessis, Gang Niu, Masashi Sugiyama

PDF

Sequence Modeling via Segmentations Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng

PDF

Sequence to Better Sequence: Continuous Revision of Combinatorial Structures Jonas Mueller, David Gifford, Tommi Jaakkola

PDF

Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-Control Natasha Jaques, Shixiang Gu, Dzmitry Bahdanau, José Miguel Hernández-Lobato, Richard E. Turner, Douglas Eck

PDF

Sharp Minima Can Generalize for Deep Nets Laurent Dinh, Razvan Pascanu, Samy Bengio, Yoshua Bengio

PDF

Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation Yacine Jernite, Anna Choromanska, David Sontag

PDF

Sketched Ridge Regression: Optimization Perspective, Statistical Perspective, and Model Averaging Shusen Wang, Alex Gittens, Michael W. Mahoney

PDF

Sliced Wasserstein Kernel for Persistence Diagrams Mathieu Carrière, Marco Cuturi, Steve Oudot

PDF

Soft-DTW: A Differentiable Loss Function for Time-Series Marco Cuturi, Mathieu Blondel

PDF

Source-Target Similarity Modelings for Multi-Source Transfer Gaussian Process Regression Pengfei Wei, Ramon Sagarna, Yiping Ke, Yew-Soon Ong, Chi-Keong Goh

PDF

Sparse + Group-Sparse Dirty Models: Statistical Guarantees Without Unreasonable Conditions and a Case for Non-Convexity Eunho Yang, Aurélie C. Lozano

PDF

Spectral Learning from a Single Trajectory Under Finite-State Policies Borja Balle, Odalric-Ambrym Maillard

PDF

Spherical Structured Feature Maps for Kernel Approximation Yueming Lyu

PDF

SPLICE: Fully Tractable Hierarchical Extension of ICA with Pooling Jun-ichiro Hirayama, Aapo Hyvärinen, Motoaki Kawanabe

PDF

SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization Juyong Kim, Yookoon Park, Gunhee Kim, Sung Ju Hwang

PDF

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning Jakob Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson

PDF

State-Frequency Memory Recurrent Neural Networks Hao Hu, Guo-Jun Qi

PDF

Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening Mohsen Ahmadi Fahandar, Eyke Hüllermeier, Inés Couso

PDF

StingyCD: Safely Avoiding Wasteful Updates in Coordinate Descent Tyler B. Johnson, Carlos Guestrin

PDF

Stochastic Adaptive Quasi-Newton Methods for Minimizing Expected Values Chaoxu Zhou, Wenbo Gao, Donald Goldfarb

PDF

Stochastic Bouncy Particle Sampler Ari Pakman, Dar Gilboa, David Carlson, Liam Paninski

PDF

Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence Yi Xu, Qihang Lin, Tianbao Yang

PDF

Stochastic DCA for the Large-Sum of Non-Convex Functions Problem and Its Application to Group Variable Selection in Classification Hoai An Le Thi, Hoai Minh Le, Duy Nhat Phan, Bach Tran

PDF

Stochastic Generative Hashing Bo Dai, Ruiqi Guo, Sanjiv Kumar, Niao He, Le Song

PDF

Stochastic Gradient MCMC Methods for Hidden Markov Models Yi-An Ma, Nicholas J. Foti, Emily B. Fox

PDF

Stochastic Gradient Monomial Gamma Sampler Yizhe Zhang, Changyou Chen, Zhe Gan, Ricardo Henao, Lawrence Carin

PDF

Stochastic Modified Equations and Adaptive Stochastic Gradient Algorithms Qianxiao Li, Cheng Tai, Weinan E

PDF

Stochastic Variance Reduction Methods for Policy Evaluation Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou

PDF

Strong NP-Hardness for Sparse Optimization with Concave Penalty Functions Yichen Chen, Dongdong Ge, Mengdi Wang, Zizhuo Wang, Yinyu Ye, Hao Yin

PDF

Strongly-Typed Agents Are Guaranteed to Interact Safely David Balduzzi

PDF

Sub-Sampled Cubic Regularization for Non-Convex Optimization Jonas Moritz Kohler, Aurelien Lucchi

PDF

Tensor Balancing on Statistical Manifold Mahito Sugiyama, Hiroyuki Nakahara, Koji Tsuda

PDF

Tensor Belief Propagation Andrew Wrigley, Wee Sun Lee, Nan Ye

PDF

Tensor Decomposition via Simultaneous Power Iteration Po-An Wang, Chi-Jen Lu

PDF

Tensor Decomposition with Smoothness Masaaki Imaizumi, Kohei Hayashi

PDF

Tensor-Train Recurrent Neural Networks for Video Classification Yinchong Yang, Denis Krompass, Volker Tresp

PDF

The Loss Surface of Deep and Wide Neural Networks Quynh Nguyen, Matthias Hein

PDF

The Predictron: End-to-End Learning and Planning David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

PDF

The Price of Differential Privacy for Online Learning Naman Agarwal, Karan Singh

PDF

The Sample Complexity of Online One-Class Collaborative Filtering Reinhard Heckel, Kannan Ramchandran

PDF

The Shattered Gradients Problem: If Resnets Are the Answer, Then What Is the Question? David Balduzzi, Marcus Frean, Lennox Leary, J. P. Lewis, Kurt Wan-Duo Ma, Brian McWilliams

PDF

The Statistical Recurrent Unit Junier B. Oliva, Barnabás Póczos, Jeff Schneider

PDF

Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank Liang Zhao, Siyu Liao, Yanzhi Wang, Zhe Li, Jian Tang, Bo Yuan

PDF

Tight Bounds for Approximate Carathéodory and Beyond Vahab Mirrokni, Renato Paes Leme, Adrian Vladu, Sam Chiu-wai Wong

PDF

Toward Controlled Generation of Text Zhiting Hu, Zichao Yang, Xiaodan Liang, Ruslan Salakhutdinov, Eric P. Xing

PDF

Toward Efficient and Accurate Covariance Matrix Estimation on Compressed Data Xixian Chen, Michael R. Lyu, Irwin King

PDF

Towards K-Means-Friendly Spaces: Simultaneous Deep Learning and Clustering Bo Yang, Xiao Fu, Nicholas D. Sidiropoulos, Mingyi Hong

PDF

Tunable Efficient Unitary Neural Networks (EUNN) and Their Application to RNNs Li Jing, Yichen Shen, Tena Dubcek, John Peurifoy, Scott Skirlo, Yann LeCun, Max Tegmark, Marin Soljačić

PDF

Uncertainty Assessment and False Discovery Rate Control in High-Dimensional Granger Causal Inference Aditya Chaudhry, Pan Xu, Quanquan Gu

PDF

Uncorrelation and Evenness: A New Diversity-Promoting Regularizer Pengtao Xie, Aarti Singh, Eric P. Xing

PDF

Uncovering Causality from Multivariate Hawkes Integrated Cumulants Massil Achab, Emmanuel Bacry, Stéphane Gaïffas, Iacopo Mastromatteo, Jean-François Muzy

PDF

Understanding Black-Box Predictions via Influence Functions Pang Wei Koh, Percy Liang

PDF

Understanding Synthetic Gradients and Decoupled Neural Interfaces Wojciech Marian Czarnecki, Grzegorz Świrszcz, Max Jaderberg, Simon Osindero, Oriol Vinyals, Koray Kavukcuoglu

PDF

Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition Tasha Nagamine, Nima Mesgarani

PDF

Uniform Convergence Rates for Kernel Density Estimation Heinrich Jiang

PDF

Uniform Deviation Bounds for K-Means Clustering Olivier Bachem, Mario Lucic, S. Hamed Hassani, Andreas Krause

PDF

Unifying Task Specification in Reinforcement Learning Martha White

PDF

Unimodal Probability Distributions for Deep Ordinal Classification Christopher Beckham, Christopher Pal

PDF

Unsupervised Learning by Predicting Noise Piotr Bojanowski, Armand Joulin

PDF

Variants of RMSProp and AdaGrad with Logarithmic Regret Bounds Mahesh Chandra Mukkamala, Matthias Hein

PDF

Variational Boosting: Iteratively Refining Posterior Approximations Andrew C. Miller, Nicholas J. Foti, Ryan P. Adams

PDF

Variational Dropout Sparsifies Deep Neural Networks Dmitry Molchanov, Arsenii Ashukha, Dmitry Vetrov

PDF

Variational Inference for Sparse and Undirected Models John Ingraham, Debora Marks

PDF

Variational Policy for Guiding Point Processes Yichen Wang, Grady Williams, Evangelos Theodorou, Le Song

PDF

Video Pixel Networks Nal Kalchbrenner, Aäron van den Oord, Karen Simonyan, Ivo Danihelka, Oriol Vinyals, Alex Graves, Koray Kavukcuoglu

PDF

Warped Convolutions: Efficient Invariance to Spatial Transformations João F. Henriques, Andrea Vedaldi

PDF

Wasserstein Generative Adversarial Networks Martin Arjovsky, Soumith Chintala, Léon Bottou

PDF

When Can Multi-Site Datasets Be Pooled for Regression? Hypothesis Tests, $\ell_2$-Consistency and Neuroscience Applications Hao Henry Zhou, Yilin Zhang, Vamsi K. Ithapu, Sterling C. Johnson, Grace Wahba, Vikas Singh

PDF

Why Is Posterior Sampling Better than Optimism for Reinforcement Learning? Ian Osband, Benjamin Van Roy

PDF

World of Bits: An Open-Domain Platform for Web-Based Agents Tianlin Shi, Andrej Karpathy, Linxi Fan, Jonathan Hernandez, Percy Liang

PDF

Zero-Inflated Exponential Family Embeddings Li-Ping Liu, David M. Blei

PDF

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning Junhyuk Oh, Satinder Singh, Honglak Lee, Pushmeet Kohli

PDF

ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning Hantian Zhang, Jerry Li, Kaan Kara, Dan Alistarh, Ji Liu, Ce Zhang

PDF

Zonotope Hit-and-Run for Efficient Sampling from Projection DPPs Guillaume Gautier, Rémi Bardenet, Michal Valko

PDF