ICLR 2017

309 papers

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Dan Hendrycks, Kevin Gimpel

A Compare-Aggregate Model for Matching Text Sequences Shuohang Wang, Jing Jiang

A Compositional Object-Based Approach to Learning Physical Dynamics Michael Chang, Tomer D. Ullman, Antonio Torralba, Joshua B. Tenenbaum

PDF

A Differentiable Physics Engine for Deep Learning in Robotics Jonas Degrave, Michiel Hermans, Joni Dambre, Francis Wyffels

PDF

A Learned Representation for Artistic Style Vincent Dumoulin, Jonathon Shlens, Manjunath Kudlur

PDF

A Recurrent Neural Network Without Chaos Thomas Laurent, James von Brecht

PDF

A Simple but Tough-to-Beat Baseline for Sentence Embeddings Sanjeev Arora, Yingyu Liang, Tengyu Ma

PDF

A Smooth Optimisation Perspective on Training Feedforward Neural Networks Hao Shen

PDF

A Structured Self-Attentive Sentence Embedding Zhouhan Lin, Minwei Feng, Cícero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, Yoshua Bengio

PDF

A Theoretical Framework for Robustness of (Deep) Classifiers Against Adversarial Samples Beilun Wang, Ji Gao, Yanjun Qi

PDF

Accelerating Eulerian Fluid Simulation with Convolutional Networks Jonathan Tompson, Kristofer Schlachter, Pablo Sprechmann, Ken Perlin

PDF

Accelerating SGD for Distributed Deep-Learning Using an Approximted Hessian Matrix Sébastien M. R. Arnold, Chunming Wang

PDF

Adaptive Feature Abstraction for Translating Video to Language Yunchen Pu, Martin Renqiang Min, Zhe Gan, Lawrence Carin

PDF

Adversarial Attacks on Neural Network Policies Sandy H. Huang, Nicolas Papernot, Ian J. Goodfellow, Yan Duan, Pieter Abbeel

PDF

Adversarial Discriminative Domain Adaptation (workshop Extended Abstract) Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

PDF

Adversarial Examples for Semantic Image Segmentation Volker Fischer, Mummadi Chaithanya Kumar, Jan Hendrik Metzen, Thomas Brox

PDF

Adversarial Examples in the Physical World Alexey Kurakin, Ian J. Goodfellow, Samy Bengio

PDF

Adversarial Feature Learning Jeff Donahue, Philipp Krähenbühl, Trevor Darrell

PDF

Adversarial Machine Learning at Scale Alexey Kurakin, Ian J. Goodfellow, Samy Bengio

PDF

Adversarial Training Methods for Semi-Supervised Text Classification Takeru Miyato, Andrew M. Dai, Ian J. Goodfellow

PDF

Adversarially Learned Inference Vincent Dumoulin, Ishmael Belghazi, Ben Poole, Alex Lamb, Martín Arjovsky, Olivier Mastropietro, Aaron C. Courville

PDF

Amortised MAP Inference for Image Super-Resolution Casper Kaae Sønderby, Jose Caballero, Lucas Theis, Wenzhe Shi, Ferenc Huszár

PDF

An Actor-Critic Algorithm for Sequence Prediction Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron C. Courville, Yoshua Bengio

PDF

An Information-Theoretic Framework for Fast and Robust Unsupervised Learning via Neural Population Infomax Wentao Huang, Kechen Zhang

PDF

Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization Xun Huang, Serge J. Belongie

PDF

Attend, Adapt and Transfer: Attentive Deep Architecture for Adaptive Transfer from Multiple Sources in the Same Domain Janarthanan Rajendran, Aravind S. Lakshminarayanan, Mitesh M. Khapra, P. Prasanna, Balaraman Ravindran

PDF

Audio Super-Resolution Using Neural Networks Volodymyr Kuleshov, S. Zayd Enam, Stefano Ermon

PDF

Autoencoding Variational Inference for Topic Models Akash Srivastava, Charles Sutton

PDF

Automated Generation of Multilingual Clusters for the Evaluation of Distributed Representations Philip Blair, Yuval Merhav, Joel Barry

PDF

Automatic Rule Extraction from Long Short Term Memory Networks W. James Murdoch, Arthur Szlam

PDF

Batch Policy Gradient Methods for Improving Neural Conversation Models Kirthevasan Kandasamy, Yoram Bachrach, Ryota Tomioka, Daniel Tarlow, David Carter

PDF

Beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework Irina Higgins, Loïc Matthey, Arka Pal, Christopher P. Burgess, Xavier Glorot, Matthew M. Botvinick, Shakir Mohamed, Alexander Lerchner

PDF

Bidirectional Attention Flow for Machine Comprehension Min Joon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi

PDF

Bit-Pragmatic Deep Neural Network Computing Jorge Albericio, Patrick Judd, Alberto Delmas, Sayeh Sharify, Andreas Moshovos

PDF

Calibrating Energy-Based Generative Adversarial Networks Zihang Dai, Amjad Almahairi, Philip Bachman, Eduard H. Hovy, Aaron C. Courville

PDF

Capacity and Trainability in Recurrent Neural Networks Jasmine Collins, Jascha Sohl-Dickstein, David Sussillo

PDF

Categorical Reparameterization with Gumbel-SoftMax Eric Jang, Shixiang Gu, Ben Poole

PDF

Central Moment Discrepancy (CMD) for Domain-Invariant Representation Learning Werner Zellinger, Thomas Grubinger, Edwin Lughofer, Thomas Natschläger, Susanne Saminger-Platz

PDF

Changing Model Behavior at Test-Time Using Reinforcement Learning Augustus Odena, Dieterich Lawson, Christopher Olah

PDF

Char2Wav: End-to-End Speech Synthesis Jose Sotelo, Soroush Mehri, Kundan Kumar, João Felipe Santos, Kyle Kastner, Aaron C. Courville, Yoshua Bengio

PDF

Charged Point Normalization: An Efficient Solution to the Saddle Point Problem Armen Aghajanyan

PDF

Combining Policy Gradient and Q-Learning Brendan O'Donoghue, Rémi Munos, Koray Kavukcuoglu, Volodymyr Mnih

PDF

CommAI: Evaluating the First Steps Towards a Useful General AI Marco Baroni, Armand Joulin, Allan Jabri, Germán Kruszewski, Angeliki Lazaridou, Klemen Simonic, Tomás Mikolov

PDF

Compact Embedding of Binary-Coded Inputs and Outputs Using Bloom Filters Joan Serrà, Alexandros Karatzoglou

PDF

Compositional Kernel Machines Robert Gens, Pedro M. Domingos

PDF

Coupling Distributed and Symbolic Execution for Natural Language Queries Lili Mou, Zhengdong Lu, Hang Li, Zhi Jin

PDF

Dance Dance Convolution Chris Donahue, Zachary C. Lipton, Julian J. McAuley

PDF

Data Noising as Smoothing in Neural Network Language Models Ziang Xie, Sida I. Wang, Jiwei Li, Daniel Lévy, Aiming Nie, Dan Jurafsky, Andrew Y. Ng

PDF

Dataset Augmentation in Feature Space Terrance DeVries, Graham W. Taylor

PDF

De Novo Drug Design with Deep Generative Models : An Empirical Study Mehdi Cherti, Balázs Kégl, Akin Kazakçi

PDF

Decomposing Motion and Content for Natural Video Sequence Prediction Ruben Villegas, Jimei Yang, Seunghoon Hong, Xunyu Lin, Honglak Lee

PDF

Deep Biaffine Attention for Neural Dependency Parsing Timothy Dozat, Christopher D. Manning

PDF

Deep Information Propagation Samuel S. Schoenholz, Justin Gilmer, Surya Ganguli, Jascha Sohl-Dickstein

PDF

Deep Kernel Machines via the Kernel Reparametrization Trick Jovana Mitrovic, Dino Sejdinovic, Yee Whye Teh

PDF

Deep Learning with Dynamic Computation Graphs Moshe Looks, Marcello Herreshoff, DeLesley Hutchins, Peter Norvig

PDF

Deep Learning with Sets and Point Clouds Siamak Ravanbakhsh, Jeff G. Schneider, Barnabás Póczos

PDF

Deep Multi-Task Representation Learning: A Tensor Factorisation Approach Yongxin Yang, Timothy M. Hospedales

PDF

Deep Nets Don't Learn via Memorization David Krueger, Nicolas Ballas, Stanislaw Jastrzebski, Devansh Arpit, Maxinder S. Kanwal, Tegan Maharaj, Emmanuel Bengio, Asja Fischer, Aaron C. Courville

PDF

Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning William Lotter, Gabriel Kreiman, David D. Cox

PDF

Deep Probabilistic Programming Dustin Tran, Matthew D. Hoffman, Rif A. Saurous, Eugene Brevdo, Kevin Murphy, David M. Blei

PDF

Deep Variational Bayes Filters: Unsupervised Learning of State Space Models from Raw Data Maximilian Karl, Maximilian Soelch, Justin Bayer, Patrick van der Smagt

PDF

Deep Variational Information Bottleneck Alexander A. Alemi, Ian Fischer, Joshua V. Dillon, Kevin Murphy

PDF

DeepCloak: Masking Deep Neural Network Models for Robustness Against Adversarial Samples Ji Gao, Beilun Wang, Zeming Lin, Weilin Xu, Yanjun Qi

PDF

DeepCoder: Learning to Write Programs Matej Balog, Alexander L. Gaunt, Marc Brockschmidt, Sebastian Nowozin, Daniel Tarlow

PDF

DeepDSL: A Compilation-Based Domain-Specific Language for Deep Learning Tian Zhao, Xiaobing Huang, Yu Cao

PDF

Delving into Adversarial Attacks on Deep Policies Jernej Kos, Dawn Song

PDF

Delving into Transferable Adversarial Examples and Black-Box Attacks Yanpei Liu, Xinyun Chen, Chang Liu, Dawn Song

PDF

Density Estimation Using Real NVP Laurent Dinh, Jascha Sohl-Dickstein, Samy Bengio

PDF

Designing Neural Network Architectures Using Reinforcement Learning Bowen Baker, Otkrist Gupta, Nikhil Naik, Ramesh Raskar

PDF

Development of JavaScript-Based Deep Learning Platform and Application to Distributed Training Masatoshi Hidaka, Ken Miura, Tatsuya Harada

PDF

Dialogue Learning with Human-in-the-Loop Jiwei Li, Alexander H. Miller, Sumit Chopra, Marc'Aurelio Ranzato, Jason Weston

PDF

Diet Networks: Thin Parameters for Fat Genomics Adriana Romero, Pierre Luc Carrier, Akram Erraqabi, Tristan Sylvain, Alex Auvolat, Etienne Dejoie, Marc-André Legault, Marie-Pierre Dubé, Julie G. Hussin, Yoshua Bengio

PDF

Discovering Objects and Their Relations from Entangled Scene Representations David Raposo, Adam Santoro, David G. T. Barrett, Razvan Pascanu, Tim Lillicrap, Peter W. Battaglia

PDF

Discrete Variational Autoencoders Jason Tyler Rolfe

PDF

Distributed Second-Order Optimization Using Kronecker-Factored Approximations Jimmy Ba, Roger B. Grosse, James Martens

PDF

Do Deep Convolutional Nets Really Need to Be Deep and Convolutional? Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Özlem Aslan, Shengjie Wang, Abdelrahman Mohamed, Matthai Philipose, Matthew Richardson, Rich Caruana

PDF

Dropout with Expectation-Linear Regularization Xuezhe Ma, Yingkai Gao, Zhiting Hu, Yaoliang Yu, Yuntian Deng, Eduard H. Hovy

PDF

DSD: Dense-Sparse-Dense Training for Deep Neural Networks Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally

PDF

Dynamic Coattention Networks for Question Answering Caiming Xiong, Victor Zhong, Richard Socher

PDF

Early Methods for Detecting Adversarial Images Dan Hendrycks, Kevin Gimpel

PDF

Efficient Representation of Low-Dimensional Manifolds Using Deep Networks Ronen Basri, David W. Jacobs

PDF

Efficient Sparse-Winograd Convolutional Neural Networks Xingyu Liu, Song Han, Huizi Mao, William J. Dally

PDF

Efficient Variational Bayesian Neural Network Ensembles for Outlier Detection Nick Pawlowski, Miguel Jaques, Ben Glocker

PDF

Efficient Vector Representation for Documents Through Corruption Minmin Chen

PDF

Embracing Data Abundance Ondrej Bajgar, Rudolf Kadlec, Jan Kleindienst

PDF

Emergence of Foveal Image Sampling from Learning to Attend in Visual Scenes Brian Cheung, Eric Weiss, Bruno A. Olshausen

PDF

Emergence of Language with Multi-Agent Games: Learning to Communicate with Sequences of Symbols Serhii Havrylov, Ivan Titov

PDF

Encoding and Decoding Representations with Sum- and Max-Product Networks Antonio Vergari, Robert Peharz, Nicola Di Mauro, Floriana Esposito

PDF

End-to-End Optimized Image Compression Johannes Ballé, Valero Laparra, Eero P. Simoncelli

PDF

Energy-Based Generative Adversarial Networks Junbo Jake Zhao, Michaël Mathieu, Yann LeCun

PDF

Entropy-SGD: Biasing Gradient Descent into Wide Valleys Pratik Chaudhari, Anna Choromanska, Stefano Soatto, Yann LeCun, Carlo Baldassi, Christian Borgs, Jennifer T. Chayes, Levent Sagun, Riccardo Zecchina

PDF

Episodic Exploration for Deep Deterministic Policies for StarCraft Micromanagement Nicolas Usunier, Gabriel Synnaeve, Zeming Lin, Soumith Chintala

PDF

EPOpt: Learning Robust Neural Network Policies Using Model Ensembles Aravind Rajeswaran, Sarvjeet Ghotra, Balaraman Ravindran, Sergey Levine

PDF

Explaining the Learning Dynamics of Direct Feedback Alignment Justin Gilmer, Colin Raffel, Samuel S. Schoenholz, Maithra Raghu, Jascha Sohl-Dickstein

PDF

Exploring Sparsity in Recurrent Neural Networks Sharan Narang, Greg Diamos, Shubho Sengupta, Erich Elsen

PDF

Exponential Machines Alexander Novikov, Mikhail Trofimov, Ivan V. Oseledets

PDF

Extrapolation and Learning Equations Georg Martius, Christoph H. Lampert

PDF

Factorization Tricks for LSTM Networks Oleksii Kuchaiev, Boris Ginsburg

PDF

Fast Adaptation in Generative Models with Generative Matching Networks Sergey Bartunov, Dmitry P. Vetrov

PDF

Fast Chirplet Transform Injects Priors in Deep Learning of Animal Calls and Speech Hervé Glotin, Julien Ricard, Randall Balestriero

PDF

Fast Generation for Convolutional Autoregressive Models Prajit Ramachandran, Tom Le Paine, Pooya Khorrami, Mohammad Babaeizadeh, Shiyu Chang, Yang Zhang, Mark A. Hasegawa-Johnson, Roy H. Campbell, Thomas S. Huang

PDF

Faster CNNs with Direct Sparse Convolutions and Guided Pruning Jongsoo Park, Sheng R. Li, Wei Wen, Ping Tak Peter Tang, Hai Li, Yiran Chen, Pradeep Dubey

PDF

Filter Shaping for Convolutional Neural Networks Xingyi Li, Fuxin Li, Xiaoli Z. Fern, Raviv Raich

PDF

Fine-Grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg

PDF

Forced to Learn: Discovering Disentangled Representations Without Exhaustive Labels Alexey Romanov, Anna Rumshisky

PDF

FractalNet: Ultra-Deep Neural Networks Without Residuals Gustav Larsson, Michael Maire, Gregory Shakhnarovich

PDF

Frustratingly Short Attention Spans in Neural Language Modeling Michal Daniluk, Tim Rocktäschel, Johannes Welbl, Sebastian Riedel

PDF

Gated Multimodal Units for Information Fusion John Edison Arevalo Ovalle, Thamar Solorio, Manuel Montes-y-Gómez, Fabio A. González

PDF

Generalizable Features from Unsupervised Learning Mehdi Mirza, Aaron C. Courville, Yoshua Bengio

PDF

Generalizing Skills with Semi-Supervised Reinforcement Learning Chelsea Finn, Tianhe Yu, Justin Fu, Pieter Abbeel, Sergey Levine

PDF

Generative Adversarial Learning of Markov Chains Jiaming Song, Shengjia Zhao, Stefano Ermon

PDF

Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy Danica J. Sutherland, Hsiao-Yu Tung, Heiko Strathmann, Soumyajit De, Aaditya Ramdas, Alexander J. Smola, Arthur Gretton

PDF

Generative Multi-Adversarial Networks Ishan P. Durugkar, Ian Gemp, Sridhar Mahadevan

PDF

Geometry of Polysemy Jiaqi Mu, Suma Bhat, Pramod Viswanath

PDF

Hadamard Product for Low-Rank Bilinear Pooling Jin-Hwa Kim, Kyoung Woon On, Woosang Lim, Jeonghee Kim, Jung-Woo Ha, Byoung-Tak Zhang

PDF

Hierarchical Multiscale Recurrent Neural Networks Junyoung Chung, Sungjin Ahn, Yoshua Bengio

PDF

Highway and Residual Networks Learn Unrolled Iterative Estimation Klaus Greff, Rupesh Kumar Srivastava, Jürgen Schmidhuber

PDF

HolStep: A Machine Learning Dataset for Higher-Order Logic Theorem Proving Cezary Kaliszyk, François Chollet, Christian Szegedy

PDF

Hyperband: Bandit-Based Configuration Evaluation for Hyperparameter Optimization Lisha Li, Kevin G. Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, Ameet Talwalkar

PDF

HyperNetworks David Ha, Andrew M. Dai, Quoc V. Le

PDF

Identity Matters in Deep Learning Moritz Hardt, Tengyu Ma

PDF

Improving Generative Adversarial Networks with Denoising Feature Matching David Warde-Farley, Yoshua Bengio

PDF

Improving Neural Language Models with a Continuous Cache Edouard Grave, Armand Joulin, Nicolas Usunier

PDF

Improving Policy Gradient by Exploring Under-Appreciated Rewards Ofir Nachum, Mohammad Norouzi, Dale Schuurmans

PDF

Incorporating Long-Range Consistency in CNN-Based Texture Generation Guillaume Berger, Roland Memisevic

PDF

Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights Aojun Zhou, Anbang Yao, Yiwen Guo, Lin Xu, Yurong Chen

PDF

Inductive Bias of Deep Convolutional Networks Through Pooling Geometry Nadav Cohen, Amnon Shashua

PDF

Intelligent Synapses for Multi-Task and Transfer Learning Ben Poole, Friedemann Zenke, Surya Ganguli

PDF

Introspection: Accelerating Neural Network Training by Learning Weight Evolution Abhishek Sinha, Aahitagni Mukherjee, Mausoom Sarkar, Balaji Krishnamurthy

PDF

Joint Embeddings of Scene Graphs and Images Eugene Belilovsky, Matthew B. Blaschko, Jamie Ryan Kiros, Raquel Urtasun, Richard S. Zemel

PDF

Joint Multimodal Learning with Deep Generative Models Masahiro Suzuki, Kotaro Nakayama, Yutaka Matsuo

PDF

Joint Training of Ratings and Reviews with Recurrent Recommender Networks Chao-Yuan Wu, Amr Ahmed, Alex Beutel, Alexander J. Smola

PDF

Latent Sequence Decompositions William Chan, Yu Zhang, Quoc V. Le, Navdeep Jaitly

PDF

Learning a Natural Language Interface with Neural Programmer Arvind Neelakantan, Quoc V. Le, Martín Abadi, Andrew McCallum, Dario Amodei

PDF

Learning Algorithms for Active Learning Philip Bachman, Alessandro Sordoni, Adam Trischler

PDF

Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft

PDF

Learning Curve Prediction with Bayesian Neural Networks Aaron Klein, Stefan Falkner, Jost Tobias Springenberg, Frank Hutter

PDF

Learning End-to-End Goal-Oriented Dialog Antoine Bordes, Y-Lan Boureau, Jason Weston

PDF

Learning Features of Music from Scratch John Thickstun, Zaïd Harchaoui, Sham M. Kakade

PDF

Learning Graphical State Transitions Daniel D. Johnson

PDF

Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning Abhishek Gupta, Coline Devin, Yuxuan Liu, Pieter Abbeel, Sergey Levine

PDF

Learning Invariant Representations of Planar Curves Gautam Pai, Aaron Wetzler, Ron Kimmel

PDF

Learning Recurrent Representations for Hierarchical Behavior Modeling Eyrun Eyjolfsdottir, Kristin Branson, Yisong Yue, Pietro Perona

PDF

Learning Through Dialogue Interactions by Asking Questions Jiwei Li, Alexander H. Miller, Sumit Chopra, Marc'Aurelio Ranzato, Jason Weston

PDF

Learning to Act by Predicting the Future Alexey Dosovitskiy, Vladlen Koltun

PDF

Learning to Compose Words into Sentences with Reinforcement Learning Dani Yogatama, Phil Blunsom, Chris Dyer, Edward Grefenstette, Wang Ling

PDF

Learning to Discover Sparse Graphical Models Eugene Belilovsky, Kyle Kastner, Gaël Varoquaux, Matthew B. Blaschko

PDF

Learning to Generate Samples from Noise Through Infusion Training Florian Bordes, Sina Honari, Pascal Vincent

PDF

Learning to Navigate in Complex Environments Piotr Mirowski, Razvan Pascanu, Fabio Viola, Hubert Soyer, Andy Ballard, Andrea Banino, Misha Denil, Ross Goroshin, Laurent Sifre, Koray Kavukcuoglu, Dharshan Kumaran, Raia Hadsell

PDF

Learning to Optimize Ke Li, Jitendra Malik

PDF

Learning to Perform Physics Experiments via Deep Reinforcement Learning Misha Denil, Pulkit Agrawal, Tejas D. Kulkarni, Tom Erez, Peter W. Battaglia, Nando de Freitas

PDF

Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening Frank S. He, Yang Liu, Alexander G. Schwing, Jian Peng

PDF

Learning to Query, Reason, and Answer Questions on Ambiguous Texts Xiaoxiao Guo, Tim Klinger, Clemens Rosenbaum, Joseph P. Bigus, Murray Campbell, Ban Kawas, Kartik Talamadupula, Gerry Tesauro, Satinder Singh

PDF

Learning to Remember Rare Events Lukasz Kaiser, Ofir Nachum, Aurko Roy, Samy Bengio

PDF

Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning Sahil Sharma, Aravind S. Lakshminarayanan, Balaraman Ravindran

PDF

Learning to Superoptimize Programs Rudy Bunel, Alban Desmaison, M. Pawan Kumar, Philip H. S. Torr, Pushmeet Kohli

PDF

Learning Visual Servoing with Deep Features and Fitted Q-Iteration Alex X. Lee, Sergey Levine, Pieter Abbeel

PDF

Lie-Access Neural Turing Machines Greg Yang, Alexander M. Rush

PDF

Lifelong Perceptual Programming by Example Alexander L. Gaunt, Marc Brockschmidt, Nate Kushman, Daniel Tarlow

PDF

Loss Is Its Own Reward: Self-Supervision for Reinforcement Learning Evan Shelhamer, Parsa Mahmoudieh, Max Argus, Trevor Darrell

PDF

Loss-Aware Binarization of Deep Networks Lu Hou, Quanming Yao, James T. Kwok

PDF

Lossy Image Compression with Compressive Autoencoders Lucas Theis, Wenzhe Shi, Andrew Cunningham, Ferenc Huszár

PDF

LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation Jianwei Yang, Anitha Kannan, Dhruv Batra, Devi Parikh

PDF

Machine Comprehension Using Match-LSTM and Answer Pointer Shuohang Wang, Jing Jiang

PDF

Making Neural Programming Architectures Generalize via Recursion Jonathon Cai, Richard Shin, Dawn Song

PDF

Maximum Entropy Flow Networks Gabriel Loaiza-Ganem, Yuanjun Gao, John P. Cunningham

PDF

Mean Teachers Are Better Role Models: Weight-Averaged Consistency Targets Improve Semi-Supervised Deep Learning Results Antti Tarvainen, Harri Valpola

PDF

Memory Matching Networks for Genomic Sequence Classification Jack Lanchantin, Ritambhara Singh, Yanjun Qi

PDF

Metacontrol for Adaptive Imagination-Based Optimization Jessica B. Hamrick, Andrew J. Ballard, Razvan Pascanu, Oriol Vinyals, Nicolas Heess, Peter W. Battaglia

PDF

Mode Regularized Generative Adversarial Networks Tong Che, Yanran Li, Athul Paul Jacob, Yoshua Bengio, Wenjie Li

PDF

Mollifying Networks Çaglar Gülçehre, Marcin Moczulski, Francesco Visin, Yoshua Bengio

PDF

Multi-Agent Cooperation and the Emergence of (Natural) Language Angeliki Lazaridou, Alexander Peysakhovich, Marco Baroni

PDF

Multi-View Recurrent Neural Acoustic Word Embeddings Wanjia He, Weiran Wang, Karen Livescu

PDF

Multilayer Recurrent Network Models of Primate Retinal Ganglion Cell Responses Eleanor Batty, Josh Merel, Nora Brackbill, Alexander Heitman, Alexander Sher, Alan M. Litke, E. J. Chichilnisky, Liam Paninski

PDF

Multiplicative LSTM for Sequence Modelling Ben Krause, Iain Murray, Steve Renals, Liang Lu

PDF

Natural Language Generation in Dialogue Using Lexicalized and Delexicalized Data Shikhar Sharma, Jing He, Kaheer Suleman, Hannes Schulz, Philip Bachman

PDF

Neu0 Karthik R, Aman Achpal, Vinayshekhar Bk, Anantharaman Palacode Narayana Iyer, Channa Bankapur

PDF

Neural Architecture Search with Reinforcement Learning Barret Zoph, Quoc V. Le

PDF

Neural Combinatorial Optimization with Reinforcement Learning Irwan Bello, Hieu Pham, Quoc V. Le, Mohammad Norouzi, Samy Bengio

PDF

Neural Expectation Maximization Klaus Greff, Sjoerd van Steenkiste, Jürgen Schmidhuber

PDF

Neural Functional Programming John K. Feser, Marc Brockschmidt, Alexander L. Gaunt, Daniel Tarlow

PDF

Neural Photo Editing with Introspective Adversarial Networks Andrew Brock, Theodore Lim, James M. Ritchie, Nick Weston

PDF

Neural Program Lattices Chengtao Li, Daniel Tarlow, Alexander L. Gaunt, Marc Brockschmidt, Nate Kushman

PDF

Neuro-Symbolic Program Synthesis Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli

PDF

Neurogenesis-Inspired Dictionary Learning: Online Model Adaption in a Changing World Sahil Garg, Irina Rish, Guillermo A. Cecchi, Aurélie C. Lozano

PDF

Nonparametric Neural Networks George Philipp, Jaime G. Carbonell

PDF

Normalizing the Normalizers: Comparing and Extending Network Normalization Schemes Mengye Ren, Renjie Liao, Raquel Urtasun, Fabian H. Sinz, Richard S. Zemel

PDF

Offline Bilingual Word Vectors, Orthogonal Transformations and the Inverted SoftMax Samuel L. Smith, David H. P. Turban, Steven Hamblin, Nils Y. Hammerla

PDF

On Detecting Adversarial Perturbations Jan Hendrik Metzen, Tim Genewein, Volker Fischer, Bastian Bischoff

PDF

On Hyperparameter Optimization in Learning Systems Luca Franceschi, Michele Donini, Paolo Frasconi, Massimiliano Pontil

PDF

On Improving the Numerical Stability of Winograd Convolutions Kevin Vincent, Kevin Stephano, Michael A. Frumkin, Boris Ginsburg, Julien Demouth

PDF

On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima Nitish Shirish Keskar, Dheevatsa Mudigere, Jorge Nocedal, Mikhail Smelyanskiy, Ping Tak Peter Tang

PDF

On Robust Concepts and Small Neural Nets Amit Deshpande, Sushrut Karmalkar

PDF

On the Quantitative Analysis of Decoder-Based Generative Models Yuhuai Wu, Yuri Burda, Ruslan Salakhutdinov, Roger B. Grosse

PDF

Online Bayesian Transfer Learning for Sequential Data Modeling Priyank Jaini, Zhitang Chen, Pablo Carbajal, Edith Law, Laura Middleton, Kayla Regan, Mike Schaekermann, George Trimponias, James Tung, Pascal Poupart

PDF

Online Multi-Task Learning Using Active Sampling Sahil Sharma, Balaraman Ravindran

PDF

Online Structure Learning for Sum-Product Networks with Gaussian Leaves Wilson Hsu, Agastya Kalra, Pascal Poupart

PDF

Optimal Binary Autoencoding with Pairwise Correlations Akshay Balsubramani

PDF

Optimization as a Model for Few-Shot Learning Sachin Ravi, Hugo Larochelle

PDF

Out-of-Class Novelty Generation: An Experimental Foundation Mehdi Cherti, Balázs Kégl, Akin Kazakçi

PDF

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc V. Le, Geoffrey E. Hinton, Jeff Dean

PDF

Paleo: A Performance Model for Deep Neural Networks Hang Qi, Evan Randall Sparks, Ameet Talwalkar

PDF

Particle Value Functions Chris J. Maddison, Dieterich Lawson, George Tucker, Nicolas Heess, Arnaud Doucet, Andriy Mnih, Yee Whye Teh

PDF

Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer Sergey Zagoruyko, Nikos Komodakis

PDF

Perception Updating Networks: On Architectural Constraints for Interpretable Video Generative Models Eder Santana, José C. Príncipe

PDF

Performance Guarantees for Transferring Representations Daniel McNamara, Maria-Florina Balcan

PDF

PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications Tim Salimans, Andrej Karpathy, Xi Chen, Diederik P. Kingma

PDF

PixelVAE: A Latent Variable Model for Natural Images Ishaan Gulrajani, Kundan Kumar, Faruk Ahmed, Adrien Ali Taïga, Francesco Visin, David Vázquez, Aaron C. Courville

PDF

Pl@ntNet App in the Era of Deep Learning Antoine Affouard, Hervé Goëau, Pierre Bonnet, Jean-Christophe Lombardo, Alexis Joly

PDF

Playing SNES in the Retro Learning Environment Nadav Bhonker, Shai Rozenberg, Itay Hubara

PDF

Pointer Sentinel Mixture Models Stephen Merity, Caiming Xiong, James Bradbury, Richard Socher

PDF

Precise Recovery of Latent Vectors from Generative Adversarial Networks Zachary C. Lipton, Subarna Tripathi

PDF

Predicting Medications from Diagnostic Codes with Recurrent Neural Networks Jacek M. Bajor, Thomas A. Lasko

PDF

Program Synthesis for Character Level Language Modeling Pavol Bielik, Veselin Raychev, Martin T. Vechev

PDF

Programming with a Differentiable Forth Interpreter Matko Bosnjak, Tim Rocktäschel, Jason Naradowsky, Sebastian Riedel

PDF

Pruning Convolutional Neural Networks for Resource Efficient Inference Pavlo Molchanov, Stephen Tyree, Tero Karras, Timo Aila, Jan Kautz

PDF

Pruning Filters for Efficient ConvNets Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, Hans Peter Graf

PDF

Q-Prop: Sample-Efficient Policy Gradient with an Off-Policy Critic Shixiang Gu, Timothy P. Lillicrap, Zoubin Ghahramani, Richard E. Turner, Sergey Levine

PDF

Quasi-Recurrent Neural Networks James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher

PDF

Query-Reduction Networks for Question Answering Min Joon Seo, Sewon Min, Ali Farhadi, Hannaneh Hajishirzi

PDF

Reasoning with Memory Augmented Neural Networks for Language Comprehension Tsendsuren Munkhdalai, Hong Yu

PDF

REBAR: Low-Variance, Unbiased Gradient Estimates for Discrete Latent Variable Models George Tucker, Andriy Mnih, Chris J. Maddison, Jascha Sohl-Dickstein

PDF

Recurrent Batch Normalization Tim Cooijmans, Nicolas Ballas, César Laurent, Çaglar Gülçehre, Aaron C. Courville

PDF

Recurrent Environment Simulators Silvia Chiappa, Sébastien Racanière, Daan Wierstra, Shakir Mohamed

PDF

Recurrent Hidden Semi-Markov Model Hanjun Dai, Bo Dai, Yan-Ming Zhang, Shuang Li, Le Song

PDF

Recurrent Mixture Density Network for Spatiotemporal Visual Attention Loris Bazzani, Hugo Larochelle, Lorenzo Torresani

PDF

Recurrent Normalization Propagation César Laurent, Nicolas Ballas, Pascal Vincent

PDF

Regularizing CNNs with Locally Constrained Decorrelations Pau Rodríguez, Jordi Gonzàlez, Guillem Cucurull, Josep M. Gonfaus, F. Xavier Roca

PDF

Regularizing Neural Networks by Penalizing Confident Output Distributions Gabriel Pereyra, George Tucker, Jan Chorowski, Lukasz Kaiser, Geoffrey E. Hinton

PDF

Reinforcement Learning Through Asynchronous Advantage Actor-Critic on a GPU Mohammad Babaeizadeh, Iuri Frosio, Stephen Tyree, Jason Clemons, Jan Kautz

PDF

Reinforcement Learning with Unsupervised Auxiliary Tasks Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, Koray Kavukcuoglu

PDF

Reinterpreting Importance-Weighted Autoencoders Chris Cremer, Quaid Morris, David Duvenaud

PDF

RenderGAN: Generating Realistic Labeled Data Leon Sixt, Benjamin Wild, Tim Landgraf

PDF

Restricted Boltzmann Machines Provide an Accurate Metric for Retinal Responses to Visual Stimuli Christophe Gardella, Olivier Marre, Thierry Mora

PDF

Revisiting Batch Normalization for Practical Domain Adaptation Yanghao Li, Naiyan Wang, Jianping Shi, Jiaying Liu, Xiaodi Hou

PDF

Revisiting Classifier Two-Sample Tests David Lopez-Paz, Maxime Oquab

PDF

Robustness to Adversarial Examples Through an Ensemble of Specialists Mahdieh Abbasi, Christian Gagné

PDF

Sample Efficient Actor-Critic with Experience Replay Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Rémi Munos, Koray Kavukcuoglu, Nando de Freitas

PDF

SampleRNN: An Unconditional End-to-End Neural Audio Generation Model Soroush Mehri, Kundan Kumar, Ishaan Gulrajani, Rithesh Kumar, Shubham Jain, Jose Sotelo, Aaron C. Courville, Yoshua Bengio

PDF

Semantic Embeddings for Program Behaviour Patterns Alexander Chistyakov, Ekaterina Lobacheva, Arseny Kuznetsov, Alexey Romanenko

PDF

Semi-Supervised Classification with Graph Convolutional Networks Thomas N. Kipf, Max Welling

PDF

Semi-Supervised Deep Learning by Metric Embedding Elad Hoffer, Nir Ailon

PDF

Semi-Supervised Knowledge Transfer for Deep Learning from Private Training Data Nicolas Papernot, Martín Abadi, Úlfar Erlingsson, Ian J. Goodfellow, Kunal Talwar

PDF

SGDR: Stochastic Gradient Descent with Warm Restarts Ilya Loshchilov, Frank Hutter

PDF

Shake-Shake Regularization of 3-Branch Residual Networks Xavier Gastaldi

PDF

Short and Deep: Sketching and Neural Networks Amit Daniely, Nevena Lazic, Yoram Singer, Kunal Talwar

PDF

Sigma Delta Quantized Networks Peter O'Connor, Max Welling

PDF

Snapshot Ensembles: Train 1, Get M for Free Gao Huang, Yixuan Li, Geoff Pleiss, Zhuang Liu, John E. Hopcroft, Kilian Q. Weinberger

PDF

Soft Weight-Sharing for Neural Network Compression Karen Ullrich, Edward Meeds, Max Welling

PDF

Song from PI: A Musically Plausible Network for Pop Music Generation Hang Chu, Raquel Urtasun, Sanja Fidler

PDF

Sparsely-Connected Neural Networks: Towards Efficient VLSI Implementation of Deep Neural Networks Arash Ardakani, Carlo Condo, Warren J. Gross

PDF

Steerable CNNs Taco S. Cohen, Max Welling

PDF

Stick-Breaking Variational Autoencoders Eric T. Nalisnick, Padhraic Smyth

PDF

Stochastic Neural Networks for Hierarchical Reinforcement Learning Carlos Florensa, Yan Duan, Pieter Abbeel

PDF

Structured Attention Networks Yoon Kim, Carl Denton, Luong Hoang, Alexander M. Rush

PDF

Support Regularized Sparse Coding and Its Fast Encoder Yingzhen Yang, Jiahui Yu, Pushmeet Kohli, Jianchao Yang, Thomas S. Huang

PDF

Symmetry-Breaking Convergence Analysis of Certain Two-Layered Neural Networks with ReLU Nonlinearity Yuandong Tian

PDF

Synthetic Gradient Methods with Virtual Forward-Backward Networks Takeru Miyato, Daisuke Okanohara, Shin-ichi Maeda, Masanori Koyama

PDF

Tactics of Adversarial Attack on Deep Reinforcement Learning Agents Yen-Chen Lin, Zhang-Wei Hong, Yuan-Hong Liao, Meng-Li Shih, Ming-Yu Liu, Min Sun

PDF

Temporal Ensembling for Semi-Supervised Learning Samuli Laine, Timo Aila

PDF

The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables Chris J. Maddison, Andriy Mnih, Yee Whye Teh

PDF

The Effectiveness of Transfer Learning in Electronic Health Records Data Sébastien Dubois, Nathanael Romano, Kenneth Jung, Nigam Shah, David C. Kale

PDF

The High-Dimensional Geometry of Binary Neural Networks Alexander G. Anderson, Cory P. Berg

PDF

The Neural Noisy Channel Lei Yu, Phil Blunsom, Chris Dyer, Edward Grefenstette, Tomás Kociský

PDF

The Preimage of Rectifier Network Activities Stefan Carlsson, Hossein Azizpour, Ali Sharif Razavian, Josephine Sullivan, Kevin Smith

PDF

Third Person Imitation Learning Bradly C. Stadie, Pieter Abbeel, Ilya Sutskever

PDF

Tighter Bounds Lead to Improved Classifiers Nicolas Le Roux

PDF

TopicRNN: A Recurrent Neural Network with Long-Range Semantic Dependency Adji B. Dieng, Chong Wang, Jianfeng Gao, John W. Paisley

PDF

Topology and Geometry of Half-Rectified Network Optimization C. Daniel Freeman, Joan Bruna

PDF

Towards "AlphaChem": Chemical Synthesis Planning with Tree Search and Deep Neural Network Policies Marwin H. S. Segler, Mike Preuss, Mark P. Waller

PDF

Towards a Neural Statistician Harrison Edwards, Amos J. Storkey

PDF

Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses Ryan Lowe, Michael Noseworthy, Iulian Vlad Serban, Nicolas Angelard-Gontier, Yoshua Bengio, Joelle Pineau

PDF

Towards Deep Interpretability (MUS-ROVER II): Learning Hierarchical Representations of Tonal Music Haizi Yu, Lav R. Varshney

PDF

Towards Principled Methods for Training Generative Adversarial Networks Martín Arjovsky, Léon Bottou

PDF

Towards the Limit of Network Quantization Yoojin Choi, Mostafa El-Khamy, Jungwon Lee

PDF

Trace Norm Regularised Deep Multi-Task Learning Yongxin Yang, Timothy M. Hospedales

PDF

Tracking the World State with Recurrent Entity Networks Mikael Henaff, Jason Weston, Arthur Szlam, Antoine Bordes, Yann LeCun

PDF

Trained Ternary Quantization Chenzhuo Zhu, Song Han, Huizi Mao, William J. Dally

PDF

Training a Subsampling Mechanism in Expectation Colin Raffel, Dieterich Lawson

PDF

Training Agent for First-Person Shooter Game with Actor-Critic Curriculum Learning Yuxin Wu, Yuandong Tian

PDF

Training Compressed Fully-Connected Networks with a Density-Diversity Penalty Shengjie Wang, Haoran Cai, Jeff A. Bilmes, William S. Noble

PDF

Training Deep Neural-Networks Using a Noise Adaptation Layer Jacob Goldberger, Ehud Ben-Reuven

PDF

Training Triplet Networks with GAN Maciej Zieba, Lei Wang

PDF

Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks Zhilin Yang, Ruslan Salakhutdinov, William W. Cohen

PDF

Transfer of View-Manifold Learning to Similarity Perception of Novel Objects Xingyu Lin, Hao Wang, Zhihao Li, Yimeng Zhang, Alan L. Yuille, Tai Sing Lee

PDF

Transferring Knowledge to Smaller Network with Class-Distance Loss Seungwook Kim, Hyo-Eun Kim

PDF

Tree-Structured Decoding with Doubly-Recurrent Neural Networks David Alvarez-Melis, Tommi S. Jaakkola

PDF

Trusting SVM for Piecewise Linear CNNs Leonard Berrada, Andrew Zisserman, M. Pawan Kumar

PDF

Tuning Recurrent Neural Networks with Reinforcement Learning Natasha Jaques, Shixiang Gu, Richard E. Turner, Douglas Eck

PDF

Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling Hakan Inan, Khashayar Khosravi, Richard Socher

PDF

Understanding Deep Learning Requires Rethinking Generalization Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, Oriol Vinyals

PDF

Understanding Intermediate Layers Using Linear Classifier Probes Guillaume Alain, Yoshua Bengio

PDF

Understanding Trainable Sparse Coding with Matrix Factorization Thomas Moreau, Joan Bruna

PDF

Unrolled Generative Adversarial Networks Luke Metz, Ben Poole, David Pfau, Jascha Sohl-Dickstein

PDF

Unseen Style Transfer Based on a Conditional Fast Style Transfer Network Keiji Yanai

PDF

Unsupervised and Scalable Algorithm for Learning Node Representations Tiago Pimentel, Adriano Veloso, Nivio Ziviani

PDF

Unsupervised Cross-Domain Image Generation Yaniv Taigman, Adam Polyak, Lior Wolf

PDF

Unsupervised Feature Learning for Audio Analysis Matthias Meyer, Jan Beutel, Lothar Thiele

PDF

Unsupervised Perceptual Rewards for Imitation Learning Pierre Sermanet, Kelvin Xu, Sergey Levine

PDF

Variable Computation in Recurrent Neural Networks Yacine Jernite, Edouard Grave, Armand Joulin, Tomás Mikolov

PDF

Variational Intrinsic Control Karol Gregor, Danilo Jimenez Rezende, Daan Wierstra

PDF

Variational Lossy Autoencoder Xi Chen, Diederik P. Kingma, Tim Salimans, Yan Duan, Prafulla Dhariwal, John Schulman, Ilya Sutskever, Pieter Abbeel

PDF

Variational Recurrent Adversarial Deep Domain Adaptation Sanjay Purushotham, Wilka Carvalho, Tanachat Nilanon, Yan Liu

PDF

Variational Reference Priors Eric T. Nalisnick, Padhraic Smyth

PDF

Visualizing Deep Neural Network Decisions: Prediction Difference Analysis Luisa M. Zintgraf, Taco S. Cohen, Tameem Adel, Max Welling

PDF

What Does It Take to Generate Natural Textures? Ivan Ustyuzhaninov, Wieland Brendel, Leon A. Gatys, Matthias Bethge

PDF

Why Deep Neural Networks for Function Approximation? Shiyu Liang, R. Srikant

PDF

Words or Characters? Fine-Grained Gating for Reading Comprehension Zhilin Yang, Bhuwan Dhingra, Ye Yuan, Junjie Hu, William W. Cohen, Ruslan Salakhutdinov

PDF

Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Aaron C. Courville, Christopher J. Pal

PDF