Kappen, Hilbert J.

27 publications

JMLR 2020 Adaptive Smoothing for Path Integral Control Dominik Thalmeier, Hilbert J. Kappen, Simone Totaro, Vicenç Gómez
UAI 2014 Latent Kullback Leibler Control for Continuous-State Systems Using Probabilistic Graphical Models Takamitsu Matsubara, Vicenç Gómez, Hilbert J. Kappen
ECML-PKDD 2014 Policy Search for Path Integral Control Vicenç Gómez, Hilbert J. Kappen, Jan Peters, Gerhard Neumann
MLJ 2014 The Variational Garrote Hilbert J. Kappen, Vicenç Gómez
MLJ 2013 Minimax PAC Bounds on the Sample Complexity of Reinforcement Learning with a Generative Model Mohammad Gheshlaghi Azar, Rémi Munos, Hilbert J. Kappen
JMLR 2012 Dynamic Policy Programming Mohammad Gheshlaghi Azar, Vicenç Gómez, Hilbert J. Kappen
MLJ 2012 Optimal Control as a Graphical Model Inference Problem Hilbert J. Kappen, Vicenç Gómez, Manfred Opper
NeurIPS 2011 Speedy Q-Learning Mohammad Ghavamzadeh, Hilbert J. Kappen, Mohammad G. Azar, Rémi Munos
JMLR 2010 Approximate Inference on Planar Graphs Using Loop Calculus and Belief Propagation Vicenç Gómez, Hilbert J. Kappen, Michael Chertkov
UAI 2010 Risk Sensitive Path Integral Control Bart van den Broek, Wim Wiegerinck, Hilbert J. Kappen
UAI 2009 Approximate Inference on Planar Graphs Using Loop Calculus and Belief Propagation Vicenç Gómez, Hilbert J. Kappen, Michael Chertkov
NeurIPS 2008 Bounds on Marginal Probability Distributions Joris M. Mooij, Hilbert J. Kappen
NeurIPS 2008 Self-Organization Using Synaptic Plasticity Vicenç Gómez, Andreas Kaltenbrunner, Vicente López, Hilbert J. Kappen
JMLR 2007 Loop Corrections for Approximate Inference on Factor Graphs Joris M. Mooij, Hilbert J. Kappen
JMLR 2007 Truncating the Loop Series Expansion for Belief Propagation Vicenç Gómez, Joris M. Mooij, Hilbert J. Kappen
UAI 2005 Sufficient Conditions for Convergence of Loopy Belief Propagation Joris M. Mooij, Hilbert J. Kappen
NeurIPS 2004 Validity Estimates for Loopy Belief Propagation on Binary Real-World Networks Joris M. Mooij, Hilbert J. Kappen
JAIR 2003 Bound Propagation Martijn A. R. Leisink, Hilbert J. Kappen
NeCo 2002 Associative Memory with Dynamic Synapses Lovorka Pantic, Joaquín J. Torres, Hilbert J. Kappen, Stan C. A. M. Gielen
UAI 2002 General Lower Bounds Based on Computer Generated Higher Order Expansions Martijn A. R. Leisink, Hilbert J. Kappen
NeCo 2001 A Tighter Bound for Graphical Models Martijn A. R. Leisink, Hilbert J. Kappen
NeurIPS 2001 Novel Iteration Schemes for the Cluster Variation Method Hilbert J. Kappen, Wim Wiegerinck
NeurIPS 2000 A Tighter Bound for Graphical Models Martijn A. R. Leisink, Hilbert J. Kappen
NeCo 2000 Nonmonotonic Generalization Bias of Gaussian Mixture Models Shotaro Akaho, Hilbert J. Kappen
NeurIPS 2000 Second Order Approximations for Probability Models Hilbert J. Kappen, Wim Wiegerinck
NeCo 1998 Efficient Learning in Boltzmann Machines Using Linear Response Theory Hilbert J. Kappen, Francisco de Borja Rodríguez Ortiz
NeurIPS 1997 Boltzmann Machine Learning Using Mean Field Theory and Linear Response Correction Hilbert J. Kappen, Francisco de Borja Rodríguez Ortiz