Ishii, Shin

16 publications

TMLR 2025 Double Horizon Model-Based Policy Optimization Akihiro Kubo, Paavo Parmas, Shin Ishii
ICML 2025 Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation Kosuke Nakanishi, Akihiro Kubo, Yuji Yasui, Shin Ishii
NeurIPS 2023 Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning Sotetsu Koyamada, Shinri Okano, Soichiro Nishimori, Yu Murata, Keigo Habara, Haruka Kita, Shin Ishii
ICLR 2016 Distributional Smoothing by Virtual Adversarial Examples Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Ken Nakae, Shin Ishii
MLJ 2016 Sparse and Low-Rank Matrix Regularization for Learning Time-Varying Markov Networks Junichiro Hirayama, Aapo Hyvärinen, Shin Ishii
JMLR 2011 Generalized TD Learning Tsuyoshi Ueno, Shin-ichi Maeda, Motoaki Kawanabe, Shin Ishii
MLJ 2011 Ternary Bradley-Terry Model-Based Decoding for Multi-Class Classification and Its Extensions Takashi Takenouchi, Shin Ishii
ECML-PKDD 2009 Optimal Online Learning Procedures for Model-Free Policy Evaluation Tsuyoshi Ueno, Shin-ichi Maeda, Motoaki Kawanabe, Shin Ishii
ICML 2008 A Semiparametric Statistical Approach to Model-Free Policy Evaluation Tsuyoshi Ueno, Motoaki Kawanabe, Takeshi Mori, Shin-ichi Maeda, Shin Ishii
NeurIPS 2007 Heterogeneous Component Analysis Shigeyuki Oba, Motoaki Kawanabe, Klaus-Robert Müller, Shin Ishii
MLJ 2005 A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game Shin Ishii, Hajime Fujita, Masaoki Mitsutake, Tatsuya Yamazaki, Jun Matsuda, Yoichiro Matsuno
AAAI 2004 Reinforcement Learning for CPG-Driven Biped Robot Takeshi Mori, Yutaka Nakamura, Masa-aki Sato, Shin Ishii
NeCo 2001 Gaussian Process Approach to Spiking Neurons for Inhomogeneous Poisson Inputs Ken-ichi Amemori, Shin Ishii
NeCo 2000 -Opt Neural Approaches to Quadratic Assignment Problems Shin Ishii, Hirotaka Niitsuma
NeCo 2000 On-Line EM Algorithm for the Normalized Gaussian Network Masa-aki Sato, Shin Ishii
NeurIPS 1998 Reinforcement Learning Based on On-Line EM Algorithm Masa-aki Sato, Shin Ishii