Sugiyama, Masashi
299 publications
ECCV
2024
Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning
NeurIPS
2024
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations
NeurIPS
2024
Test-Time Adaptation in Non-Stationary Environments via Adaptive Representation Alignment
NeurIPS
2023
Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation
NeurIPS
2023
Generalizing Importance Weighting to a Universal Solver for Distribution Shift Problems
NeurIPS
2023
On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective
NeurIPS
2022
Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks
NeurIPS
2022
Generalizing Consistent Multi-Class Classification with Rejection to Be Compatible with Arbitrary Losses
CVPR
2022
Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation
ACML
2022
Multi-Class Classification from Multiple Unlabeled Datasets with Partial Risk Regularization
AISTATS
2021
Γ-ABC: Outlier-Robust Approximate Bayesian Computation Based on a Robust Divergence Estimator
ICML
2021
CIFS: Improving Adversarial Robustness of CNNs via Channel-Wise Importance-Based Feature Selection
UAI
2021
Incorporating Causal Graphical Prior Knowledge into Predictive Modeling via Simple Data Augmentation
ICML
2021
Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization
NeurIPS
2021
Loss Function Based Second-Order Jensen Inequality and Its Application to Particle Variational Inference
ICML
2021
Mediated Uncoupled Learning: Learning Functions Without Direct Input-Output Correspondences
ICML
2021
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization
AISTATS
2020
Calibrated Surrogate Maximization of Linear-Fractional Utility in Binary Classification
ICML
2018
Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model
NeurIPS
2018
Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks
NeurIPS
2016
Theoretical Comparisons of Positive-Unlabeled Learning Against Positive-Negative Learning
AISTATS
2015
Direct Density-Derivative Estimation and Its Application in KL-Divergence Approximation
ECML-PKDD
2014
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
AISTATS
2014
Analysis of Empirical MAP and Empirical Partially Bayes: Can They Be Alternatives to Variational Bayes?
AISTATS
2014
Bias Reduction and Metric Learning for Nearest-Neighbor Estimation of Kullback-Leibler Divergence
MLJ
2014
Least-Squares Independence Regression for Non-Linear Causal Inference Under Non-Gaussian Noise
MLJ
2013
Computational Complexity of Kernel-Based Density-Ratio Estimation: A Condition Number Analysis
NeurIPS
2013
Global Solver and Its Efficient Approximation for Variational Bayesian Low-Rank Subspace Clustering
AISTATS
2012
Fast Learning Rate of Multiple Kernel Learning: Trade-Off Between Sparsity and Smoothness
ICML
2012
Semi-Supervised Learning of Class Balance Under Class-Prior Change by Distribution Matching
ACML
2011
Computationally Efficient Sufficient Dimension Reduction via Squared-Loss Mutual Information
NeurIPS
2011
Global Solution of Fully-Observed Variational Bayesian Matrix Factorization Is Column-Wise Independent
JMLR
2011
Super-Linear Convergence of Dual Augmented Lagrangian Algorithm for Sparsity Regularized Estimation
ECML-PKDD
2010
Feature Selection for Reinforcement Learning: Evaluating Implicit State-Reward Dependency via Conditional Mutual Information
AAAI
2008
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
NeurIPS
2008
Efficient Direct Density Ratio Estimation for Non-Stationarity Adaptation and Outlier Detection
ICML
2007
Asymptotic Bayesian Generalization Error When Training and Test Distributions Are Different
NeurIPS
2007
Direct Importance Estimation with Model Selection and Its Application to Covariate Shift Adaptation