AI Blogs

Maximizing Learning Efficiency with Adequate Labels

Efficient Learning Using Sufficient Labels: Labels, Information, and Computation Authors: Shiyu Duan, Spencer Chang, Jose C. Principe; Published: 24(31):1−35, 2023. Abstract Supervised learning often requires a large amount of fully-labeled training data, which can be costly to obtain.…

Reinforcement Learning’s Capability in Discovering Stackelberg-Nash Equilibria in General-Sum Markov Games with Rational Followers Acting Myopically

Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopically Rational Followers? Authors: Han Zhong, Zhuoran Yang, Zhaoran Wang, Michael I. Jordan; Published: 24(35):1−52, 2023. Abstract This study focuses on multi-player general-sum Markov games with a designated leader…

Emphasizing Weightings in Off-Policy Actor-Critic Methods

Off-Policy Actor-Critic with Emphatic Weightings Authors: Eric Graves, Ehsan Imani, Raksha Kumaraswamy, Martha White; Published: 24(146):1−63, 2023. Abstract A variety of policy gradient algorithms have been developed for the on-policy setting based on the policy gradient theorem, which simplifies the gradient computation.…

Optimizing Stochastic Systems amidst Distributional Drift

Stochastic Optimization under Distributional Drift Authors: Joshua Cutler, Dmitriy Drusvyatskiy, Zaid Harchaoui; Published: 24(147):1−56, 2023. Abstract This study addresses the problem of minimizing a convex function that undergoes unknown and potentially stochastic changes, which may depend…