Clustering analysis of movement kinematics in reinforcement learning
Citation
Sidarta, A., Komar, J., & Ostry, D. J. (2021). Clustering analysis of movement kinematics in reinforcement learning. Journal of Neurophysiology. Advance online publication. https://doi.org/10.1152/jn.00229.2021
Abstract
Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors such as task demands and reward availability. In this study, we applied a clustering algorithm to examine how a change in the accuracy requirements of a task affected the choice of exploration over exploitation. Participants made reaching movements to an unseen target using a planar robot arm and received reward after each successful movement. For one group of participants, the width of the hidden target decreased after every other training block. For a second group, it remained constant. The clustering algorithm was applied to the kinematic data to characterize motor learning on a trial-to-trial basis as a sequence of movements, each belonging to one of the identified clusters. By the end of learning, movement trajectories across all participants converged primarily to a single cluster with the greatest number of successful trials. Within this analysis framework, we defined exploration and exploitation as types of behaviour in which two successive trajectories belong to different or similar clusters, respectively. The frequency of each mode of behaviour was evaluated over the course of learning. When the target width was reduced, participants used a greater variety of clusters and displayed more exploration than exploitation. Excessive exploration relative to exploitation was found to be detrimental to subsequent motor learning.
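The trial-to-trial definition in the abstract can be illustrated with a minimal sketch. It assumes each trial has already been assigned a cluster label (the abstract does not name the clustering algorithm, so none is implemented here), and it reads "similar clusters" as "the same cluster". The function names and the example labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def label_transitions(cluster_labels):
    """Label each pair of successive trials.

    A transition is "exploitation" when both trials fall in the same
    cluster and "exploration" when they fall in different clusters
    (assumption: "similar clusters" means identical cluster labels).
    """
    return [
        "exploitation" if prev == curr else "exploration"
        for prev, curr in zip(cluster_labels, cluster_labels[1:])
    ]


def mode_frequencies(cluster_labels):
    """Return the proportion of transitions of each behavioural mode."""
    transitions = label_transitions(cluster_labels)
    counts = Counter(transitions)
    total = len(transitions)
    return {
        mode: (counts[mode] / total if total else 0.0)
        for mode in ("exploration", "exploitation")
    }


# Hypothetical cluster labels for seven consecutive trials.
example_labels = [0, 2, 2, 1, 1, 1, 1]
print(label_transitions(example_labels))
print(mode_frequencies(example_labels))
```

Applying `mode_frequencies` separately to each training block would give one possible way to track how the balance between the two modes changes over the course of learning.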
Date Issued
2021
Journal
Journal of Neurophysiology
DOI
10.1152/jn.00229.2021