Please use this identifier to cite or link to this item: http://hdl.handle.net/10497/23607
Title: 
Authors: 
Keywords: 
Exploration
Clustering
Human motor learning
Issue Date: 
2021
Citation: 
Sidarta, A., Komar, J., & Ostry, D. J. (2021). Clustering analysis of movement kinematics in reinforcement learning. Journal of Neurophysiology. Advance online publication. https://doi.org/10.1152/jn.00229.2021
Journal: 
Journal of Neurophysiology
Abstract: 
Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors such as task demands and reward availability. In this study, we applied a clustering algorithm to examine how a change in the accuracy requirements of a task affected the choice of exploration over exploitation. Participants made reaching movements to an unseen target using a planar robot arm and received reward after each successful movement. For one group of participants, the width of the hidden target decreased after every other training block. For a second group, it remained constant. The clustering algorithm was applied to the kinematic data to characterize motor learning on a trial-to-trial basis as a sequence of movements, each belonging to one of the identified clusters. By the end of learning, movement trajectories across all participants converged primarily to a single cluster with the greatest number of successful trials. Within this analysis framework, we defined exploration and exploitation as types of behaviour in which two successive trajectories belong to different or similar clusters, respectively. The frequency of each mode of behaviour was evaluated over the course learning. It was found that by reducing the target width, participants used a greater variety of different clusters and displayed more exploration than exploitation. Excessive exploration relative to exploitation was found to be detrimental to subsequent motor learning.
URI: 
ISSN: 
0022-3077 (print)
1522-1598 (online)
DOI: 
File Permission: 
Embargo_20230101
File Availability: 
With file
Appears in Collections:Journal Articles

Files in This Item:
File Description SizeFormat 
JN-2021-292021.pdf
  Until 2023-01-01
1.06 MBAdobe PDFUnder embargo until Jan 01, 2023
Show full item record

Page view(s)

23
checked on Jun 25, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.