본문 바로가기 주메뉴 바로가기 하위메뉴 바로가기

Institute for Basic Science

Home Contact us Join Login

CENTER for GENOME ENGINEERING

  • About Center
    • Introduction
    • Organization
    • Location
    • Contact Us
    • Center CI
  • People
    • Director
    • Associate Director
    • Faculty
    • Young Scientist Fellow
    • Research Fellow
    • Researcher
    • Student
    • Staff
    • Visiting Scientist
  • Research
    • Advanced MR Neuroimaging Team
    • Neurovascular Imaging Team
    • Functional Neural Circuit Team
    • Human Functional Neuroimaging Team
  • Publication
    • Journals
    • Presentation
    • Patents
    • Books
  • Facilities
    • Equipment
    • Form & Info
  • Events
    • Seminar Series
    • Seminar
    • Undergraduate Internship
    • Conference
    • Calendar
  • News
    • News
    • Videos
    • Gallery
    • Brochure
전체메뉴
Home Events Seminar Print Page

Events

  • Seminar Series
  • Seminar
  • Undergraduate Internship
    • Summer Internship 2019
    • Summer Internship 2018
    • Summer Internship 2017
    • Summer Internship 2016
  • Conference
  • Calendar

Seminar

Contrastive introspection for brain-like credit assignment in reinforcement learning

Blake Richards, Ph.D.

November 3(Thu) - November 3(Thu), 2022

12PM

Online zoom (ID: 728-142-6028)

Neuro@noon Seminar


Date: 12pm, Thursday, Nov 3rd


Place: ZOOM

https://us02web.zoom.us/j/7281426028

회의 ID: 728 142 6028 (password: cnir)


Speaker: Blake Richards, Ph.D. (McGill University)


Title: Contrastive introspection for brain-like credit assignment in reinforcement learning

 

Abstract: Reinforcement learning (RL) algorithms have achieved notable success in recent years, but still struggle with fundamental issues in long-term credit assignment. It remains difficult to learn in situations where success is contingent upon multiple critical steps that are distant in time from each other and from a sparse reward; as is often the case in real life. Moreover, how RL algorithms assign credit in these difficult situations is typically not coded in a way that can rapidly generalize to new situations. Here, we present a brain-inspired approach using offline contrastive learning, which we call contrastive introspection (ConSpec), that can be added to any existing RL algorithm and addresses both issues. In ConSpec, a contrastive loss is used during offline replay to identify invariances among successful episodes. This takes advantage of the fact that it is easier to retrospectively identify the small set of steps that success is contingent upon than it is to prospectively predict reward at every step taken in the environment. ConSpec stores this knowledge in a collection of prototypes summarizing the intermediate states required for success. During training, arrival at any state that matches these prototypes generates an intrinsic reward that is added to any external rewards. As well, the reward shaping provided by ConSpec can be made to preserve the optimal policy of the underlying RL agent. The prototypes in ConSpec provide two key benefits for credit assignment: (1) They enable rapid identification of all the critical states. (2) They do so in a readily interpretable manner, enabling out of distribution generalization when sensory features are altered. In summary, ConSpec is a modular system that can be added to any existing RL algorithm to improve its long-term credit assignment.

 

LIST

CENTER FOR NEUROSCIENCE IMAGING RESEARCH

IBS Center for Neuroscience Imaging Research, N Center, Sungkyunkwan University, Seobu-ro 2066, Jangan-gu, Suwon, Korea Tel.+82-31-299-4354 / Fax.+82-31-299-4506

Copyright(c) 2022 IBS Center For Neuroscience Imaging Research. All Rights Reserved.