Abstract

Recently Introduced Deep Reinforcement Learning (DRL) Techniques in Discrete-Time Have Resulted in Significant Advances in Online Games, Robotics, and So On. Inspired from Recent Developments, We Have Proposed an Approach Referred to as Quantile Critic with Spiking Actor and Normalized Ensemble (QC-SANE) for Continuous Control Problems, Which Uses Quantile Loss to Train Critic and a Spiking Neural Network (NN) to Train an Ensemble of Actors. the NN Does an Internal Normalization using a Scaled Exponential Linear Unit (SELU) Activation Function and Ensures Robustness. the Empirical Study on Multijoint Dynamics with Contact (MuJoCo)-Based Environments Shows Improved Training and Test Results Than the State-Of-The-Art Approach: Population Coded Spiking Actor Network (PopSAN).

Department(s)

Electrical and Computer Engineering

Second Department

Computer Science

Comments

Netaji Subhas University of Technology, Grant None

Keywords and Phrases

Actor critic; deep reinforcement learning (DRL); ensemble; reinforcement learning (RL); robust control; spiking neural network (SNN)

International Standard Serial Number (ISSN)

2162-2388; 2162-237X

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2023 Institute of Electrical and Electronics Engineers, All rights reserved.

Publication Date

01 Sep 2023

PubMed ID

34874871

Share

 
COinS