Continual Optimistic Initialization for Value-Based Reinforcement Learning (COIN)
NEW
Project currently under paper construction phase. Links and updates will be added in the coming months• Designed COIN exploration strategy overcoming optimistic initialization limitations in non-stationary environments.• Showcased higher average returns over existing strategies across five out of 6 benchmark domains in OpenAI Gym, illustrating the superiority of COIN strategy.• Conducted ablation studies to analyze individual hyperparameters.