Opposition-based Q(λ) algorithm

Maryam Shokri, Hamid R. Tizhoosh, Mohamed Kamel

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The problem of delayed reward in reinforcement learning is usually tackled by implementing the mechanism of eligibility traces. In this paper we introduce an extension of eligibility traces to address large state spaces, one of the challenging problems in reinforcement learning applications. The concept of opposition traces is proposed in this work for this purpose. We combine the ideas of opposition and eligibility traces to construct the opposition-based Q(λ). The results are compared with the conventional Watkins' Q(λ) and show a remarkable performance increase.
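The paper itself details the exact opposition-trace update; as a rough illustration of the idea the abstract summarizes, the sketch below extends tabular Watkins' Q(λ) with a second trace that credits the opposite of each taken action with the negated reward. The `env` interface (`reset()`/`step()` returning state, reward, done) and the `opposite_action` mapping are assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def opposition_q_lambda(env, opposite_action, n_states, n_actions,
                        episodes=500, alpha=0.1, gamma=0.95,
                        lam=0.9, epsilon=0.1):
    """Tabular Watkins' Q(lambda) extended with a second, opposition trace."""
    Q = np.zeros((n_states, n_actions))
    for _ in range(episodes):
        e = np.zeros_like(Q)      # eligibility traces for taken actions
        e_opp = np.zeros_like(Q)  # opposition traces for opposite actions
        s = env.reset()
        done = False
        while not done:
            # epsilon-greedy selection; remember whether the action was greedy
            greedy_a = int(np.argmax(Q[s]))
            a = np.random.randint(n_actions) if np.random.rand() < epsilon else greedy_a
            s2, r, done = env.step(a)

            # TD errors for the taken action and (with negated reward,
            # an assumption of this sketch) for its opposite action
            target = gamma * np.max(Q[s2]) * (not done)
            a_opp = opposite_action(a)
            delta = r + target - Q[s, a]
            delta_opp = -r + target - Q[s, a_opp]

            e[s, a] += 1.0
            e_opp[s, a_opp] += 1.0
            Q += alpha * (delta * e + delta_opp * e_opp)

            # Watkins' variant: decay traces after greedy actions,
            # cut them after exploratory ones
            if a == greedy_a:
                e *= gamma * lam
                e_opp *= gamma * lam
            else:
                e[:] = 0.0
                e_opp[:] = 0.0
            s = s2
    return Q
```

Updating both traces on every transition is what lets a single real experience propagate credit along two action paths at once, which is how opposition traces can speed up learning in large state spaces.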

Original language: English (US)
Title of host publication: International Joint Conference on Neural Networks 2006, IJCNN '06
Pages: 254-261
Number of pages: 8
State: Published - 2006
Event: International Joint Conference on Neural Networks 2006, IJCNN '06 - Vancouver, BC, Canada
Duration: Jul 16, 2006 - Jul 21, 2006

Publication series

Name: IEEE International Conference on Neural Networks - Conference Proceedings
ISSN (Print): 1098-7576

Conference

Conference: International Joint Conference on Neural Networks 2006, IJCNN '06
Country/Territory: Canada
City: Vancouver, BC
Period: 7/16/06 - 7/21/06

ASJC Scopus subject areas

  • Software
