Design Of a multi-agent architecture using deep reinforcement learning to decrease the number of interactions between agents and environment by communicating knowledge.

Cárdenas Guilcapi, David Alexander

Por favor, use este identificador para citar o enlazar este ítem: http://bibdigital.epn.edu.ec/handle/15000/23771

Título:	Design Of a multi-agent architecture using deep reinforcement learning to decrease the number of interactions between agents and environment by communicating knowledge.
Autor:	Cárdenas Guilcapi, David Alexander
Palabras clave:	INTELIGENCIA ARTIFICIAL COMPUTACIÓN APRENDIZAJE PROFUNDO MULTIAGENTES
Fecha de publicación:	ene-2023
Editorial:	Quito : EPN, 2023.
Citación:	Cárdenas Guilcapi, D.A. (2023). Design Of a multi-agent architecture using deep reinforcement learning to decrease the number of interactions between agents and environment by communicating knowledge. 35 páginas. Quito : EPN.
Resumen:	The purpose of this integration is to create a communication system where agents can learn from the environment using a deep learning algorithm and communicate the knowledge obtained. Through communication, agents can take advantage of the knowledge of its peers in a way that the number of incorrect actions taken in the environment is reduced. Apart from the use of a pair-based multi-agent architecture with well-defined roles, some adjustments were performed in the deep q learning algorithm. One of them is the addition of information to the observations stored in the experience replay; this additional information is a flag which allows the agent to recognize a relevant state so the value of the rewards can be adjusted during the training of the network. In addition, the use of epsilon-greedy is limited to prevent an agent from exploring states that have already been explored and reported by other agents. These states have been identified as having errors and are therefore not worthy of further exploration. These adjustments proved to be effective since the agent reduces the number of episodes where errors are made in around 86%.
Descripción:	El presente proyecto de investigación introduce una integración de tres técnicas de inteligencia artificial. El propósito es crear un sistema de comunicación en donde los agentes puedan aprender del ambiente utilizando un algoritmo de aprendizaje profundo y comunicar el conocimiento obtenido. Mediante la comunicación, los agentes pueden tomar ventaja del conocimiento de sus semejantes de manera que el número de acciones incorrectas tomadas en el ambiente se reducen. Aparte del uso de una arquitectura multi agente con roles bien definidos, algunos ajustes se realizaron en el algoritmo Deep Q Learning. Uno de ellos es la adición de información a las observaciones almacenadas en la experience replay; esta información adicional es una bandera que permite que el agente reconozca un estado relevante de manera que el valor de las recompensas pueda ajustarse durante el entrenamiento de la red neuronal. Otro ajuste es el uso limitado de 𝜀-greedy el cual previene que un agente explore un estado que ya ha sido comunicado por sus similares, ya que los estados que son comunicados representan una observación en donde se cometió un error. Estos ajustes demostraron ser efectivos ya que el agente reduce el número de episodios donde se cometen errores en un 86%. This research project introduces an integration of three artificial intelligence techniques.
URI:	http://bibdigital.epn.edu.ec/handle/15000/23771
Tipo:	bachelorThesis
Aparece en las colecciones:	Tesis Maestría en Ciencias de la Computación y Comercio Electrónico (FIS)

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
CD 13035.pdf		1,72 MB	Adobe PDF	Visualizar/Abrir

Muestra el registro Dublin Core completo del ítem

BIBDIGITAL