Diseño y simulación de un algoritmo de decisión de camino para un robot laberinto utilizando redes neuronales.

Balseca Balseca, Jessica Pamela

Please use this identifier to cite or link to this item: http://bibdigital.epn.edu.ec/handle/15000/22543

Title:	Diseño y simulación de un algoritmo de decisión de camino para un robot laberinto utilizando redes neuronales.
Authors:	Balseca Balseca, Jessica Pamela
Keywords:	ROBÓTICA ALGORITMOS DE APRENDIZAJE REDES NEURALES
Issue Date:	Jan-2022
Publisher:	Quito : EPN, 2022.
Citation:	Balseca Balseca, J. P. (2022). Diseño y simulación de un algoritmo de decisión de camino para un robot laberinto utilizando redes neuronales. 55 páginas. Quito : EPN.
Abstract:	El presente trabajo consiste en el diseño y simulación de un algoritmo de decisión capaz de resolver laberintos empleando redes neuronales bajo las arquitecturas secuencial y recursiva LSTM. Previo al diseño, se definen algunos conceptos generales e importantes dobre el aprendizaje profundo por refuerzo junto con los componentes y el algoritmo de aprendizaje Q-learning y algoritmo de optimización Adam con la función de pérdida MSE. Se considera un enfoque mixto de tipo cualitativo ya que se necesita profundizar los conocimientos de algunas librerías compatibles con Phyton como Numpy encargada de visualizar datos mediante arrays, Pytorch enfocado al uso de redes neuronales, Matplotlib que permite crear visualizaciones de los laberintos, Jupiter que ayuda a configurar modelos de las redes y el paquete Tkinter con el cual se realiza la interfaz entre usuario y máquina y Jupiteer. En segunda instancia, se utiliza un enfoque cuantitativo ya que se construye el algoritmo Q-learning basado en los estados y acciones que realiza el agente para definir la recompensa en función de los objetivos a corto y largo plazo. Además, se genera laberintos aleatorios con el propósito de que el agente no sea capaz de memorizar el camino directo hacia la meta, por tanto, se desarrolla la repetición de la experiencia para que sea capaz de entrenarse ante esta diversidad. Por consiguiente, se realizaron varias pruebas que reflejan el entrenamiento, función de pérdida y comparación de arquitecturas, siendo la secuencial más efectiva proporcionando un valor superior al recursivo del 71% para 100 laberintos generados.
Description:	Abstract: The present job consists in the design and simulation of a decision algorithm capable of solving labyrinths using neural networks under the sequential and recursive architectures LSTM. Prior to design, some general and important concepts on deep reinforcement learning are defined along with the components and learning algorithm Q-learning and Adam optimization algorithm with the loss function MSE. It is considered a mixed approach of qualitative type since it is necessary to deepen the knowledge of some libraries compatible with Python as NumPy in charge of visualizing data through arrays, PyTorch focused on the use of neural networks, Matlplotlib that allows you to create visualizations of the mazes, Jupiter that helps to configure models of the networks and the Tkinter package with which the interface between user and machine is made and Jupiter. In the second instance, a quantitative approach is used since the Q-learning algorithm is constructed based on the states and actions performed by the agent to define the reward based on short- and long-term objectives. In addition, random labyrinths are generated with the purpose that the agent is not able to memorize the direct path to the goal, therefore, the repetition of the experience is developed to be able to train before this diversity. Therefore, several tests were performed that reflect the training, loss function and comparison of architectures, being the most effective sequential providing a value higher than the recursive 71% for 100 generated labyrinths.
URI:	http://bibdigital.epn.edu.ec/handle/15000/22543
Type:	bachelorThesis
Appears in Collections:	TIC - Electrónica y Automatización (FIEE)

Files in This Item:

File	Description	Size	Format
CD 11917.pdf		10,8 MB	Adobe PDF	View/Open

Show full item record

BIBDIGITAL