Working paper

Neural Networks and Monte-Carlo Method Usage in Multi-Agent Systems for Sudoku Problem Solving

Year:

2021

Published in:

SSRN
DQN
DDQN
TD
PPO
neural network
deep learning
reinforcement learning
multi-agent system
MCTS
Q-Learning

The object of research is multi-agent systems based on Deep Reinforcement Learning algorithms and analysis of ways to establish interaction within the system, based on intelligent agents. Also, part of the material in this paper covers ways to organize the management and administration of agents at the meta-level: external control-lers and tools to optimize their work, describing architectural solutions that should accelerate agents’ training. The studied full-fledged multi-agent system would be flexible to expansion and would give effective acceleration in agent training and problem-solving quality. In this paper, the following neural network models were considered: DQN, DDQN, PPO, TD (methods based on Q-Learning), an approach using a neural network with Monte-Carlo tree search. The presented models were tested on a Sudoku problem with a dataset of 5039 combinations, dimensions 2 × 2, 4 × 4, and 9 × 9. Several sets of agent rewards were used. The presentation of data during the learning and problem-solving process was described. Also was built a multi-agent system based on the model using a Monte-Carlo tree search. According to the study results, it was revealed that for tasks in a complex environment, the models based on Q-Learning are practically ineffective (plots support the statement). The training process for these models is quite demanding on the characteristics of the workstation hardware. It was also determined that the Monte-Carlo tree search method does a good job. Even with a small number of iterations, it shows results better than other Deep Learning methods (45–50 % accuracy for 9 × 9). However, a significant drawback is a complexity of training the model, and the hardware requirements are too large for this kind of research.

Other publications by

12 publications found

2019
Journal article

COMPARATIVE ANALYSIS OF SOFTWARE LIBRARIES FOR THE CLASSIFICATION OF TEXT DATA USING ARTIFICIAL NEURAL NETWORKS

Publisher: Таврійський національний університет ім. В.І. Вернадського

Authors: Vadym Yaremenko, Mykola Tarasenko

2020
Journal article

МОДЕЛЬ МУЛЬТИАГЕНТНОЇ СИСТЕМИ ДЛЯ СЕМАНТИЧНОГО АНАЛІЗУ ТЕКСТІВ

Publisher: Луцький національний технічний університет

Authors: Vadym Yaremenko, Andrii Khudiakov

2025
Journal article

The development of an electronic circuit simulation system using variable tabular bases

Publisher: Technology Center PC

Authors: Vadym Yaremenko, Bogdan Bulakh, Yaroslav Kornachevskyy, Oleksandr Beznosyk, Kostyantyn Kharchenko

2022
Journal article

A theoretically proposed algorithm in a decision tree format for choosing an efficient storage type of large datasets

Publisher: Technology Center PC

Authors: Sofiia Materynska, Vadym Yaremenko, Walery Rogoza

2020
Working paper

Development of a Multi‑Agent System for Solving Domain Dictionary Construction Problem

Publisher: SSRN

Authors: Vadym Yaremenko, Oleksandr Syrotiuk

2019
Journal article

Підхід до використання фільтра блума для багатокласової класифікації текстових даних в режимі реального часу.

Publisher: Technology Center PC

Authors: Vadym Yaremenko, Dmytro Budonnyi

2021
Journal article

A comparative analysis of text data classification accuracy and speed using neural networks, Bloom filter and naive Bayes

Publisher: Technology Center PC

Authors: Olena Hryshchenko, Vadym Yaremenko

2024
Journal article

Forecasting software development costs in scrum iterations using ordinary least squares method

Publisher: Technology Center PC

Authors: Vadym Yaremenko, Kostyantyn Kharchenko, Oleksandr Beznosyk, Bogdan Bulakh, Bogdan Kyriusha

2020
Journal article

Використання штучних нейронних мереж для визначення наявності сердцево‑судинних хвороб та захворювань печінки при малих наборах даних.

Publisher: Луцький національний технічний університет

Authors: Vadym Yaremenko, Sofiia Materynska

2019
Journal article

Порівняльний Аналіз Програмних Бібліотек Для Класифікації Текстових Даних Із Використанням Штучних Нейронних Мереж

Publisher: Вчені записки ТНУ імені В.І. Вернадського

Authors: Vadym Yaremenko, Maksym Tarasenko