Linear Algebraic Systems Neural Network Solution. Part 1
DOI:
https://doi.org/10.18372/1990-5548.75.17542Keywords:
linear algebraic systems, condition number, ill-conditioned systems, neural network, gradient descent, TensorFlowAbstract
In recent years, neural networks have become increasingly popular due to their versatility in solving complex problems. One area of interest is their application in solving linear algebraic systems, especially those that are ill-conditioned. The solutions of such systems are highly sensitive to small changes in their coefficients, leading to unstable solutions. Therefore, solving these types of systems can be challenging and require specialized techniques. This article explores the use of neural network methodologies for solving linear algebraic systems, focusing on ill-conditioned systems. The primary goal is to develop a model capable of directly solving linear equations and to evaluate its performance on a range of linear equation sets, including ill-conditioned systems. To tackle this problem, neural network implementing iterative algorithm was built. Error function of linear algebraic system is minimized using stochastic gradient descent. This model doesn’t require extensive training other than tweaking learning rate for particularly large systems. The analysis shows that the suggested model can handle well-conditioned systems of varying sizes, although for systems with large coefficients some normalization is required. Improvements are necessary for effectively solving ill-conditioned systems, since researched algorithm is shown to be not numerically stable. This research contributes to the understanding and application of neural network techniques for solving linear algebraic systems. It provides a foundation for future advances in this field and opens up new possibilities for solving complex problems. With further research and development, neural network models can become a powerful tool for solving ill-conditioned linear systems and other related problems.
References
G. E. Forsythe, M. A. Malcolm, & C. B. Moler, Computer Solution of Linear Algebraic Systems. New York: Prentice-Hall, Inc., 1967, 259 р. https://doi.org/10.1002/zamm.19790590235
Hilbert, David, “Ein Beitrag zur Theorie des Legendre'schen Polynoms“, (1894) Acta Mathematica, 18: 155–159, https://doi.org/10.1007/BF02418278
M. Zgurovsky, V. Sineglazov and E. Chumachenko, Artificial Intelligence Systems Based on Hybrid Neural Networks. Theory and Applications, Springer Cham, 2020, 512 p. https://doi.org/10.1007/978-3-030-48453-8
A. Cichocki and R. Unbehauen, “Neural networks for solving systems of linear equations and related problems,” IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications. vol. 39, no. 2, 1992, рp. 124–138. https://doi.org/10.1109/81.167018
C. Darken, J. Chang, and J. Moody, “Learning rate schedules for faster stochastic gradient search,”. Neural Networks for Signal Processing, Proceedings of the 1992 IEEE Workshop, vol. 2, Sep 1992, pp. 1–11. https://doi.org/10.1109/NNSP.1992.253713
TensorFlow documentation. https://www.tensorflow.org/api_docs/python/tf
Keras documentation. https://keras.io/api/
NumPy documentation. https://numpy.org/doc/stable/reference/index.html#reference
Matplotlib documentation. https://matplotlib.org/stable/api/index.html
Downloads
Published
Issue
Section
License
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).