: ..,
..
:
: 120
:
: 2026
: .., .. // . - 2026. - . 120. - .369-389.
: , , ,
(.): conflict, horizontal maneuver, reinforcement learning, agent learning algorithm
: : . , , . . , , , . . Q- , . , .
(.): This paper briefly examines two main formulations of the aircraft conflict avoidance problem based on reinforcement learning: autonomous multi-aircraft conflict resolution using multi-agent deep reinforcement learning and conflict avoidance solution generation for air traffic controller decision support systems. The second formulation is particularly relevant for modern air traffic control, as the implementation of fully automated methods faces significant challenges in certifying machine-learning methods in civil aviation, where safety is crucial. This paper considers the problem of generating a horizontal maneuver to avoid a conflict between two aircraft using reinforcement learning. Unlike studies solving this problem in a continuous action space, this paper proposes learning an agent to act in a discrete space, which better aligns with the actions of air traffic controllers in centralized conflict avoidance. A formalization of the problem as a Markov decision process is presented. The reinforcement learning algorithms "dual deep Q-networks" and "proximal policy optimization" chosen to solve the problem are briefly described. The results of training and testing the agent in the developed simulation environment using the implemented algorithms are presented, and their effectiveness is compared.
PDF
: 49, : 29, : 22.