FlockGPT: Guiding UAV Flocking with Linguistic Orchestration
Artem Lykov, Sausar Karaf, Mikhail Martynov, Valerii Serpiva, Aleksey Fedoseev, Mikhail Konenkov, Dzmitry Tsetserukou
TL;DR
This work tackles scalable drone flocking controlled by natural language. It couples an LLM-driven interface to generate target surfaces via a Python-based Signed Distance Function (SDF) with a distributed flocking controller that assigns drones to surface points and maintains cohesive motion. Validation in Unity with up to 64 simulated drones and real-world Crazyflie experiments shows novice users can craft and recognize six shapes, achieving an average recognition of 80% and up to 93% for certain configurations, while NASA-TLX and UEQ metrics indicate favorable usability and experience. The approach offers an intuitive pathway for large-scale drone shows and teleoperable flocking, with potential extensions to dynamic shapes and richer human–flock interaction modalities.
Abstract
This article presents the world's first rapid drone flocking control using natural language through generative AI. The described approach enables the intuitive orchestration of a flock of any size to achieve the desired geometry. The key feature of the method is the development of a new interface based on Large Language Models to communicate with the user and to generate the target geometry descriptions. Users can interactively modify or provide comments during the construction of the flock geometry model. By combining flocking technology and defining the target surface using a signed distance function, smooth and adaptive movement of the drone swarm between target states is achieved. Our user study on FlockGPT confirmed a high level of intuitive control over drone flocking by users. Subjects who had never previously controlled a swarm of drones were able to construct complex figures in just a few iterations and were able to accurately distinguish the formed swarm drone figures. The results revealed a high recognition rate for six different geometric patterns generated through the LLM-based interface and performed by a simulated drone flock (mean of 80% with a maximum of 93\% for cube and tetrahedron patterns). Users commented on low temporal demand (19.2 score in NASA-TLX), high performance (26 score in NASA-TLX), attractiveness (1.94 UEQ score), and hedonic quality (1.81 UEQ score) of the developed system. The FlockGPT demo code repository can be found at: coming soon
