Turn off MathJax
Article Contents
WANG Xuhang, WU Haiyan. Scheduling Methods for Astronomical Satellite Target of Opportunity Tasks with High-frequency Dynamic Arrivals (in Chinese). Chinese Journal of Space Science, 2025, 45(5): 1-11 doi: 10.11728/cjss2025.05.2024-0125
Citation: WANG Xuhang, WU Haiyan. Scheduling Methods for Astronomical Satellite Target of Opportunity Tasks with High-frequency Dynamic Arrivals (in Chinese). Chinese Journal of Space Science, 2025, 45(5): 1-11 doi: 10.11728/cjss2025.05.2024-0125

Scheduling Methods for Astronomical Satellite Target of Opportunity Tasks with High-frequency Dynamic Arrivals

doi: 10.11728/cjss2025.05.2024-0125 cstr: 32142.14.cjss.2024-0125
  • Received Date: 2024-10-08
  • Rev Recd Date: 2025-03-08
  • Available Online: 2025-03-10
  • Based on the background where sky survey equipment detects tens of thousands of variable sources daily and the demand for observing these variable sources increases, a long-sequence task planning problem is formed. This problem comprises dynamically arriving high-frequency Targets of Opportunity (ToO) and their subsequent follow-up observation tasks. This type of problem is classified as an NP hard problem due to the randomness of observation events, strong timeliness of data acquisition, high selectivity of decision paths, and the coupling effect of complex constraints such as Earth occlusion and observation time windows. Traditional supervised learning methods are difficult to apply due to the difficulty in obtaining high-quality labeled data, while Deep Reinforcement Learning (DRL) combines the perceptual ability of deep learning with the sequential decision optimization mechanism of reinforcement learning to construct end-to-end autonomous decision-making agents, demonstrating the universal advantage of handling complex planning problems. This article is based on the DRL framework and constructs a Markov Decision Process (MDP) model for long sequence task planning problems. State space encoding, action space encoding, and reward functions are defined in the task planning problem to guide reinforcement learning algorithms to converge towards optimizing the target set. However, classical DRL methods have significant limitations in unsupervised learning mode - when faced with long sequence dynamic task flows, agents have difficulty quickly converging to the global optimal strategy due to the large exploration space, and lack sensitivity to local key tasks. Therefore, this paper improves the Pointer Network (PN) by drawing on the concept of the Local Attention (LA) mechanism, proposing the Local Attention Pointer Network (LA-PN) algorithm. This algorithm innovatively introduces a sliding time window constraint on the attention range, allowing the model to dynamically focus on task subsequences related to the current decision, and filter high-value targets through local context awareness to reduce redundant computation. By comparing and analyzing simulation results, verify the profitability, real-time performance, and generalization of the algorithm.

     

  • loading
  • [1]
    李大林. 天文观测卫星任务规划模型与方法研究[D]. 哈尔滨: 哈尔滨工业大学, 2020

    LI Dalin. Research on Observation Scheduling Model and Method of Astronomy Satellite[D]. Harbin: Harbin Institute of Technology, 2020
    [2]
    徐子羚, 刘玉荣, 冯准. 基于遗传算法的Tiling覆盖策略天文卫星任务规划[J]. 空间科学学报, 2022, 42(2): 321-328 doi: 10.11728/cjss2022.02.210112006

    XU Ziling, LIU Yurong, FENG Zhun. Mission planning for astronomical satellite based on genetic algorithm under tiling coverage strategy[J]. Chinese Journal of Space Science, 2022, 42(2): 321-328 doi: 10.11728/cjss2022.02.210112006
    [3]
    向尚, 陈盈果, 李国梁, 等. 卫星自主与协同任务调度规划综述[J]. 自动化学报, 2019, 45(2): 252-264

    XIANG Shang, CHEN Yinguo, LI Guoliang, et al. Review on satellite autonomous and collaborative task scheduling planning[J]. Acta Automatica Sinica, 2019, 45(2): 252-264
    [4]
    吴海燕, 孟新, 张玉珠, 等. 面向天文观测的空间科学卫星任务规划方法研究[J]. 空间科学学报, 2013, 33(5): 561-568 doi: 10.11728/cjss2013.05.561

    WU Haiyan, MENG Xin, ZHANG Yuzhu, et al. Research on the planning method for astronomy observation mission[J]. Chinese Journal of Space Science, 2013, 33(5): 561-568 doi: 10.11728/cjss2013.05.561
    [5]
    黄跃, 屈进禄, 贾淑梅, 等. HXMT卫星长期任务规划算法[J]. 空间科学学报, 2017, 37(6): 766-772 doi: 10.11728/cjss2017.06.766

    HUANG Yue, QU Jinlu, JIA Shumei, et al. Long-term planning algorithm for the HXMT mission[J]. Chinese Journal of Space Science, 2017, 37(6): 766-772 doi: 10.11728/cjss2017.06.766
    [6]
    尹晓丹, 白萌, 李卓恒. 超额订购下天文卫星短期任务的聚类规划方法[J]. 南京航空航天大学学报(英文版), 2023, 40(3): 307-322

    YIN Xiaodan, BAI Meng, LI Zhuoheng. Clustering-scheduling methods for oversubscribed short-term tasks of astronomical satellites[J]. Transactions of Nanjing University of Aeronautics and Astronautics, 2023, 40(3): 307-322
    [7]
    王海蛟. 基于强化学习的卫星规模化在线调度方法研究[D]. 北京: 中国科学院大学(中国科学院国家空间科学中心), 2018

    WANG Haijiao. Massive Scheduling Method Under Online Situation for Satellites Based on Reinforcement Learning[D]. Beijing: National Space Science Center, Chinese Academy of Sciences, 2018
    [8]
    马一凡. 基于深度强化学习的敏捷成像卫星任务规划方法[D]. 杭州: 浙江大学, 2021

    MA Yifan. Agile Imaging Satellite Task Planning Method Based on Deep Reinforcement Learning[D]. Hangzhou: Zhejiang University, 2021
    [9]
    赵世杰. 基于强化学习的卫星自主任务规划方法研究与实现[D]. 上海: 华东师范大学, 2024

    ZHAO Shijie. Research and Implementation of Satellite Autonomous Task Planning Method Based on Reinforcement Learning[D]. Shanghai: East China Normal University, 2024
    [10]
    刘晓路, 许英杰, 贺仁杰, 等. 面向卫星的在轨服务任务规划方法[J]. 国防科技大学学报, 2020, 42(5): 143-150 doi: 10.11887/j.cn.202005020

    LIU Xiaolu, XU Yingjie, HE Renjie, et al. Satellite-oriented on-orbit service mission planning method[J]. Journal of National University of Defense Technology, 2020, 42(5): 143-150 doi: 10.11887/j.cn.202005020
    [11]
    曾而康, 吴海燕, 冯准. 天区机遇目标空间探测任务规划算法[J]. 空间科学学报, 2023, 43(2): 361-368 doi: 10.11728/cjss2023.02.220225020

    ZENG Erkang, WU Haiyan, FENG Zhun. Sky area target of opportunity mission planning method[J]. Chinese Journal of Space Science, 2023, 43(2): 361-368 doi: 10.11728/cjss2023.02.220225020
    [12]
    CHUNG J, GULCEHRE C, CHO K, et al. Empirical evaluation of gated recurrent neural networks on sequence modeling[OL]. arXiv preprint arXiv: 1412.3555, 2014
    [13]
    LUONG T, PHAM H, MANNING C D. Effective approaches to attention-based neural machine translation[C]//Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon: ACL, 2015
    [14]
    BELLO I, PHAM H, LE Q V, et al. Neural combinatorial optimization with reinforcement learning[C]//Proceedings of the 5th International Conference on Learning Representations. Toulon: OpenReview. net, 2017
    [15]
    赵凡宇. 航天器多目标观测任务调度与规划方法研究[D]. 北京: 北京理工大学, 2015

    ZHAO Fanyu. Mission Scheduling and Planning of Multi-Target Observations for Spacecraft[D]. Beijing: Beijing Institute of Technology, 2015
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(6)  / Tables(4)

    Article Metrics

    Article Views(229) PDF Downloads(9) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return