Overview of the pipeline for pre-training the general policy and fine-tuning it online (IMAGE)
Caption
This figure gives an overview of the proposed method in two stages: offline pretraining with supervised learning, followed by online fine-tuning with multi-agent reinforcement learning (MARL) algorithms.
Credit
Beijing Zhongke Journal Publishing Co. Ltd.
Usage Restrictions
Credit must be given to the creator.
License
CC BY