Offline Reinforcement Learning:a Survey
李晓峰,蒋佳慧,王雪娆
Journal of Chinese Computer Systems . 2026, (5): 1056 -1069 .