1.Zhang, L. L. and Guo, X. P. Constrained continuous-time Markov decision processes with average criteria. Mathematical Methods of Operations Research. 67(2): 323-340, 2008.
2. 张兰兰,郭先平. 受控排队系统的平均最优与约束平均最优. 控制理论与应用.26(2): 139-144, 2009.
3. Xianping Guo and Lanlan Zhang. Total Reward Criteria for Unconstrained/Constrained Continuous-Time Markov Decision. Journal of Systems Science and Complexity. 24: 491-505, 2011. |