Yang Zhang, Bo Tang, Qingyu Yang*, et al. BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market. Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021, Accepted. (CCF Class A conference; paper acceptance rate 26.0%. This paper was completed in collaboration between our research group and Alibaba.)