| A Regularized Opponent Model with Maximum Entropy Objective | Zheng Tian, Ying Wen, Zhichen Gong, Faiz Punakkath, Shihao Zou, Jun Wang | IJCAI'2019 |
| Learning Adaptive Display Exposure for Real-Time Advertising | Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Weinan Zhang, Jun Wang, Xiaotian Hao, Yixi Wang, Han Li, Jian Xu and Kun Gai | CIKM'2019 |
| Towards Efficient Detection and Optimal Response against Sophosticated Opponents | Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng, Ze Zheng | IJCAI'2019 |
| Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces | Haotian Fu, Hongyao Tang, Jianye Hao, Zihan Lei, Yingfeng Chen, Changjie Fan | IJCAI'2019 |
| Context-Aware Policy Reuse | Siyuan Li, Fangda Gu, Guangxiang Zhu, Chongjie Zhang | AAMAS'2019 |
| Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG | Hangyu Mao, Zhengchao Zhang, Zhen Xiao and Zhibo Gong | AAMAS'2019 |
| A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer | Fuli Luo , Peng Li , Jie Zhou , Pengcheng Yang , Baobao Chang , Zhifang Sui , Xu Sun | IJCAI'2019 |
| Budget-feasible Procurement Mechanisms in Two-sided Markets | Weiwei Wu, Xiang Liu, Minming Li | IJCAI'2018 |
| Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning | Wenzhen Huang, Junge Zhang, Kaiqi Huang | AAAI'2019 |
| Playing FPS Games With Environment-Aware Hierarchical Reinforcement Learning | Shihong Song, Jiayi Weng, Hang Su, Dong Yan, Haosheng Zou, Jun Zhu | IJCAI'2019 |
| Learning Attentional Communication for Multi-Agent Cooperation | Jiechuan Jiang and Zongqing Lu | NeurIPS'2018 |
| A Multi-Agent Communication Framework for Question-Worthy Phrase Extraction and Question Generation | Siyuan Wang, Zhongyu Wei, Zhihao Fan, Yang Liu, Xuanjing Huang | AAAI'2019 |
| Selling Multiple Items via Social Networks | Dengji Zhao | AAMAS'2018 |
| Mean Field Multi-Agent Reinforcement Learning | Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, Jun Wang | ICML'2018 |
| A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems | Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang | AAAI'2019 |
| Towards Faster Convention Emergence from Multilateral Coordination: A Gist Trace-based Multi-Agent Reinforcement Learning Approach | Shuyue Hu, Chin-wing Leung, Ho-fung Leung and Jiamou Liu | AAMAS'2019 |
| Value Function Transfer for Deep Multi-Agent Reinforcement Learning Based on N-Step Returns | Yong Liu, Yujing Hu , Yang Gao , Yingfeng Chen, Changjie Fan | IJCAI'2019 |
| StackDRL: Stacked Deep Reinforcement Learning for Fine-grained Visual Categorization | Xiangteng He, Yuxin Peng and Junjie Zhao | IJCAI'2018 |
| An Optimal Online Method of Selecting Source Policies for Reinforcement Learning | Siyuan Li, Chongjie Zhang | AAAI'2018 |
| Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach | Shuyue Hu, Chin-wing Leung and Ho-fung Leung | NeurIPS'2019 |
| Coordinated Multiagent Reinforcement Learning for Teams of Mobile Sensing Robots. | Chao Yu, Xin Wang, Zhanbo Feng | AAMAS'2019 |
| Dynamic electronic toll collection via multi-agent deep reinforcement learning with edge-based graph convolutional network representation | Wei Qiu, Haipeng Chen, Bo An | IJCAI'2019 |
| Optimal interdiction of urban criminals with the aid of real-time information | Youzhi Zhang, Qingyu Guo, Bo An, Long Tran-Thanh, Nicholas Jennings | AAAI'2019 |
| PT-ISABB A Hybrid Tree-based Complete Algorithm to Solve Asymmetric Distributed Constraint Optimization Problems | Yanchen Deng, Ziyu Chen, Dingding Chen, Xingqiong Jiang, Qiang Li | AAMAS'2019 |
| Fully Parameterized Quantile Function for Distributional Reinforcement Learning | Derek Yang, Li Zhao, Zichuan Lin, Jiang Bian, Tao Qin, Tie-Yan Liu. | NeurIPS'2019 |
| Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Siyuan Li, Rui Wang, Minxue Tang, Chongjie Zhang | NeurIPS'2019 |
| Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning | Zhihao Fan , Zhongyu Wei , Siyuan Wang, Xuanjing Huang | ACL'2019 |
| Multi-unit Budget Feasible Mechanisms for Cellular Traffic Offloading | Jun Wu, Yuan Zhang, Yu Qiao, Lei Zhang, Chongjun Wang, and Junyuan Xie | AAMAS'2019 |
| SA-IGA: amultiagent reinforcement learning method towards socially optimal outcomes | Chengwei Zhang, Xiaohong Li, Jianye Hao, Siqi Chen, Karl Tuyls, Wanli Xue, Zhiyong Feng | JAAMAS'2019 |
| Cooperation Enforcement and Collusion Resistance in Repeated Publi c Goods Games | Kai Li, Dong Hao | AAAI'2019 |
| Regret Minimization for Reinforcement Learning by Evaluating the Optimal Bias Function | Zihan Zhang | NeurIPS'2019 |
| Cascaded algorithm-selection and hyper-parameter optimization with extreme-region upper confidence bound bandit | Yi-Qi Hu, Yang Yu and Jun-Da Liao | IJCAI'2019 |
| Environment reconstruction with hidden confounders for reinforcement learning based recommendation | Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei Qin, Yiping Meng and Jieping Ye | KDD'2019 |
| A deep bayesian policy reuse approach against non-stationary agents | Yan Zheng, Zhaopeng Meng, Jianye Hao, Zhangzong Zhang, Tianpei Yang, Changjie Fan | NeurIPS'2018 |
| Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems | Xiaotian Hao, Weixun Wang, Yaodong Yang, Jianye Hao | AAMAS'2019 |