A Regularized Opponent Model with Maximum Entropy Objective |
Zheng Tian, Ying Wen, Zhichen Gong, Faiz Punakkath, Shihao Zou, Jun Wang |
IJCAI'2019 |
Learning Adaptive Display Exposure for Real-Time Advertising |
Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Weinan Zhang, Jun Wang, Xiaotian Hao, Yixi Wang, Han Li, Jian Xu and Kun Gai |
CIKM'2019 |
Towards Efficient Detection and Optimal Response against Sophisticated Opponents |
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng, Ze Zheng |
IJCAI'2019 |
Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces |
Haotian Fu, Hongyao Tang, Jianye Hao, Zihan Lei, Yingfeng Chen, Changjie Fan |
IJCAI'2019 |
Context-Aware Policy Reuse |
Siyuan Li, Fangda Gu, Guangxiang Zhu, Chongjie Zhang |
AAMAS'2019 |
Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG |
Hangyu Mao, Zhengchao Zhang, Zhen Xiao and Zhibo Gong |
AAMAS'2019 |
A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer |
Fuli Luo, Peng Li, Jie Zhou, Pengcheng Yang, Baobao Chang, Zhifang Sui, Xu Sun |
IJCAI'2019 |
Budget-feasible Procurement Mechanisms in Two-sided Markets |
Weiwei Wu, Xiang Liu, Minming Li |
IJCAI'2018 |
Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning |
Wenzhen Huang, Junge Zhang, Kaiqi Huang |
AAAI'2019 |
Playing FPS Games With Environment-Aware Hierarchical Reinforcement Learning |
Shihong Song, Jiayi Weng, Hang Su, Dong Yan, Haosheng Zou, Jun Zhu |
IJCAI'2019 |
Learning Attentional Communication for Multi-Agent Cooperation |
Jiechuan Jiang and Zongqing Lu |
NeurIPS'2018 |
A Multi-Agent Communication Framework for Question-Worthy Phrase Extraction and Question Generation |
Siyuan Wang, Zhongyu Wei, Zhihao Fan, Yang Liu, Xuanjing Huang |
AAAI'2019 |
Selling Multiple Items via Social Networks |
Dengji Zhao |
AAMAS'2018 |
Mean Field Multi-Agent Reinforcement Learning |
Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, Jun Wang |
ICML'2018 |
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems |
Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang |
AAAI'2019 |
Towards Faster Convention Emergence from Multilateral Coordination: A Gist Trace-based Multi-Agent Reinforcement Learning Approach |
Shuyue Hu, Chin-wing Leung, Ho-fung Leung and Jiamou Liu |
AAMAS'2019 |
Value Function Transfer for Deep Multi-Agent Reinforcement Learning Based on N-Step Returns |
Yong Liu, Yujing Hu, Yang Gao, Yingfeng Chen, Changjie Fan |
IJCAI'2019 |
StackDRL: Stacked Deep Reinforcement Learning for Fine-grained Visual Categorization |
Xiangteng He, Yuxin Peng and Junjie Zhao |
IJCAI'2018 |
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning |
Siyuan Li, Chongjie Zhang |
AAAI'2018 |
Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach |
Shuyue Hu, Chin-wing Leung and Ho-fung Leung |
NeurIPS'2019 |
Coordinated Multiagent Reinforcement Learning for Teams of Mobile Sensing Robots |
Chao Yu, Xin Wang, Zhanbo Feng |
AAMAS'2019 |
Dynamic electronic toll collection via multi-agent deep reinforcement learning with edge-based graph convolutional network representation |
Wei Qiu, Haipeng Chen, Bo An |
IJCAI'2019 |
Optimal interdiction of urban criminals with the aid of real-time information |
Youzhi Zhang, Qingyu Guo, Bo An, Long Tran-Thanh, Nicholas Jennings |
AAAI'2019 |
PT-ISABB: A Hybrid Tree-based Complete Algorithm to Solve Asymmetric Distributed Constraint Optimization Problems |
Yanchen Deng, Ziyu Chen, Dingding Chen, Xingqiong Jiang, Qiang Li |
AAMAS'2019 |
Fully Parameterized Quantile Function for Distributional Reinforcement Learning |
Derek Yang, Li Zhao, Zichuan Lin, Jiang Bian, Tao Qin, Tie-Yan Liu |
NeurIPS'2019 |
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards |
Siyuan Li, Rui Wang, Minxue Tang, Chongjie Zhang |
NeurIPS'2019 |
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning |
Zhihao Fan, Zhongyu Wei, Siyuan Wang, Xuanjing Huang |
ACL'2019 |
Multi-unit Budget Feasible Mechanisms for Cellular Traffic Offloading |
Jun Wu, Yuan Zhang, Yu Qiao, Lei Zhang, Chongjun Wang, and Junyuan Xie |
AAMAS'2019 |
SA-IGA: a multiagent reinforcement learning method towards socially optimal outcomes |
Chengwei Zhang, Xiaohong Li, Jianye Hao, Siqi Chen, Karl Tuyls, Wanli Xue, Zhiyong Feng |
JAAMAS'2019 |
Cooperation Enforcement and Collusion Resistance in Repeated Public Goods Games |
Kai Li, Dong Hao |
AAAI'2019 |
Regret Minimization for Reinforcement Learning by Evaluating the Optimal Bias Function |
Zihan Zhang |
NeurIPS'2019 |
Cascaded algorithm-selection and hyper-parameter optimization with extreme-region upper confidence bound bandit |
Yi-Qi Hu, Yang Yu and Jun-Da Liao |
IJCAI'2019 |
Environment reconstruction with hidden confounders for reinforcement learning based recommendation |
Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei Qin, Yiping Meng and Jieping Ye |
KDD'2019 |
A deep bayesian policy reuse approach against non-stationary agents |
Yan Zheng, Zhaopeng Meng, Jianye Hao, Zhangzong Zhang, Tianpei Yang, Changjie Fan |
NeurIPS'2018 |
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems |
Xiaotian Hao, Weixun Wang, Yaodong Yang, Jianye Hao |
AAMAS'2019 |