Markov Decision Theory and Reinforcement Learning
- Q. Zhao
Multi-Armed Bandits: Theory and Applications to Online Learning in Networks
Morgan and Claypool Publishers, 2019. - X. Xu, S. Vakili, Q. Zhao, A. Swami
Multi-Armed Bandits on Partially Revealed Unit Interval Graphs
to appear in IEEE Transactions on Network Science and Engineering. - X. Xiao, Q. Zhao
Distributed No-Regret Learning in Multi-Agent Systems
to appear in IEEE Signal Processing Magazine. - S. Vakili, A. Boukouvalas, and Q. Zhao
Decision Variance in Risk-Averse Online Learning
IEEE Conference on Decision and Control (CDC), December, 2019. - S. Baltaoglu, L. Tong, Q. Zhao
Online Learning of Optimal Bidding Strategy in Repeated Multi-Commodity Auctions
The 31st Annual Conference on Neural Information Processing Systems (NIPS), December, 2017. - S. Vakili, Q. Zhao
Risk-Averse Multi-Armed Bandit Problems under Mean-Variance Measure
IEEE Journal of Selected Topics in Signal Processing (JSTSP): Special Issue on Financial Signal Processing and Machine Learning for Electronic Trading, vol. 10, no. 6, pp. 1093-1111, September, 2016.
Also available at arXiv.org - Y. Zhai, Q. Zhao
Oligopoly Dynamic Pricing: A Repeated Game with Incomplete Information
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March, 2016. - S. Vakili, Q. Zhao
Achieving Complete Learning in Multi-Armed Bandit Problems
The 47th IEEE Asilomar Conference on Signals, Systems, and Computers, November, 2013. - S. Vakili, K. Liu, Q. Zhao
Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems
IEEE Journal of Selected Topics in Signal Processing (JSTSP), vol. 7, no. 5, pp. 759 – 767, October, 2013.
Also available at arXiv.org. - H. Liu, K. Liu, Q. Zhao
Learning in A Changing World: Restless Multiarmed Bandit with Unknown Dynamics
IEEE Transactions on Information Theory, vol. 59, no. 3, pp. 1902-1916, March 2013.
Also available at arXiv.org. - K. Liu, Q. Zhao
Dynamic Intrusion Detection in Resource-Constrained Cyber Networks
IEEE International Symposium on Information Theory (ISIT), July, 2012.
Also available at arXiv.org - K. Liu, Q. Zhao
Adaptive Shortest-Path Routing under Unknown and Stochastically Varying Link States
The 10th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt), May, 2012. - K. Liu, Q. Zhao
Cooperative Game in Dynamic Spectrum Access with Unknown Model and Imperfect Sensing
IEEE Transactions on Wireless Communications, vol. 11, no. 4, pp. 1596-1604, April, 2012. - K. Liu, R.R. Weber, Q. Zhao
Indexability and Whittle Index for Restless Bandit Problems Involving Reset Processes
The 50th IEEE Conference on Decision and Control (CDC), December, 2011. - Q. Zhao, J. Ye
Quickest Detection in Multiple On-Off Processes
IEEE Transactions on Signal Processing, vol. 58, no. 12, pp. 5994-6006, December, 2010. - K. Liu, Q. Zhao
Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access
IEEE Transactions on Information Theory, vol. 56, no. 11, pp. 5547-5567, November, 2010.
Also available at arXiv.org. - K. Liu, Q. Zhao
Distributed Learning in Multi-Armed Bandit with Multiple Players
IEEE Transactions on Signal Processing, vol. 58, no. 11, pp. 5667-5681, November, 2010.
Also available at arXiv.org. - K. Liu, Q. Zhao, B. Krishnamachari
Dynamic Multichannel Access with Imperfect Channel State Detection
IEEE Transactions on Signal Processing, vol. 58, No. 5, pp. 2795 – 2808, May, 2010. - S.H. Ahmad, M. Liu, T. Javidi, Q. Zhao, B. Krishnamachari
Optimality of Myopic Sensing in Multichannel Opportunistic Access
IEEE Transactions on Information Theory, vol. 55, No. 9, pp. 4040-4050, September, 2009.
Also available at arXiv.org. - Y. Chen, Q. Zhao, A. Swami
Distributed Spectrum Sensing and Access in Cognitive Radio Networks with Energy Constraint
IEEE Transactions on Signal Processing, vol. 57, no. 2, pp. 783-797, February, 2009. - Q. Zhao, B. Krishnamachari, K. Liu
On Myopic Sensing for Multi-Channel Opportunistic Access: Structure, Optimality, and Performance
IEEE Transactions on Wireless Communications, vol. 7, no. 12, pp. 5431-5440, December, 2008.
Also available at arXiv.org. - Y. Chen, Q. Zhao, A. Swami
Joint Design and Separation Principle for Opportunistic Spectrum Access in the Presence of Sensing Errors
IEEE Transactions on Information Theory, vol. 54, no. 5, pp. 2053-2071, May, 2008.
Also available at arXiv.org. - Y. Chen, Q. Zhao, V. Krishnamurthy, D. Djonin
Transmission Scheduling for Optimizing Sensor Network Lifetime: A Stochastic Shortest Path Approach
IEEE Transactions on Signal Processing, vol. 55, no. 5, pp. 2294-2309, May, 2007. - Q. Zhao, L. Tong, A. Swami, Y. Chen
Decentralized Cognitive MAC for Opportunistic Spectrum Access in Ad Hoc Networks: A POMDP Framework
IEEE Journal on Selected Areas in Communications (JSAC), vol. 25, no. 3, pp. 589-600, April, 2007.
(Download Simulation Source Code)
IEEE Copyright Information: Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from the copyright holder.