目录摘要(关键词)........................................................11.引言.............................................................11.1选题意义....................................................11.2国内外发展状况..............................................11.3展望........................................................21.4目前强化学习遇到的问题......................................21.5研究方法的探索..............................................32.系统模型及理论支持...............................................32.1马尔科夫决策过程............................................32.2强化学习....................................................32.3Q学习......................................................42.4Q学习的信道选择算法........................................52.5Boltzmann学习规则..........................................62.6模拟退火过程................................................72.7OFDM的信道接入方式.........................................72.8系统模型....................................................72.9蒙特卡洛方法................................................83.算法流程及实验过程...............................................83.1实验过程....................................................83.2算法流程....................................................83.3仿真过程...................................................104.结果分析........................................................114.1数据分析...................................................114.2实验中存在的问题和可行性...................................135.结束语..........................................................136.参考文献........................................................137.致谢............................................................14abstract(keywords)................................................15基于客户端的学习算法节能问题【摘要】如今移动客户端数量日益增多,通信频道资源渐渐难以满足需求,本文采用一种基于Q学习的信道选择算法,通过建立点...