您现在的位置是：首页 > 后端

当前栏目

Q-Learning算法（command_line_reinforcement_learning）

算法 learning Command line Reinforcement

2023-09-27 14:26:47 时间

Q-Learning算法

import numpy as np
import pandas as pd
import time

np.random.seed(2)  # reproducible


N_STATES = 6   # the length of the 1 dimensional world
ACTIONS = ['left', 'right']     # available actions
EPSILON = 0.9   # greedy police
ALPHA = 0.1     # learning rate
GAMMA = 0.9    # discount factor
MAX_EPISODES = 13   # maximum episodes
FRESH_TIME = 0.3    # fresh time for one move


def build_q_table(n_states, actions):
    table = pd.DataFrame(
        np.zeros((n_states, len(actions))),     # q_table initial values
        columns=actions,    # actions's name
    )
    print(table)    # show table
    return table


def choose_action(state, q_table):

猜你喜欢

洛谷 UVA10226 Hardwood Species
jdk的server模式修改无效（关于client和server模式）
有了这 12 款 IDEA 插件后，室友再也不叫我小白了
第13章 Linux的网络管理
C# winform combobox控件中子项加删除按钮
23周一
SwiftUI 全站项目之Django服务器和客户端Moya Alamofire URLSession 支持GET和POST Kingfisher （教程含源码）
异常排查 | warning: push.default is unset； its implicit value is changing in Git 2.0
Nginx 高可用方案
如何在 Ubuntu 上安装 MongoDB
【测试过程问题记录】解决平台创建时间和入库时间不同步问题
python中的if判断语句
Oracle 执行计划（Explain Plan）说明
[LeetCode] Implement Queue using Stacks 用栈来实现队列
python自学篇——PyGame模块的所有功能函数详解
神经网络与机器学习笔记—小规模和大规模学习问题
Excel VLOOKUP实用教程之 10 在使用 VLOOKUP 函数时处理错误?（教程含数据excel）
查看mysql运行日志：mysql_general_log

相关主题

去雾算法2020
刷算法题
算法评价
常用算法总结
秒懂排序算法
算法-递归算法

zl程序教程

当前栏目

Q-Learning算法（command_line_reinforcement_learning）

Q-Learning算法

相关文章