您现在的位置是：首页 > IT要闻

当前栏目

不完美信息游戏中的搜索

学习计算机

2023-03-14 22:36:25 时间

从这个领域的黎明开始，带有价值函数的搜索就是计算机游戏研究的一个基本概念。图灵1950年的国际象棋算法能够提前两步思考，香农1950年关于国际象棋的工作包括一个关于搜索中使用的评价函数的广泛章节。塞缪尔1959年的跳棋程序已经结合了搜索和价值函数，这些函数是通过自我游戏和引导来学习的。TD-Gammon在这些想法的基础上进行了改进，并使用神经网络来学习这些复杂的价值函数--只是为了在搜索中再次使用。决策时间搜索和价值函数的结合已经成为计算机在长期的挑战性游戏中战胜人类对手的显著里程碑--国际象棋的DeepBlue和围棋的AlphaGo。直到最近，这种以（学习）价值函数为辅助的强大搜索框架还仅限于完全信息游戏。由于许多有趣的问题没有为代理人提供完美的环境信息，这是一个令人遗憾的限制。这篇论文向读者介绍了不完全信息博弈的健全搜索。

原文题目：Search in Imperfect Information Games

原文：From the very dawn of the field, search with value functions was a fundamental concept of computer games research. Turing's chess algorithm from 1950 was able to think two moves ahead, and Shannon's work on chess from 1950 includes an extensive section on evaluation functions to be used within a search. Samuel's checkers program from 1959 already combines search and value functions that are learned through self-play and bootstrapping. TD-Gammon improves upon those ideas and uses neural networks to learn those complex value functions -- only to be again used within search. The combination of decision-time search and value functions has been present in the remarkable milestones where computers bested their human counterparts in long standing challenging games -- DeepBlue for Chess and AlphaGo for Go. Until recently, this powerful framework of search aided with (learned) value functions has been limited to perfect information games. As many interesting problems do not provide the agent perfect information of the environment, this was an unfortunate limitation. This thesis introduces the reader to sound search for imperfect information games.

不完美信息游戏中的搜索.pdf

猜你喜欢

用Spark机器学习数据流水线进行广告检测
Kappa:比Lambda更好更灵活的实时处理架构
为什么说Storm比Hadoop 快？
Flink常见的关键技术与特性详解
大数据框架对比：Hadoop、Storm、Samza、Spark和Flink
酒店业大数据应用趋势：数据将更具可管理性
大数据产业未来方向何在？业内人士给出10个“数据观”
异构数据中心的简化与安全保护
高速公路视图大数据处理应用探讨
用户群体画像功能深度解析
数据分析师的必读书单
帆软荣获2016”IT印象”最具影响力商业智能品牌
详细解读五大企业日志管理工具
有哪些传统数据科学技术被大众媒体称为人工智能（AI）？
数据中心已然“玉树临风”，你还在用老的套路备份吗？
【1971-2050 计算革命简史】从摩尔定律到“消失”的计算机
美数科技创始人范昂：广告只是表现，“信息流+大数据”才是数字营销的未来
打破TPCx-BB测试记录又怎样，会玩Hadoop大数据应用吗?
Hadoop面试中6个常见的问题及答案
从零开始，构建数据化运营体系

zl程序教程

当前栏目

不完美信息游戏中的搜索

相关文章