Teaching Beginners to Master Programming | Learn to Program Blog

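Python Programming Fundamentals and Practice: Table of Contents

"Python Programming Basics and Practice: Catalog"

1. Background: Python is an easy-to-learn, powerful programming language. It has efficient high-level data structures and a simple but effective approach to object-oriented programming. Python's elegant syntax and dynamic typing, together with its interpreted nature, make it an ideal language for scripting and rapid application development in many areas on many platforms. Python is a general-purpose application programming language: it can be used for web development, web scraping, operations, data analysis, artificial intelligence, and more. For example, ...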

Posted by xuepro on June 14, 2018

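Python Programming Fundamentals and Practice: 1.4 Representation of Characters

"Python Programming Basics and Practice: Representation of Char"

Part of the "Python Programming Fundamentals and Practice" series from Teaching Beginners to Master Programming. Reproduction without the author's permission is prohibited; violators will be held legally liable. Follow on Weibo: 教小白精编程. Blog: https://xuepro.xcguan.net 1.4 Representation of characters (char): All data and instructions in a computer are represented as strings of 0s and 1s. Earlier we saw how numbers (Number) are represented with binary bits inside the computer. Likewise, all kinds of characters, such as uppercase and lowercase English letters, digits (0...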

Posted by xuepro on June 14, 2018
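The excerpt above is cut off, so the following is only a minimal sketch of the idea it introduces, not the post's own example. It assumes Python 3, where the built-in `ord` and `chr` convert between a character and its integer code point, and `format` shows that integer as a string of 0s and 1s. The variable names are illustrative.

```python
# Each character maps to an integer code point, which the computer
# ultimately stores as a pattern of bits.
code_upper_a = ord("A")          # character -> integer code point
char_from_97 = chr(97)           # integer code point -> character

# Show the code point of "A" as an 8-bit binary string.
bits_upper_a = format(code_upper_a, "08b")

print(code_upper_a)
print(char_from_97)
print(bits_upper_a)
```

Uppercase and lowercase letters have different code points, which is why `"A" == "a"` is false in Python.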

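Python Programming Fundamentals and Practice: 1.3 Representation of Numbers

"Python Programming Basics and Practice: Representation of Number"

Part of the "Python Programming Fundamentals and Practice" series from Teaching Beginners to Master Programming. Reproduction without the author's permission is prohibited; violators will be held legally liable. Follow on Weibo: 教小白精编程. Blog: https://xuepro.xcguan.net 1.3 Representation of numbers (Number): In a Python program, numbers can be written in several ways. If you are already familiar with binary and hexadecimal, you can skip this section. In everyday life we usually use decimal, which uses 10 different...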

Posted by xuepro on June 14, 2018
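As a small illustration of the point in the excerpt above that one number can be written in several bases, here is a sketch using standard Python 3 literals and built-ins. It is not taken from the post itself; the variable names are made up for the example.

```python
# The same value written in decimal, binary, and hexadecimal.
decimal_value = 10
binary_value = 0b1010     # binary literal, prefix 0b
hex_value = 0xA           # hexadecimal literal, prefix 0x

# All three literals denote the same integer.
same_value = decimal_value == binary_value == hex_value

# Convert an integer to its textual form in another base, and back.
as_binary_text = bin(255)         # text form in base 2
as_hex_text = hex(255)            # text form in base 16
parsed_from_hex = int("ff", 16)   # parse base-16 text into an integer

print(same_value, as_binary_text, as_hex_text, parsed_from_hex)
```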

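Python Programming Fundamentals and Practice: 1.2 Installation and the Interpreter

"Python Programming Basics and Practice"

Part of the "Python Programming Fundamentals and Practice" series from Teaching Beginners to Master Programming. Reproduction without the author's permission is prohibited; violators will be held legally liable. Follow on Weibo: 教小白精编程. Blog: https://xuepro.xcguan.net Installing the Python development environment: There are usually two ways to install a Python development environment. Native installation: install only the basic Python interpreter for your platform, and you can start writing Python programs. Available at https://www.pyt...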

Posted by xuepro on June 13, 2018

Policy Gradient

Policy Gradient: The general case is that when we have an expression of the form - i.e. the expectation of some scalar-valued score function under some probability distribution parameterized...

Posted by xuepro on June 10, 2018

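Entropy, Cross Entropy and KL Divergence

Information entropy is defined as the average amount of information produced by a stochastic source of data. The more probable a random event is, the less information it carries; conversely, the less probable an event is, the more information it carries once it occurs. For example, if you say "a person will eventually die", because this is a certain...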

Posted by xuepro on June 7, 2018
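The relationship in the entropy excerpt above (likely events carry little information, unlikely ones carry a lot, and entropy is the average) can be sketched with the standard Shannon definitions. This is an illustration under that assumption, not code from the post; the function names `self_information` and `entropy` are made up for the example, and information is measured in bits.

```python
import math


def self_information(p: float) -> float:
    """Information, in bits, of an event with probability p (0 < p <= 1)."""
    return math.log2(1.0 / p)


def entropy(probs) -> float:
    """Shannon entropy in bits: the probability-weighted average
    self-information over all outcomes (zero-probability outcomes are skipped)."""
    return sum(p * self_information(p) for p in probs if p > 0)


certain_event_bits = self_information(1.0)   # a certain event
rare_event_bits = self_information(0.125)    # a 1-in-8 event
fair_coin_entropy = entropy([0.5, 0.5])      # two equally likely outcomes

print(certain_event_bits, rare_event_bits, fair_coin_entropy)
```

A certain event yields zero bits, which matches the excerpt's remark that stating something certain conveys no information.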

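Python: Iterators, Iterables, and Generators

iterator, iterable, generator

Source: Advanced Python: Iterators, Iterables, Generators. The relationship among containers, iterables, iterators, and generators is shown in the figure below: list, set, tuple, and dict are all containers. A container is usually an iterable. Any object that can return an iterator is called an iterable. An iterator is an iterable, but the reverse is not necessarily true ...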

Posted by xuepro on June 3, 2018
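The relationships listed in the iterator excerpt above can be checked directly with the abstract base classes in the standard `collections.abc` module. This is a minimal sketch, not the post's own code; `countdown` is a hypothetical generator function added only for illustration.

```python
from collections.abc import Iterable, Iterator

numbers = [1, 2, 3]        # a list is a container and an iterable
numbers_iter = iter(numbers)  # iter() asks the iterable for an iterator

list_is_iterable = isinstance(numbers, Iterable)
list_is_iterator = isinstance(numbers, Iterator)
iter_is_iterable = isinstance(numbers_iter, Iterable)
iter_is_iterator = isinstance(numbers_iter, Iterator)


def countdown(n):
    """Hypothetical generator function: yields n, n-1, ..., 1."""
    while n > 0:
        yield n
        n -= 1


gen = countdown(3)
gen_is_iterator = isinstance(gen, Iterator)  # generators are iterators
first_value = next(gen)                      # advance one step
remaining_values = list(gen)                 # consume what is left

print(list_is_iterable, list_is_iterator, iter_is_iterable, iter_is_iterator)
print(gen_is_iterator, first_value, remaining_values)
```

The list is an iterable but not an iterator, while the object returned by `iter()` is both, which is the "not necessarily the reverse" case from the excerpt.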

AlphaGo algorithm

AlphaGo_IJCAI.pdf; An In-Depth Look at AlphaGo; Google DeepMind's AlphaGo: How it works; AlphaGo-lecture; AlphaGo and Monte Carlo Tree Search; Demystifying the Google DeepMind AlphaGo Go Algorithm; [AlphaGo Zero: Learning ...

Posted by xuepro on June 2, 2018

Monte Carlo Methods

Monte Carlo methods

Previously, we discussed Markov decision processes, and algorithms to find the optimal action-value function and . We used policy iteration and value iteration to solve for the optimal policy. B...

Posted by xuepro on June 2, 2018

Dynamic Programming

Dynamic Programming

Dynamic Programming: Policy Evaluation (Prediction). For a given policy, compute the state-value function. A system of |S| simultaneous linear equations. Solution in matrix notation (complexity: ...

Posted by xuepro on June 2, 2018

