Coder Social home page Coder Social logo

aichess's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

aichess's Issues

game.py graphic函数的调用

game.py graphic函数的调用是否都错了?
是否应该是下边这个样子??
if is_shown:
self.graphic(self.board, self.board.id2color[player1.player], self.board.id2color[player2.player])

预测值

作者您好,请问走了一次前向传播的policy_value_fn预测出来的走子概率及状态估值都是Nan是怎么回事啊

python collect.py后输出储存失败

已加载初始模型
F:\aichess-main\pytorch_net.py:75: UserWarning: Implicit dimension choice for log_softmax has been deprecated. Change the call to include dim=X as an argument.
policy = F.log_softmax(policy)
D:\Anaconda\envs\chess\lib\site-packages\torch\nn\functional.py:1949: UserWarning: nn.functional.tanh is deprecated. Use torch.tanh instead.
warnings.warn("nn.functional.tanh is deprecated. Use torch.tanh instead.")
走一步要花: 5.746572971343994
走一步要花: 5.981132745742798
F:\aichess-main\zip_array.py:51: VisibleDeprecationWarning: Creating an ndarray from ragged nested sequences (which is a list-or-tuple of lists-or-tuples-or ndarrays with different lengths or shapes) is deprecated. If you meant to do this, you must specify 'dtype=object' when creating the ndarray.
return np.array(zip_res)
存储失败

半精度的计算速度问题

with autocast(): #半精度fp16

在使用自动精度之前,单步collect速度是20s左右,但是当我改成浮点固定精度float32并取消autocast后,单步速度到了16s左右,我猜是自动适配精度其实是额外的消耗,仅个人猜测,不知道有没有人跟我一样的实验结果?

关于gpu调用问题

大神的代码写的非常精巧,因为我是个小白,我看了您的代码不太理解是怎么调用的gpu,我感觉tensor还是跑在cpu上饿。。。

合法动作的问题

大佬您好,我在看您写的Alpha chess,并按照您的意思,在自己写一个。
但是在collect过程中,他会做出不合法的动作,然后报错。
找了半天也没找到在哪里出问题了。
就是在您的这个demo里面,网络直接输出2084个动作,但是这些动作也是有不合法的,这不合法的动作,在哪里去掉或者约束呢。
因为马上要交作业了,所以有点着急请教您。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.