A few PyTorch pitfalls
1.
tensor.detach() creates a tensor that shares storage with tensor but does not require grad; the result is cut off from the computation graph.
tensor.clone() creates a copy of tensor (with its own storage) that keeps the original tensor's requires_grad setting; the copy is still part of the computation graph it came from, so gradients flow back to the original.
tensor.data returns a new tensor that shares storage with tensor but always has requires_grad=False; unlike detach(), in-place changes made through it are not reported to autograd, which can silently produce wrong gradients.
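A minimal sketch of these three behaviors, assuming a recent PyTorch (the tensor names x, d, c, a are just for illustration):

```python
import torch

x = torch.ones(3, requires_grad=True)

d = x.detach()   # shares storage with x, requires_grad=False, cut from the graph
c = x.clone()    # own storage, requires_grad=True here, still connected to x in the graph
a = x.data       # shares storage with x, always requires_grad=False

print(d.requires_grad, c.requires_grad, a.requires_grad)  # False True False

# clone() keeps the graph: gradients flow back to x
c.sum().backward()
print(x.grad)    # tensor([1., 1., 1.])

# In-place edits through the detached view also change x (shared storage),
# which is the classic pitfall:
d[0] = 100.0
print(x[0])      # 100., even though we never touched x directly
```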
2.
The gradient can be understood through a first-order approximation: for a small change in a variable, the output changes by roughly the gradient times that change, so the gradient with respect to a variable measures how sensitive the output is to it.
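A minimal sketch of this first-order view; the scalar function f(x) = x^3 + 2x is just an illustrative choice:

```python
import torch

x = torch.tensor(2.0, requires_grad=True)
f = x ** 3 + 2 * x            # f(x) = x^3 + 2x
f.backward()                  # df/dx = 3x^2 + 2 = 14 at x = 2

eps = 1e-3
with torch.no_grad():
    f_true = (x + eps) ** 3 + 2 * (x + eps)
    f_lin = f + x.grad * eps  # first-order (linear) approximation of f(x + eps)

print(x.grad)                 # tensor(14.)
print(f_true - f_lin)         # small residual, ~O(eps^2)
```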
