Today I was helping a friend debug her TensorFlow program and ran into a NaN problem. It took a long while to track down, but it was finally solved.
The culprit turned out to be tf.sqrt. See this Stack Overflow question: Why is my loss function returning nan?
The explanation given there:
It was coming from the fact that x was approaching a tensor with all zeros for entries. This was making the derivative of sigma wrt x a NaN. The solution was to add a small value to that quantity.
In other words, the problem is tf.sqrt(x) when x is 0: the derivative of sqrt(x) is 1/(2*sqrt(x)), which is undefined at x = 0, so the gradient becomes NaN.
The fix: add a tiny constant so x can never be exactly 0, i.e.:
tf.sqrt(x+1e-8)
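To see why the epsilon helps, here is a minimal sketch of the math, using numpy instead of TensorFlow so it runs without the framework installed. The derivative of sqrt(x) is 1/(2*sqrt(x)), which blows up at x = 0; adding the same 1e-8 used above keeps it finite:

```python
import numpy as np

def sqrt_grad(x):
    # Analytic derivative of sqrt(x): 1/(2*sqrt(x)).
    # Undefined at x = 0 -- this is what poisons the gradient.
    with np.errstate(divide="ignore"):
        return 1.0 / (2.0 * np.sqrt(x))

bad = sqrt_grad(np.float32(0.0))        # inf: gradient blows up at 0
eps = 1e-8
good = sqrt_grad(np.float32(0.0) + eps) # finite once epsilon is added

print(np.isinf(bad), np.isfinite(good))  # True True
```

Once an inf like this gets multiplied by a zero elsewhere in backprop, it turns into NaN, which is why the loss suddenly reports nan.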
I would also recommend a similar Zhihu question, well worth reading: "Why does training a network with TensorFlow produce loss = nan while accuracy stays stuck at a fixed value?"