理解批量归一化参数模型输出

文奇聪明

我有以下型号

根据上图使用keras创建模型后，我有以下模型参数

我的问题是如何将批标准化 1 的参数设为 784。据我了解，批标准化有两个参数，由于我们有 196 个过滤器，我的理解是我们应该有 196 * 2 = 392，但模型输出显示为 784。我不明白这个值是怎么来的？请求提供有关我们如何获得此值的直觉？

Another question is how do we calculate for batch normalization for GRU units we got batch_normalization 2 got 512 parameters. To my understanding GRU has three non linear functions for update gate, relevance gate, and while calculating new cell value. so Here we should have 128 * 3 = 384, but model output as 512. How this value came here?

Thanks for your time and guidence.

Dr. Snoopy

The number of parameters for Batch Normalization is four times the number of input dimensions in the specified normalization axis (by default the last). This corresponds to the gamma and beta parameters, as well as the moving mean and moving variances. You can confirm this in the keras' source code.

要获得 784 BN 参数，您的维度为 784 / 4 = 196 个元素，对应于第一个 BN 层之前的层。对于 GRU 层，BN 有 128 个输入维度，需要 128 x 4 = 512 个参数。

本文收集自互联网，转载请注明来源。

如有侵权，请联系 [email protected] 删除。

编辑于 2021-07-22

我来说两句

0 条评论

登录后参与评论

TOP 榜单

文章

理解批量归一化参数模型输出

理解批量归一化参数模型输出

计算数据帧R中的字符串频率

Android Studio Kotlin：提取为常量

Excel 2016图表将增长与4个参数进行比较

获取并汇总所有关联的数据

如何使用Redux-Toolkit重置Redux Store

http：// localhost：3000 /＃！/为什么我在localhost链接中得到“＃！/”。

将加号/减号添加到jQuery菜单

算术中的c ++常量类型转换

TYPO3：将 Formhandler 添加到新闻扩展

TreeMap中的自定义排序

如何开始为Ubuntu开发

在 Python 2.7 中。如何从文件中读取特定文本并分配给变量

无法使用 envoy 访问 .ssh/config

在Ubuntu和Windows中，触摸板有时会滞后。硬件问题？

遍历元素数组以每X秒在浏览器上显示

在Jenkins服务器中使用Selenium和Ruby进行的黄瓜测试失败，但在本地计算机中通过

警告消息：在matrix（unlist（drop.item），ncol = 10，byrow = TRUE）中：数据长度[16]不是列数的倍数[10]>？

未捕获的SyntaxError：带有Ajax帖子的意外令牌u

如何使用tweepy流式传输来自指定用户的推文（仅在该用户发布推文时流式传输）

尝试在Dell XPS13 9360上安装Windows 7时出错

如果从DB接收到的值为空，则JMeter JDBC调用将返回该值作为参数名称