
torch.nn.Embedding


Official documentation

 Parameters

  • num_embeddings (int) – size of the dictionary of embeddings

  • embedding_dim (int) – the size of each embedding vector

  • padding_idx (int, optional) – If specified, the entries at padding_idx do not contribute to the gradient; therefore, the embedding vector at padding_idx is not updated during training, i.e. it remains as a fixed “pad”. For a newly constructed Embedding, the embedding vector at padding_idx will default to all zeros, but can be updated to another value to be used as the padding vector. (See the sketch after this list.)

  • max_norm (float, optional) – If given, each embedding vector with norm larger than max_norm is renormalized to have norm max_norm.

  • norm_type (float, optional) – The p of the p-norm to compute for the max_norm option. Default 2.

  • scale_grad_by_freq (boolean, optional) – If given, this will scale gradients by the inverse of frequency of the words in the mini-batch. Default False.

  • sparse (bool, optional) – If True, gradient w.r.t. weight matrix will be a sparse tensor. See Notes for more details regarding sparse gradients.
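
A minimal sketch of how padding_idx and max_norm behave (the sizes here, 5 words of dimension 3, are arbitrary choices for the demo):

import torch
from torch import nn

# padding_idx: index 0 is reserved as the pad token.
embedding = nn.Embedding(5, 3, padding_idx=0)
print(embedding.weight[0])          # initialized to all zeros

# Gradients never flow into the pad row, so it stays fixed during training.
out = embedding(torch.LongTensor([0, 2, 0]))
out.sum().backward()
print(embedding.weight.grad[0])     # all zeros: the pad entry received no gradient

# max_norm, by contrast, renormalizes rows at lookup time: any accessed row
# whose norm exceeds max_norm is clipped to that norm.
clipped = nn.Embedding(5, 3, max_norm=1.0)
vecs = clipped(torch.LongTensor([1, 2]))
print(vecs.norm(dim=1))             # every norm is <= 1.0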


Examples:

import torch
from torch import nn
embedding = nn.Embedding(5, 4)  # assume the dictionary holds only 5 words; each word vector has dimension 4
word = [[1, 2, 3],
        [2, 3, 4]]  # each number stands for one word, e.g. {'!': 0, 'how': 1, 'are': 2, 'you': 3, 'ok': 4};
                    # the indices must stay in the range 0–4, since the dictionary defined above has only 5 words
embed = embedding(torch.LongTensor(word))
print(embed)
print(embed.size())

Result:

tensor([[[-0.0436, -1.0037,  0.2681, -0.3834],
         [ 0.0222, -0.7280, -0.6952, -0.7877],
         [ 1.4341, -0.0511,  1.3429, -1.2345]],

        [[ 0.0222, -0.7280, -0.6952, -0.7877],
         [ 1.4341, -0.0511,  1.3429, -1.2345],
         [-0.2014, -0.4946, -0.0273,  0.5654]]], grad_fn=<EmbeddingBackward0>)
torch.Size([2, 3, 4])
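
Notice in the result that shared indices produce identical rows: index 2 appears in both input sequences, and both positions map to the same vector. An Embedding layer is just a trainable lookup table over its weight matrix; a quick check, reusing the embedding from the example above:

# Each output vector is simply a row of the weight matrix:
# embedding(i) returns embedding.weight[i].
row = embedding(torch.LongTensor([2]))
print(torch.equal(row[0], embedding.weight[2]))  # True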