Introduction: This article explains how to implement the core mechanics of an RNN using numpy. The example code is covered in detail and should be a useful reference for study or work; if you need it, follow along below.
A note up front: the code is meant only to aid understanding. The gradient-descent (training) part is not implemented, and the parameters are simply fixed, which does not affect comprehension. The code implements the RNN forward computation using only the numpy library, so it cannot take advantage of GPU acceleration.
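For reference, what each layer computes at every time step is the standard vanilla-RNN update (written here without bias terms, matching the code below, which also omits them):

h_t^(0) = tanh(W_ih^(0) · x_t + W_hh^(0) · h_{t-1}^(0))
h_t^(l) = tanh(W_ih^(l) · h_t^(l-1) + W_hh^(l) · h_{t-1}^(l)),  for layers l ≥ 1

where W_ih^(l) are the input-to-hidden weights of layer l and W_hh^(l) are its recurrent hidden-to-hidden weights.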
import numpy as np


class Rnn():
    def __init__(self, input_size, hidden_size, num_layers, bidirectional=False):
        self.input_size = input_size
        self.hidden_size = hidden_size
        self.num_layers = num_layers
        self.bidirectional = bidirectional

    def feed(self, x):
        '''
        :param x: [seq, batch_size, embedding]
        :return: out, hidden
        '''
        # x.shape      [seq, batch, feature]
        # hidden.shape [hidden_size, batch]
        # Whh0.shape [hidden_size, hidden_size]  Wih0.shape [hidden_size, feature]
        # Whh1.shape [hidden_size, hidden_size]  Wih1.shape [hidden_size, hidden_size]
        out = []
        # one zero-initialized hidden state per layer
        x, hidden = np.array(x), [np.zeros((self.hidden_size, x.shape[1])) for i in range(self.num_layers)]
        # input-to-hidden weights: layer 0 maps from the feature dim, deeper layers from hidden_size
        Wih = [np.random.random((self.hidden_size, self.hidden_size)) for i in range(1, self.num_layers)]
        Wih.insert(0, np.random.random((self.hidden_size, x.shape[2])))
        # hidden-to-hidden (recurrent) weights, one per layer
        Whh = [np.random.random((self.hidden_size, self.hidden_size)) for i in range(self.num_layers)]
        time = x.shape[0]
        for t in range(time):
            # layer 0 consumes the input at step t; x[t] is transposed to [feature, batch]
            hidden[0] = np.tanh(np.dot(Wih[0], np.transpose(x[t, ...], (1, 0))) +
                                np.dot(Whh[0], hidden[0]))
            # each deeper layer consumes the hidden state of the layer below at the same step
            for layer in range(1, self.num_layers):
                hidden[layer] = np.tanh(np.dot(Wih[layer], hidden[layer - 1]) +
                                        np.dot(Whh[layer], hidden[layer]))
            # collect the top layer's hidden state at every time step
            out.append(hidden[self.num_layers - 1])
        return np.array(out), np.array(hidden)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


if __name__ == '__main__':
    rnn = Rnn(1, 5, 4)
    input = np.random.random((6, 2, 1))
    out, h = rnn.feed(input)
    print(f'seq is {input.shape[0]}, batch_size is {input.shape[1]}', 'out.shape', out.shape, 'h.shape', h.shape)
    # print(sigmoid(np.random.random((2, 3))))

    # element-wise multiplication
    # print(np.array([1, 2]) * np.array([2, 1]))
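As a quick sanity check (a minimal sketch; it only assumes the Rnn class above is defined in the same file), you can verify the output shapes and that tanh keeps every activation inside (-1, 1):

import numpy as np

rnn = Rnn(input_size=3, hidden_size=8, num_layers=2)
x = np.random.random((10, 4, 3))    # [seq=10, batch=4, feature=3]
out, h = rnn.feed(x)

assert out.shape == (10, 8, 4)      # [seq, hidden_size, batch]
assert h.shape == (2, 8, 4)         # [num_layers, hidden_size, batch]
assert np.all(np.abs(out) < 1.0)    # tanh bounds every activation in (-1, 1)

Note the shape convention: this implementation returns out as [seq, hidden_size, batch], with the batch axis last, which differs from the [seq, batch, hidden_size] layout used by frameworks such as PyTorch.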
That concludes this article on implementing the principles of an RNN with numpy.