how to elegantly duplicate a graph (neural network)
I have a graph (network) which consists of layers, which in turn contain nodes (neurons). I would like to write a procedure that duplicates the entire graph in the most elegant way possible -- i.e. with minimal or no overhead added to the structure of the node or layer.
Or to put it another way: the procedure itself may be complex, but the complexity should not "leak" into the structures. They should not become complex just because they are copyable.
I wrote the code in C#, so far it looks like this:
- neuron has an additional field -- copy_of, which is a pointer to the neuron it was copied from; this is my additional overhead
- neuron has a parameterless method Clone()
- neuron has a method Reconnect() -- which exchanges a connection from a "source" neuron (parameter) to a "target" neuron (parameter)
- layer has a parameterless method Clone() -- it simply calls Clone() for all its neurons
- network has a parameterless method Clone() -- it calls Clone() for every layer, then iterates over all neurons, builds the neuron=>copy_of mappings, and calls Reconnect() to exchange all the "wiring"
I hope my approach is clear. The question is -- is there a more elegant method? In particular, I don't like keeping an extra pointer in the neuron class just for the case of being copied. I would like to gather that data in one place (the network's Clone) and then dispose of it completely (the Clone method cannot take an argument, though).
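For what it's worth, the mapping you want can live entirely inside the network's Clone: build a dictionary from original neuron to copy, then rewire the copies through it, so no copy_of field is needed on the neuron. A minimal sketch of that idea (hypothetical class names, Python used for brevity even though the original is C#):

```python
class Neuron:
    def __init__(self):
        self.inputs = []  # neurons this neuron receives connections from

class Layer:
    def __init__(self, neurons):
        self.neurons = neurons

class Network:
    def __init__(self, layers):
        self.layers = layers

    def clone(self):
        # The original->copy mapping is local to this method,
        # so Neuron needs no copy_of field.
        mapping = {}
        for layer in self.layers:
            for neuron in layer.neurons:
                mapping[neuron] = Neuron()
        # Rewire every copy through the mapping.
        for neuron, copy in mapping.items():
            copy.inputs = [mapping[src] for src in neuron.inputs]
        return Network([Layer([mapping[n] for n in layer.neurons])
                        for layer in self.layers])
```

The same shape translates directly to C# with a `Dictionary<Neuron, Neuron>` declared inside `Network.Clone()`.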
Use a hash table for copying a general graph:
h = new HashTable()
def copyAll(node):
    if h has key node: return h[node]
    copy = node.copy()
    h[node] = copy
    for each successor of node:
        copy.addSuccessor(copyAll(successor))
    return copy
Your particular graph seems to be acyclic with special structure so you don't need a hash table (you can use an array instead) and the approach you are describing seems to be the best way to copy it.
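A runnable Python version of the pseudocode above might look like this (the `Node` class is a made-up stand-in for your neuron type):

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.successors = []

def copy_all(node, h=None):
    # h maps each original node to its copy, shared across the recursion.
    if h is None:
        h = {}
    if node in h:          # already copied: handles shared and cyclic references
        return h[node]
    copy = Node(node.value)
    h[node] = copy         # register before recursing so cycles terminate
    for succ in node.successors:
        copy.successors.append(copy_all(succ, h))
    return copy
```

Registering the copy in the table before recursing is what makes this safe even for cyclic graphs; for your layered acyclic network it simply deduplicates shared neurons.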
If you are writing a neural network, you should just use vectors and matrices of floats to represent the neurons. It may seem less elegant now, but trust me, it's much more elegant (and several orders of magnitude faster, too).
Consider a neural network with 2 layers: the input (n nodes) and the output (m nodes). Now suppose we have a vector of floats called in that represents the values of the input layer, and we want to compute a vector called out that represents the values of the output layer. The neural network itself consists of an m by n matrix M of floats, where M[j][i] represents how strong the connection between input node i and output node j is. The beauty is that evaluating the network is the same as a matrix multiplication followed by applying the activation function to every element of the result vector:
out = f(M*in)
where f is the activation function and * is matrix multiplication. This is neural network evaluation in one line! You cannot get it this elegant with an OO design of a neural network.
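As a concrete illustration (NumPy, with a made-up weight matrix and input, and the logistic sigmoid as the activation function):

```python
import numpy as np

def f(x):
    # Activation function: the logistic sigmoid, applied elementwise.
    return 1.0 / (1.0 + np.exp(-x))

# 3 input nodes, 2 output nodes, so M is 2x3 (rows = outputs, columns = inputs).
M = np.array([[0.5, -0.2,  0.1],
              [0.3,  0.8, -0.5]])
in_ = np.array([1.0, 0.0, 1.0])   # values of the input layer

out = f(M @ in_)   # the whole layer evaluated in one line
print(out.shape)   # (2,)
```

Stacking more layers is just repeating the same line with each layer's matrix, which is why the vector/matrix representation stays this compact.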