Implement a simple neural network trained with backpropagation in Python 3.
- Feed Forward
- Feed Backward (Backpropagation)
- Update Weights

Iterate the above three steps until convergence.
Figure 1. FeedForward vs. FeedBackward (by Mayank Agarwal)
Backpropagation is the implementation of gradient descent in multi-layer neural networks. Since the same update rule applies recursively at every layer, we can compute the contribution of each weight to the total error backwards from the output layer to the input layer, hence the name backpropagation.
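For intuition, here is a minimal sketch of that backward recursion for a 2-3-1 network with sigmoid activations and squared error. It uses numpy rather than the pure-Python class in this repo, and all variable names are illustrative:

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Forward pass: x -> hidden h -> output y_hat
x = np.array([1.0, 0.0])              # one XOR input
W1 = np.random.randn(2, 3)            # input -> hidden weights
W2 = np.random.randn(3, 1)            # hidden -> output weights
h = sigmoid(x @ W1)
y_hat = sigmoid(h @ W2)
target = np.array([1.0])

# Backward pass: the error term (delta) of each layer is computed
# from the delta of the layer after it, starting at the output layer
delta_out = (y_hat - target) * y_hat * (1 - y_hat)
delta_hidden = (delta_out @ W2.T) * h * (1 - h)

# Each weight's contribution to the total error (the gradients)
grad_W2 = np.outer(h, delta_out)
grad_W1 = np.outer(x, delta_hidden)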
Gradient descent is a first-order iterative optimization algorithm used to find a local minimum (or the global minimum) of a function. The algorithm itself is not hard to understand:
- Start from a point a on the graph of the function;
- From that point, find the direction -∇F(a) in which the function decreases fastest;
- Move a small step γ (down) along this direction to a new point a' = a - γ∇F(a).

By iterating the above three steps, we can find a local minimum (or, for a convex function, the global minimum).
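A minimal runnable sketch of these three steps on a toy one-dimensional function (the function F, starting point, and step size below are illustrative choices, not part of this repo):

# Gradient descent on F(a) = (a - 3)^2, whose minimum is at a = 3
def grad_F(a):
    return 2 * (a - 3)            # derivative of F

a = 0.0                           # 1. start from a point
gamma = 0.1                       # small step size (learning rate)
for _ in range(100):
    a = a - gamma * grad_F(a)     # 2.-3. step along the direction of fastest decrease
print(a)                          # converges close to 3.0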
The advantage of this method is that the gradient is exact and convergence takes few iterations. But when the training dataset is enormous, evaluating the gradient over all data points becomes expensive and training can take a very long time.
Another method is stochastic gradient descent, which samples (with replacement) a subset (one or more points) of the training data to estimate the gradient.
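As an illustration (a sketch fitting y = 2x by least squares; the data and batch size are made up for this example), the full-batch gradient averages over every sample, whereas the stochastic estimate uses only a small sampled subset:

import random

data = [(x, 2.0 * x) for x in range(1, 101)]          # noiseless samples of y = 2x

def mse_gradient(w, samples):
    # d/dw of the mean squared error of the model y = w * x over the given samples
    return sum(2 * (w * x - y) * x for x, y in samples) / len(samples)

w = 0.0
full_gradient = mse_gradient(w, data)                  # uses all 100 points
batch = random.choices(data, k=8)                      # sampled with replacement
stochastic_gradient = mse_gradient(w, batch)           # cheaper estimate of the same quantity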
Stochastic Gradient Descent
Each iteration:
- The input is the features of one training sample; the model computes predict_Y (the outputs) via feed forward.
- Compare all outputs with their corresponding targets and use gradient descent to find the direction in which the weights of each neuron should change.
- Update the weights; the next sample is predicted with the new weights, and so on until convergence.
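A runnable pure-Python sketch of this per-sample update, using a single sigmoid neuron on an OR dataset (one neuron cannot represent XOR, which is why the demo below needs a hidden layer; the hyperparameters here are illustrative):

import math, random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

or_dataset = [[(0, 0), [0]], [(0, 1), [1]], [(1, 0), [1]], [(1, 1), [1]]]
weights = [random.uniform(-1, 1), random.uniform(-1, 1)]
bias = random.uniform(-1, 1)
learning_rate = 0.5

for epoch in range(2000):
    for inputs, targets in or_dataset:
        # feed forward: compute predict_Y for this single sample
        output = sigmoid(sum(w * i for w, i in zip(weights, inputs)) + bias)
        # compare the output with its target to get the weight-change direction (delta rule)
        delta = (targets[0] - output) * output * (1 - output)
        # update the weights immediately; the next sample is predicted with the new weights
        weights = [w + learning_rate * delta * i for w, i in zip(weights, inputs)]
        bias += learning_rate * delta

for inputs, targets in or_dataset:
    prediction = sigmoid(sum(w * i for w, i in zip(weights, inputs)) + bias)
    print(inputs, round(prediction, 2))   # approaches 0, 1, 1, 1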
1.0
Python==3
(demo_1_xor.py) Here we use the XOR dataset for the first demo.
# dataset = [ [(inputs), [outputs]], ... ]
train_dataset = [
[(1, 0), [1]],
[(0, 0), [0]],
[(0, 1), [1]],
[(1, 1), [0]]
]
test_dataset = [
[(1, 0), [1]],
[(0, 0), [0]]
]
Build the NN model with 1 hidden layer (2-3-1)
(n_inputs(2) -> n_hiddens(3,) -> n_outputs(1))
nn = NeuralNetwork(learning_rate=0.1, debug=False)
nn.add_layer(n_inputs=2, n_neurons=3)
nn.add_layer(n_inputs=3, n_neurons=1)
nn.train(dataset=train_dataset, n_iterations=100, print_error_report=True)
nn.test(test_dataset)
It is just a rough neural network with backpropagation.
- Hidden layers can be added at initialization.
- More activation functions.
Concepts:
- Neural Networks -- Detailed Derivation of Backpropagation, by Mark Chang
- Gradient Descent and Backpropagation, by Ken Chen
- Gradient Descent Optimization Algorithms, by Tommy Huang
- Delta Rule, Wikipedia
- Figure_1: Feedforward vs. Feedbackward
Coding: