to minimize. There is no guarantee that it will bring us to the best solution! It should be differentiable! Credits to Sungjoon (https://github.com/sjchoi86/dl_tutorials)
different is the layer is deep ( > 1 ) So just add the layer ? Many people think of that Between every layer there is some technique to calculate the move of the data Like our brain works
tensor Matrix: 2-dimensional tensor Deep learning processes are flows of tensors A sequence of tensor operations Can also represent a number of machine learning algorithms Credits to Sungjoon (https://github.com/sjchoi86/dl_tutorials)