- The weights should be initialized randomly to break symmetry.
- However, it's okay to initialize the biases to zeros. Symmetry is still broken so long as the weights are initialized randomly.
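- Initializing weights to very large random values doesn't work well: large pre-activations can saturate units such as sigmoid or tanh, which leaves very small gradients and slows learning.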
- Initializing with small random values should do better, because the activations start outside the saturated regime.
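
The points above can be checked with a small NumPy sketch. This is only an illustration under assumptions that are not from the post: a single tanh hidden layer, a hypothetical helper `init_params`, and arbitrary scales (0, 0.01, 10) chosen to stand in for "zero", "small" and "very large" weights.

```python
import numpy as np

def init_params(n_in, n_hidden, scale, seed=0):
    """Hypothetical helper: W = scale * N(0, 1), b = 0.

    scale=0.0 gives all-zero weights, i.e. no symmetry breaking.
    """
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((n_hidden, n_in)) * scale
    b = np.zeros((n_hidden, 1))  # zero biases are fine when W is random
    return W, b

def hidden_activations(W, b, X):
    """Forward pass through one tanh hidden layer."""
    return np.tanh(W @ X + b)

# Toy input: 3 features, 5 examples (made-up data for illustration).
X = np.random.default_rng(1).standard_normal((3, 5))

A_zero = hidden_activations(*init_params(3, 4, scale=0.0), X)
A_small = hidden_activations(*init_params(3, 4, scale=0.01), X)
A_large = hidden_activations(*init_params(3, 4, scale=10.0), X)

# With zero weights every hidden unit computes the same thing.
rows_identical_zero = np.allclose(A_zero, A_zero[0])
# With random weights (and zero biases) the units differ.
rows_identical_small = np.allclose(A_small, A_small[0])

# The local gradient of tanh is 1 - a^2; it shrinks as units saturate.
grad_small = 1 - A_small ** 2
grad_large = 1 - A_large ** 2

print("zero init, units identical: ", rows_identical_zero)
print("small init, units identical:", rows_identical_small)
print("mean tanh gradient, small init:", grad_small.mean())
print("mean tanh gradient, large init:", grad_large.mean())
```

With zero weights the hidden units are indistinguishable, so they would receive identical updates. With small random weights and zero biases they already differ, and the mean local gradient stays higher than with the large scale.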