Bottleneck flatten option
This is a new function for me, so as far as i interpret it correctly flatten is more like -> no bottleneck.
Of course flatten happens always at the end of the encoder to make a long 1d vector, normally this goes into a smaller dense bottleneck or pooling bottleneck but with flatten you just pass this enormous vector strait to the dense layers.
I see in most cases it explodes the parameter count.
Can somebody explain what the best use case is for the use of flatten als bottleneck?
Edit 1, seems like it is for the transformers