I know about as little there is to know, and I have two questions relating to speed/quality and batch size.
The guide states that
"while large batches train faster, batch sizes in the 8 to 16 range likely produce better quality."
Is there any reason that the number starts at 8? Would it be even better quality if I did 7 or 6.....or even 1?Is training different, although closely related to iterations? Or does training = iterations? The guide clearly suggests that larger batch size trains faster, but smaller produces better quality. For my project with 500 input A faces and 3k input B faces, when I'm running a batch size of 50, I'm getting about 240 iterations per hour, at a batch size of 30 -Im getting about 500 iterations per hour. At a batch size of 6 -I'm getting 900 iterations per hour.
So did more "training" occur after 1 hour with my high batchsize low iteration count, or did more "training" occur from my low batchsize high iteration count?