I have a question about the size of faces when extracting from a B-side video source.
I found an answer from Mr. torzdf to a similar past question, but my English is not good enough to fully understand it.
"But (for example) a 256px model takes faces at 256px regardless if they come from a low/high res source. Downsizing a 4K image to 256px will not retain any more detail than using an image that was 256px to start with."
Assuming the image is 1920x1080, the red box is 256x256 and the blue box is 128x128.
Don't worry about the quality. This is only a question about the size of the face. And I am sorry if I am fundamentally wrong.
I think 1 is almost the ideal face size.
Are 2 and 3 fine, even though the face sticks out far beyond the box?
I think 4 is not good.
If I am right and I really need to use 4 (or 2 and 3), should I convert the video to 1080x720 or something else?
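To make the question concrete, here is a rough sketch of the arithmetic I have in mind. The function name `scaled_frame_size` and the 512px face size are made up for illustration, and I do not know whether this kind of conversion is actually needed. It only computes the frame size at which a face would shrink to the model size.

```python
def scaled_frame_size(frame_w, frame_h, face_px, target_px):
    """Return the frame size at which a face of face_px would measure target_px.

    Hypothetical helper for illustration only; it never upscales.
    """
    if face_px <= target_px:
        # Face already fits inside the box, so keep the original frame size.
        return frame_w, frame_h
    scale = target_px / face_px
    return round(frame_w * scale), round(frame_h * scale)


# Assumed example: a 512px face in a 1920x1080 frame, 256px model.
print(scaled_frame_size(1920, 1080, 512, 256))  # (960, 540)
```

So for a face twice as large as the red box, the frame would have to be halved in each dimension, if converting is the right approach at all.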
Thank you for reading my question.