one vs multiple models for a swap

Want to understand the training process better? Got tips for which model to use and when? This is the place for you


Forum rules

Read the FAQs and search the forum before posting a new topic.

This forum is for discussing tips and understanding the process involved with Training a Faceswap model.

If you have found a bug are having issues with the Training process not working, then you should post in the Training Support forum.

Please mark any answers that fixed your problems so others can find the solutions.

Post Reply
User avatar
korrupt78
Posts: 59
Joined: Wed Jan 29, 2020 1:34 am
Answers: 0
Has thanked: 4 times
Been thanked: 1 time

one vs multiple models for a swap

Post by korrupt78 »

I've heard some say that it helps to split the training set into subsets based on face orientation or expression, and then train a separate model for each subset.

Does that make any sense? Is it always a good idea, or only in some circumstances?

User avatar
DeliciousCaramels
Posts: 11
Joined: Wed Mar 05, 2025 7:57 am
Answers: 0

Re: one vs multiple models for a swap

Post by DeliciousCaramels »

I am no expert but I have been thinking about this question a lot.

I assume the modelling is built to expect the face in different orientations, and as you know the face is a 3d object. I think if you do it with discrete orientations as you suggest the model makes assumptions about how the face will look at certain angles, despite it not being able to display or confirm, and, when you actually show it that data "for real" it is difficult for the model as it has to relearn things it has assumed.

I think a better strategy is to feed the model data in chunks. I like to sort the faces by average alignments and I split the data fed to the model based on these bins, eg, if I have 100k images extracted and 6 bins sorted I just divide each binning by 4, put them into different directories and just feed the different sets into the model as I go. Seems to work? Not sure if I am rubbing snake oil on training but it seems intuitively like a good idea.

Last edited by DeliciousCaramels on Thu Apr 10, 2025 10:37 pm, edited 1 time in total.
User avatar
torzdf
Posts: 2790
Joined: Fri Jul 12, 2019 12:53 am
Answers: 159
Has thanked: 142 times
Been thanked: 649 times

Re: one vs multiple models for a swap

Post by torzdf »

Theoretically splitting the data and training multiple models should lead to a better result (you are storing less information in the latent space).

However, I suspect that the time/work involved vs the perceived improvement would not make it worthwhile.

If you do try to experiment with this, I would be interested to see the results.

My word is final

Post Reply