Converting model to TF2.x mask problem

Lig789 · Post by **Lig789** » Tue Aug 18, 2020 2:37 pm

After updating FS 2.0, It started of converting model for tensor flow 2.x version, Kindly help me with solving this problem. I am stuck with alignment files when I never used it in original training (only needed when using mask etc).

when I created alignment file for conversions from original face folder, I get the error for "mask" but I didn't use any mask in original Dlight training. If I do not include any alignment file it asks for alignment file. If I include alignment it is giving me this error. I dont even know what mask if used any. But I didn't use any and it wants alignment file. should I go back to FS v1? It does go through all the images but when it reads alignment files for A and B it crushes.

Code: Select all

INFO     Reading alignments from A and B then 
Following error: 
08/18/2020 19:42:16 CRITICAL Error caught! Exiting...
08/18/2020 19:42:16 ERROR    Caught exception in thread: '_training_0'
08/18/2020 19:42:21 ERROR    Got Exception on main handler:
Traceback (most recent call last):
File "D:\faceswap\lib\cli\launcher.py", line 156, in execute_script
process.process()
File "D:\faceswap\scripts\train.py", line 165, in process
self._end_thread(thread, err)
File "D:\faceswap\scripts\train.py", line 205, in _end_thread
thread.join()
File "D:\faceswap\lib\multithreading.py", line 121, in join
raise thread.err[1].with_traceback(thread.err[2])
File "D:\faceswap\lib\multithreading.py", line 37, in run
self._target(*self._args, **self._kwargs)
File "D:\faceswap\scripts\train.py", line 227, in _training
raise err
File "D:\faceswap\scripts\train.py", line 216, in _training
trainer = self._load_trainer(model)
File "D:\faceswap\scripts\train.py", line 263, in _load_trainer
trainer = trainer(model,
File "D:\faceswap\plugins\train\trainer\original.py", line 10, in __init__
super().__init__(*args, **kwargs)
File "D:\faceswap\plugins\train\trainer\_base.py", line 85, in __init__
self._get_alignments_data())
File "D:\faceswap\plugins\train\trainer\_base.py", line 120, in _get_alignments_data
retval["masks"] = alignments.masks
File "D:\faceswap\plugins\train\trainer\_base.py", line 1090, in masks
retval = {side: self._get_masks(side, detected_faces)
File "D:\faceswap\plugins\train\trainer\_base.py", line 1090, in <dictcomp>
retval = {side: self._get_masks(side, detected_faces)
File "D:\faceswap\plugins\train\trainer\_base.py", line 1114, in _get_masks
mask = face.mask[self._config["mask_type"]]
KeyError: None
08/18/2020 19:42:21 CRITICAL An unexpected crash has occurred. Crash report written to 'D:\faceswap\crash_report.2020.08.18.194216335184.log'. You MUST provide this file if seeking assistance. Please verify you are running the latest version of faceswap before reporting.

Post by **torzdf** » Tue Aug 18, 2020 2:48 pm

Please provide this file:

D:\faceswap\crash_report.2020.08.18.194216335184.log

Lig789 · Post by **Lig789** » Wed Aug 19, 2020 11:15 am

Successfully, I get two files converted and they are "dlight.h5" and "dlight_state.json" and the rest is error. In original training I did not use any mask and left alignment files empty but in conversion for TF 2.X models I get this following error, when it reads newly created alignment file{note: FS 2.0, does not allow me to choose "None" as mask in global setting, it always goes back to "extended", when I tried to set none before clicking on train}:

Korben · Post by **Korben** » Wed Aug 19, 2020 3:29 pm

Same kind of problem, did not need alignments before updating.

Post by **torzdf** » Thu Aug 20, 2020 12:08 am

[mention]Korben[/mention]
Please could you provide both of the <model_name>_state.json file from your new and archived model folders

Post by **torzdf** » Thu Aug 20, 2020 12:12 am

Lig789 wrote: ↑Wed Aug 19, 2020 11:15 am
{note: FS 2.0, does not allow me to choose "None" as mask in global setting, it always goes back to "extended", when I tried to set none before clicking on train}:

This looks like a bug. I will investigate tomorrow.

Korben · Post by **Korben** » Thu Aug 20, 2020 2:40 am

torzdf wrote: ↑Thu Aug 20, 2020 12:08 am
@Korben
Please could you provide both of the <model_name>_state.json file from your new and archived model folders

Forgot to add the same thing lig789 just posted. I also noticed that if I tried to select none as mask it s always at extended when I come back to that screen.

But your post made me curious and I took a look the state files and saw that the new model was like this

Code: Select all

 "penalized_mask_loss": true,

compared to false in the original. So I tweaked it and it worked. I made an other test where I deleted the new model turned penalized_mask_loss off in the global config and redid the upgrade and it got rid of the error too.

BUT.......the model has zero training. Analysis only shows the new sessions starting at 22 in my case. Pretty sure I,m being blind right now but I dont see how to attach files.

Post by **torzdf** » Thu Aug 20, 2020 9:09 am

Ok, I suspect I know why this happens. Will review today.

Post by **torzdf** » Thu Aug 20, 2020 11:27 am

FYI: The mask reverting to "extended" issue was just a GUI display issue and has been fixed in the latest update.

I am still looking at the other issue.

Post by **torzdf** » Thu Aug 20, 2020 12:24 pm

This issue should now be fixed....

The easiest way to get your model updated is to delete the new folder that was created before (and doesn't work).

Rename the "_archived" folder back to the the original folder name.

Run training again.

Any problems, let me know.

Korben · Post by **Korben** » Thu Aug 20, 2020 2:56 pm

ok so updated everything and now it gets the model running on first try.

But still in my case its like the first training. I dont know if its intended in the new version but dfl_h128_decoder_A.h5, and dfl_h128_decoder_B.h5 are not present in the new folder. Judging by the file size it all seems to be in the dfl_h128_encoder.h5 file now.

Post by **torzdf** » Thu Aug 20, 2020 3:00 pm

No, it should continue from where it left off, but if it's LIAE architecture, then it may be problematic....

If you can zip up the archived model folder and share it, I may be able to take a look.

Lig789 · Post by **Lig789** » Thu Aug 20, 2020 4:00 pm

torzdf wrote: ↑Thu Aug 20, 2020 11:27 am
FYI: The mask reverting to "extended" issue was just a GUI display issue and has been fixed in the latest update.

I am still looking at the other issue.

Thank you for updates, but there are new problems too. In GUI train tab, there used to be GPU saving section, and it is missing. Other problem is obviously posted above .json, I depend on GPU saving but I can not run without it.

For the time being, until these bugs are fixed, can I run FS v1 in parallel to FS v2 without influencing dependencies and packages (python38 in roaming folder, in anaconda naviagtor or miniconda3 method) which method is best to use both versions in same pc ?

Code: Select all

git clone --depth 1 https://github.com/deepfakes/faceswap.git

Using this I get latest but How to get FS V1 ? into the link above. or should I use archive installer with miniconda 3 while FS v2 running in Anaconda naviagtor ?

Post by **torzdf** » Thu Aug 20, 2020 4:10 pm

We cannot currently bring the 2 VRAM saving options across to Tensorflow 2 as they are fundamentally incompatible.

However, you can enable "Mixed Precision" (In Settings > Train Options) to get a similar (or even better) level of VRAM savings. Try this, and see if it resolves your VRAM issues. Unfortunately this can only be set for new models at the moment (This is a Tensorflow, not a Faceswap limitation).

The best way to have both versions of Faceswap installed are to use the main installer for version 2, and the version 1 installer (https://github.com/deepfakes/faceswap/r ... tag/v1.0.0) for Version 1. Just make sure to select a different folder name and environment name for each of the installs.

Korben · Post by **Korben** » Sat Aug 22, 2020 5:08 pm

torzdf wrote: ↑Thu Aug 20, 2020 3:00 pm
No, it should continue from where it left off, but if it's LIAE architecture, then it may be problematic....

If you can zip up the archived model folder and share it, I may be able to take a look.

I was able to reproduce it.
In faceswapV1 I make a Dfl-H128 model with no mask and then try to upgrade and it just ignores the previous progress. If I make it using a mask it seems to upgrade and resume correctly.

A small 2k iteration sample
removed link after bug was fixed

Korben · Post by **Korben** » Wed Aug 26, 2020 7:07 pm

am I the only one with this problem? Models all resuming at correct iteration but loss starts from scratch.

Post by **torzdf** » Thu Aug 27, 2020 8:35 am

I have not had a chance to look at this yet. I may get a chance today, and will let you know if so.

Post by **torzdf** » Sun Aug 30, 2020 10:44 pm

There was a bug in dfl_h128 update code. This has been fixed in latest commit.

Korben · Post by **Korben** » Mon Sep 07, 2020 1:18 am

torzdf wrote: ↑Sun Aug 30, 2020 10:44 pm
There was a bug in dfl_h128 update code. This has been fixed in latest commit.

I can confirm the new update fixes the issue I had.

Faceswap Forum

Converting model to TF2.x mask problem

Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem

Re: Converting model to TF2.x mask problem