Loss can't go down, spent over 48 hrs?

If training is failing to start, and you are not receiving an error message telling you what to do, tell us about it here


Forum rules

Read the FAQs and search the forum before posting a new topic.

This forum is for reporting errors with the Training process. If you want to get tips, or better understand the Training process, then you should look in the Training Discussion forum.

Please mark any answers that fixed your problems so others can find the solutions.

adam_macchiato
Posts: 16
Joined: Tue Jul 26, 2022 5:26 am
Has thanked: 4 times

Loss can't go down, spent over 48 hrs?

Post by adam_macchiato »

Hi, I am using Phaze-A for training with the StoJo preset. It has been training for 4 days, but the loss has been stuck around 0.017-0.016 for the last 48 hours and still won't go down.

Here are my settings:

RTX 3080
Phaze-A StoJo preset (efficientnet v2_s)
Learning rate: 4.5e, EE: -5
Mixed Precision and NaN protection on
Face A and B: both over 10K PNGs
Iterations: 260K so far

But my other PC with a 3090 (efficientnet v2_l), same settings, is already at loss 0.014 after only 67K iterations. So what should I do?

One more question: the RTX 3080 can't use efficientnet v2_l because it runs out of memory, even with the batch size lowered to 2 and the learning rate at 3.5e. How can I make it work? The 3090's results are much better than the 3080's.

thank you.

torzdf
Posts: 2665
Joined: Fri Jul 12, 2019 12:53 am
Answers: 159
Has thanked: 131 times
Been thanked: 625 times

Re: Loss can't go down, spent over 48 hrs?

Post by torzdf »

adam_macchiato wrote: Tue Jul 26, 2022 5:37 am

Hi, I am using Phaze-A for training with the StoJo preset. It has been training for 4 days, but the loss has been stuck around 0.017-0.016 for the last 48 hours and still won't go down.

Here are my settings:

RTX 3080
Phaze-A StoJo preset (efficientnet v2_s)
Learning rate: 4.5e, EE: -5
Mixed Precision and NaN protection on
Face A and B: both over 10K PNGs
Iterations: 260K so far

But my other PC with a 3090 (efficientnet v2_l), same settings, is already at loss 0.014 after only 67K iterations. So what should I do?

Don't compare loss values between different model settings. They are not directly comparable; the raw numbers are meaningless. All that matters is that they are going down. You cannot normally tell this from the Graph tab, as it is zoomed out too far, but the graph pop-out in the Analysis page will let you zoom in on the last 10,000 or so iterations and look at the rolling average. See here: viewtopic.php?t=146#monitor
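
If you want to sanity-check the trend outside the GUI, here is a minimal sketch of the idea: smooth the raw loss with a rolling average and compare the current value against a few thousand iterations ago. This is not part of faceswap itself; the "loss_a.csv" file name and the "loss" column are placeholders for however you export your session stats.

# Minimal sketch (not part of faceswap): smooth the loss with a rolling
# average to see whether it is still trending down. Assumes you have
# exported per-iteration loss values to a CSV -- "loss_a.csv" and the
# "loss" column are placeholders for whatever your export contains.
import csv

WINDOW = 500  # number of iterations to average over

losses = []
with open("loss_a.csv", newline="") as handle:
    for row in csv.DictReader(handle):
        losses.append(float(row["loss"]))

if len(losses) < WINDOW:
    raise SystemExit(f"Need at least {WINDOW} loss values to smooth")

# Rolling average: entry i covers the WINDOW iterations ending at i.
rolling = [sum(losses[i - WINDOW:i]) / WINDOW
           for i in range(WINDOW, len(losses) + 1)]

# Compare the smoothed loss now against ~10k iterations ago.
lookback = min(10_000, len(rolling) - 1)
print(f"rolling avg {lookback} iterations ago: {rolling[-1 - lookback]:.5f}")
print(f"rolling avg now:                       {rolling[-1]:.5f}")

If the second number is still (even slightly) lower than the first, the model is still learning and you just need patience.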

One more question: the RTX 3080 can't use efficientnet v2_l because it runs out of memory, even with the batch size lowered to 2 and the learning rate at 3.5e. How can I make it work? The 3090's results are much better than the 3080's.

Learning rate does not impact VRAM usage. However, efficientnet v2_l is a big model, so I am not surprised it runs out of VRAM. It also has an input size of 448px, which is probably far larger than you require. Try using v2_s and lowering the encoder scaling.
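
For a rough sense of why the input size matters so much: activation memory in a convolutional encoder grows roughly with the square of the input resolution, so dropping the input size (or the encoder scaling) frees VRAM quickly. A back-of-envelope sketch, purely illustrative; the smaller sizes below are example values, not the exact defaults of any preset.

# Back-of-envelope illustration only, not an exact VRAM calculation:
# activation memory in a convolutional encoder grows roughly with
# input height x width, so a 448px input is far heavier than a smaller one.
def relative_activation_cost(input_px: int, reference_px: int = 448) -> float:
    """Activation footprint of an input_px input relative to reference_px."""
    return (input_px ** 2) / (reference_px ** 2)

# Example sizes, not the defaults of any particular preset.
for size in (448, 384, 320, 256):
    print(f"{size}px input ~ {relative_activation_cost(size):.2f}x "
          f"the activations of a 448px input")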

My word is final
