
Required image format for getting FID and KID. #17

Open
jonatelintelo opened this issue Oct 25, 2023 · 5 comments
Assignees
Labels
question Further information is requested

Comments

@jonatelintelo

Hi,

I am trying to apply the standardized FID and KID scoring to my own dataset and generators. For this I have a question.

The evaluate_metrics(args) function takes the real-image and generated-image subdirectories and calculates the desired scores between all real and generated images in these subdirs. In what format should these images be passed (.jpg or something different)?

@jonatelintelo jonatelintelo changed the title from "Getting FID and KID for custom dataset." to "Required image format for getting FID and KID." on Oct 25, 2023
@usert5432
Collaborator

Hi @jonatelintelo,

Are you referring to this function?

def evaluate_metrics(path1, path2, kid_size):
    return torch_fidelity.calculate_metrics(
        input1          = path1,
        input2          = path2,
        cuda            = True,
        isc             = False,
        fid             = True,
        kid             = True,
        verbose         = False,
        kid_subset_size = kid_size,
    )

If so, then this is a wrapper around the torch_fidelity package, and it should support all of the image formats that torch_fidelity supports. When we evaluated the FID scores, we used the PNG format, though.
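
As an editorial illustration, a minimal usage sketch of this wrapper; the directory paths and the kid_size value are hypothetical placeholders, and the result keys are the metric names torch_fidelity reports in its output dictionary, to the best of my knowledge:

# Hypothetical example: both inputs are directories of images on disk.
metrics = evaluate_metrics('path/to/real_images', 'path/to/generated_images', kid_size=50)

# torch_fidelity.calculate_metrics() returns a dictionary of metric values.
print(metrics['frechet_inception_distance'])
print(metrics['kernel_inception_distance_mean'])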

@usert5432 usert5432 self-assigned this Oct 26, 2023
@usert5432 usert5432 added the question Further information is requested label Oct 26, 2023
@jonatelintelo
Author

jonatelintelo commented Oct 26, 2023


Thank you for the quick answer. I indeed meant that function.

In this wrapper function from your code, is the input to calculate_metrics a single image or a directory of images? If it is the latter, does that mean you called calculate_metrics for every pair you compared, such as anime to selfie and selfie to anime, etc.?

How many samples were there at minimum in each directory during your metric calculations?

@usert5432
Collaborator

In this wrapper function from your code, is the input to calculate_metrics a single image or a directory of images? If it is the latter, does that mean you called calculate_metrics for every pair you compared, such as anime to selfie and selfie to anime, etc.?

FID scores are evaluated between directories of images. In the case of the Anime <-> Selfie translation, we evaluate scores between the following pairs of directories:

  1. Real Anime Images vs Anime images obtained from selfies
  2. Real Selfie Images vs Selfie images obtained from anime images
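
As a hedged sketch of what those two evaluations might look like with the wrapper above (the directory names are placeholders, not the repository's actual layout):

# Direction 1: real anime images vs anime images generated from selfies.
fid_kid_anime = evaluate_metrics('test/real_anime', 'results/selfie2anime', kid_size=100)

# Direction 2: real selfie images vs selfie images generated from anime images.
fid_kid_selfie = evaluate_metrics('test/real_selfie', 'results/anime2selfie', kid_size=100)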

How many samples were there at minimum in each directory during your metric calculations?

We used the entire test datasets for the evaluation. The smallest test dataset belongs to Anime2Selfie and it has just 100 anime and 100 selfie images.
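
An editorial note, based on my understanding of torch_fidelity rather than on this thread: kid_subset_size cannot exceed the number of images in the smaller input directory, so with the 100-image Anime2Selfie test split the wrapper's kid_size argument would have to be at most 100:

# Assumed constraint: kid_subset_size <= number of images in each input set.
evaluate_metrics('test/real_anime', 'results/selfie2anime', kid_size=100)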

@jonatelintelo
Author

jonatelintelo commented Oct 27, 2023

Thank you for the answers, I can work further with this now.

We used the entire test datasets for the evaluation. The smallest test dataset belongs to Anime2Selfie and it has just 100 anime and 100 selfie images.

I also took a look at another FID implementation, pytorch-fid. For that implementation, the author recommends using at least 2048 images, due to the dimension of the last pooling layer in the Inception network; using fewer might result in scores that no longer correlate with visual quality. Do you know whether torch-fidelity has the same constraint, and did you take this into account for the paper?
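
For context, the pytorch-fid implementation referred to above is typically run from the command line; a sketch of such an invocation (the paths are placeholders, and --dims selects which Inception feature layer is used, with 2048 corresponding to the final pooling layer):

python -m pytorch_fid path/to/real_images path/to/generated_images --dims 2048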

@usert5432
Collaborator

Hi @jonatelintelo,

Do you know whether torch-fidelity has the same constraint, and did you take this into account for the paper?

No, unfortunately, I am not aware whether torch-fidelity makes such a recommendation, and we did not take the pytorch-fid recommendation into account.
