Handle corrupt images #27

CerebralSeed · 2023-07-20T03:39:03Z

This code will inform the user of a corrupt image in their set and continue training.

lucidrains · 2023-07-20T14:24:06Z

oh nice

not sure if selecting another image is the best way to go about it though

CerebralSeed · 2023-07-21T09:02:02Z

Ideally, we would just skip that index. Perhaps could send an argument out, if that image load was an error. How about this:

    def __getitem__(self, index):
        try:
            path = self.paths[index]
            img = Image.open(path)
            img = self.transform(img)
            err = False
        except:
            print("Error with image", str(path))
            img = None
            err = True
        return img, err

And then in the discriminator trainer:

            if self.unconditional:
                real_images, err = next(dl_iter)
            else:
                result, err = next(dl_iter)
                assert isinstance(result, tuple), 'dataset should return a tuple of two items for text conditioned training, (images: Tensor, texts: List[str])'
                real_images, texts = result
            if not err: 
                # requires grad for real images, for gradient penalty
    
                real_images = real_images.to(self.device)
                real_images.requires_grad_()
                ...

And something similar for the generator, inside of the generate_kwargs.

lucidrains · 2023-07-22T16:51:01Z

@CerebralSeed maybe we should just run through the entire dataset once at init? could be parallelized too with joblib

CerebralSeed · 2023-07-25T15:47:05Z

@lucidrains - been under the weather the last few days and stuck in bed unable to focus. Feeling better now. Just started going through some of the changes you've made and, wow! I've got a lot of catching up to do.

Interesting idea. How long would that take, though, with LAION 5B?

By the way, if you're open to the idea, I can go through the code and make it generalized for the 1D and 3D cases with an added argument on initialization.

Once you get a more finalized version, I'm going to work on incorporating 3D Conv branches on the 2D case as a variant. I've found that fixes the generative model issue with hands and other objects with multiple degrees of freedom on the depth dimension in UNets.

I saw you removed the discriminator learning rate multiplier recently. Did that not work out so well?

Handle corrupt images

24f2bb2

This code will inform the user of a corrupt image in their set and continue training.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle corrupt images #27

Handle corrupt images #27

CerebralSeed commented Jul 20, 2023

lucidrains commented Jul 20, 2023

CerebralSeed commented Jul 21, 2023

lucidrains commented Jul 22, 2023 •

edited

Loading

CerebralSeed commented Jul 25, 2023

Handle corrupt images #27

Are you sure you want to change the base?

Handle corrupt images #27

Conversation

CerebralSeed commented Jul 20, 2023

lucidrains commented Jul 20, 2023

CerebralSeed commented Jul 21, 2023

lucidrains commented Jul 22, 2023 • edited Loading

CerebralSeed commented Jul 25, 2023

lucidrains commented Jul 22, 2023 •

edited

Loading