-
Notifications
You must be signed in to change notification settings - Fork 7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
port transforms from the old to the new API #5520
Comments
Following #5521 (comment), I've removed the "help wanted" label for now. Unfortunately, the design isn't as finished as we originally thought. If you have already started, we'll try to merge the PR, but please do not start a new task until we add the "help wanted" label again. |
@pmeier I would like to work on the |
Thanks a lot for the interest @yassineAlouini! Unfortunately, the API is not ready again for significant community contributions. Still you can comment on the issue so we don't forget to ping you when we are ready to go. |
Everything is done here, closing this tracker. Please reopen if something is still missing. |
We are currently in the process of revamping the
transforms
module, but there is still a lot of porting left. The design is now stable enough that the porting process should be manageable for someone not intimately familiar with the new design. Thus, we are actively looking for contributions helping us finishing this.Here is the list of transformations that still need to be ported to achieve feature parity (minus some special transformations) between the new and old API:
transforms.Pad
toprototype.transforms
#5521transforms.RandomCrop
toprototype.transforms
#5522transforms.RandomHorizontalFlip
toprototype.transforms
#5523transforms.RandomVerticalFlip
toprototype.transforms
#5524transforms.RandomPerspective
toprototype.transforms
#5525transforms.FiveCrop
toprototype.transforms
#5526transforms.TenCrop
toprototype.transforms
#5527transforms.ColorJitter
toprototype.transforms
#5528transforms.RandomRotation
toprototype.transforms
#5529transforms.RandomAffine
toprototype.transforms
#5530transforms.GaussianBlur
toprototype.transforms
#5531transforms.RandomInvert
toprototype.transforms
#5532transforms.RandomPosterize
toprototype.transforms
#5533transforms.RandomSolarize
toprototype.transforms
#5534transforms.RandomAdjustSharpness
toprototype.transforms
#5535transforms.RandomAutocontrast
toprototype.transforms
#5536transforms.RandomEqualize
toprototype.transforms
#5537transforms.LinearTransformation
toprototype.transforms
#5538RandomHorizontalFlip
Transforms without dispatcher #5421Resize
Transforms without dispatcher #5421CenterCrop
Transforms without dispatcher #5421RandomResizedCrop
Transforms without dispatcher #5421Normalize
Transforms without dispatcher #5421RandomErasing
Transforms without dispatcher #5421RandAugment
Transforms without dispatcher #5421TrivialAugmentWide
Transforms without dispatcher #5421AutoAugment
Transforms without dispatcher #5421There is an issue for each transformation. Please comment on that if you want to take up a task so we can assign it to you.
Here is a recipe on how the porting process looks like:
Port the kernels from
transforms.functional_tensor
andtransforms.functional_pil
toprototype.transforms.functional
. In most cases this means just binding them to a new namevision/torchvision/prototype/transforms/functional/_geometry.py
Lines 14 to 15 in 97385df
The naming scheme is
{kernel_name}_{feature_type}_{feature_subtype}
. To use the example above, the kernel name ishorizontal_flip
, the feature type isimage
and the subtypes aretensor
andpil
.In general, the new kernels should have the same signature as the old dispatchers from
torchvision.functional
. In most cases this is given by default, but sometimes there is some common pre-processing performed in the dispatchers before the kernels are called. In these cases, the canonical way is to move the common functionality into a private helper function and define the new kernels to call the helper first and afterwards the old kernel.Create a new transform in
prototype.transforms
that inherits fromprototype.transforms.Transform
. The constructor can be copy-pasted from the corresponding transform intransforms
.Implement the
_transform
method. It receives two arguments:input
andparams
(see below).input
can be any non-container objects, i.e. no lists, tuple, dictionaries, and so on, so the implementation needs to check the input type and dispatch accordingly. The general behavior should be to handle what we can and let the rest pass through (there are exceptions to this, see below). The implementation should look something like thisSome transformations could in theory support other feature types such as bounding boxes, but we currently don't have kernels for them or will never have. In these cases it is crucial to not perform the transformation on the image and let the rest pass through, because it invalidates the correspondence. For example, applying
RandomRotate
only on an image but ignoring a bounding box renders the bounding box invalid. To avoid this, overwrite theforward
method and fail if unsupported types are detected:vision/torchvision/prototype/transforms/_geometry.py
Lines 74 to 78 in 97385df
Some random transformations take a
p
parameter that indicates the probability to apply the transformation. Overwrite theforward
method and perform this check there before calling theforward
of the super classvision/torchvision/prototype/transforms/_augment.py
Lines 102 to 105 in 97385df
Some random transformations need to sample parameters at runtime. In the old implementations this is usually done in a
The new architecture is similar but not the same. You can overwrite the
_get_params(self, sample) -> Dict[str, Any]
method:vision/torchvision/prototype/transforms/_geometry.py
Lines 114 to 116 in 97385df
The returned dictionary is available through the
params
parameter in the_transform
method. Thequery_image
function used above can be used to find an image in thesample
without worrying about the actual structure. This is useful if the image dimensions are needed to generate the parameters.The transformations are not simple stuff, so there might be cases were the recipe from above is not sufficient. If you have any kind of questions or hit blockers, feel free to send a PR with what you have and ping me there so I can have a look and help you out.
cc @vfdev-5 @datumbox @bjuncek
The text was updated successfully, but these errors were encountered: