Refactoring of the preprocessing DSL (#416) #425

ermolenkodev · 2022-08-16T10:32:48Z

No description provided.

juliabeliaeva · 2022-08-16T20:21:55Z

Wouldn't it be sufficient to just generify ImagePreprocessor interface? We'd also get the same-looking preprocessing for jvm and android.

dataset/src/jvmMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessor/image/CenterCrop.kt

dataset/src/commonMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessing/Preprocessor.kt

dataset/src/jvmMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessor/Preprocessing.kt

dataset/src/androidMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessing/Preprocessing.kt

juliabeliaeva · 2022-08-16T23:35:20Z

Wouldn't it be sufficient to just generify ImagePreprocessor interface? We'd also get the same-looking preprocessing for jvm and android.

Something like this: master...juliabeliaeva:wip/preprocessing-api

juliabeliaeva · 2022-08-18T17:18:02Z

Would have been better to have separate PR for android implementation.

juliabeliaeva · 2022-08-18T17:28:05Z

We definitely should include more explanation of the changes into the resulting commit message, since we are basically having a new DSL here. Also let's mention in the message that we moved from ImageShape to TensorShape and that getFinalShape now calculates correct shape for operations such as Transpose (previous implementation did not take into account any tensor operations at all). And changes to the save operation should be described.

docs/transfer_learning.md

...src/androidMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessing/ConvertToFloatArray.kt

juliabeliaeva · 2022-08-18T17:53:08Z

...src/androidMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessing/ConvertToFloatArray.kt

+        )
+        imgData.rewind()
+        val stride = input.width * input.height
+        val bmpData = IntArray(stride.toInt())


Isn't stride already Int?

I will fix it in separate PR with operations for an Android platform

...kotlin/examples/transferlearning/modelhub/vgg16/Example_5_VGG16_additional_training_noTop.kt

onnx/src/jvmMain/kotlin/org/jetbrains/kotlinx/dl/api/dataset/preprocessor/Transpose.kt

tensorflow/src/main/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessor/Sharpen.kt

...src/jvmMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessor/image/ImageOperationBase.kt

juliabeliaeva · 2022-08-18T21:09:06Z

Let's add tests for the android operations and for new getFinalShape implementations.

dataset/src/jvmMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessor/image/Convert.kt

...rc/commonMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessing/PreprocessingPipeline.kt

New preprocessing API implies that preprocessing operations are defined as functional objects with Input and Output types defined as generics. Operations can be composed together to form a pipeline. The output of the operation is the input of the next operation. At the end of the pipeline, the output of the last operation is the pipeline's output. The approach when the pipeline is defined as a sequence of operations has the following advantages: - it's more flexible and supports the composition of operations with different input and output types. - user explicitly specifies conversions between types during the pipeline definition. E.g. there is an Operation that converts from [BufferedImage] to [FloatArray]. - allows adding support of augmentations in the future without changing the API. * Remove imagePreprocessing and tensorPreprocessing DSL blocks * User explicitly convert BufferedImage to FloatArray * Replace ImageShape with TensorShape * getOutputShape method now implemented for tensor operations and now calculates correct shape for operations such as Transpose * Update tests and examples with new API Co-authored-by: Julia Beliaeva <Julia.Beliaeva@jetbrains.com>

* Now method takes requested color mode into account when computing output shape * Add tests for output shape calculation

* Add support for unknown dimension in input TensorShape * Add BufferedImage.getTensorShape extension function

* Dimensions that are -1 in TensorShape are converted to null in ImageShape.

* Remove useless toInt casts * Fix ImageProessing example * Remove commented code

* Fix output shape calculation for Transpose operation * Add test for an output shape calculation

* Add FloatArrayOperation base class * Revert wrong removal of toInt casts in examples

* Remove ImageOperationBase * Remove ImageSaver and Operation.save extension function * Introduce Operation.onResult function which can be used instead of Operation.save for saving intermediate results of preprocessing operations * Add an explitit type declaration for pipeline function Co-authored-by: Julia Beliaeva <Julia.Beliaeva@jetbrains.com>

juliabeliaeva · 2022-08-23T12:12:06Z

dataset/src/jvmMain/kotlin/org/jetbrains/kotlinx/dl/dataset/OnHeapDataset.kt

 import java.io.File
 import java.io.IOException
 import java.nio.FloatBuffer
 import java.nio.file.Files
 import java.nio.file.Path
 import kotlin.math.truncate
 import kotlin.random.Random
-import kotlin.streams.toList


Can't remove this import, it does not compile without it.

Sorry, for some reason, tests were green in IntelliJ. I added import back and ran tests using the Gradle command line. Now seems fine

* Fix broken import in OnHeapDataset

ermolenkodev requested a review from juliabeliaeva August 16, 2022 10:32

ermolenkodev linked an issue Aug 16, 2022 that may be closed by this pull request

Add preprocessing API for android Bitmaps #416

Closed

juliabeliaeva reviewed Aug 16, 2022

View reviewed changes

juliabeliaeva mentioned this pull request Aug 16, 2022

Introduce an interface for loading and preprocessing data #424

Merged

ermolenkodev force-pushed the 416 branch from bc7bcca to ef26c9e Compare August 18, 2022 14:58

ermolenkodev marked this pull request as ready for review August 18, 2022 16:29

ermolenkodev requested a review from juliabeliaeva August 18, 2022 16:30

juliabeliaeva reviewed Aug 18, 2022

View reviewed changes

dataset/src/jvmMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessor/image/Convert.kt Outdated Show resolved Hide resolved

juliabeliaeva reviewed Aug 18, 2022

View reviewed changes

...rc/commonMain/kotlin/org/jetbrains/kotlinx/dl/dataset/preprocessing/PreprocessingPipeline.kt Outdated Show resolved Hide resolved

ermolenkodev and others added 13 commits August 22, 2022 17:27

Add documentation for the Operation interface (Kotlin#416)

ac72b5d

Fix Convert.getOutputShape method (Kotlin#416)

f60ab47

* Now method takes requested color mode into account when computing output shape * Add tests for output shape calculation

Fix getOutputShape for BufferedImage operations (Kotlin#416)

06a2182

* Add support for unknown dimension in input TensorShape * Add BufferedImage.getTensorShape extension function

Fix convertation of TensorShape to ImageShape (Kotlin#416)

446dd57

* Dimensions that are -1 in TensorShape are converted to null in ImageShape.

Add documentation for TensorShape.toImageShape function (Kotlin#416)

91600ef

Optimize imports in modified modules (Kotlin#416)

aa09330

Fix resizeNoInputShape test (Kotlin#416)

b261c44

Minor fixes in examples (Kotlin#416)

529ddb0

* Remove useless toInt casts * Fix ImageProessing example * Remove commented code

Fix Transpose operation (Kotlin#416)

b8b5f51

* Fix output shape calculation for Transpose operation * Add test for an output shape calculation

Add FloatArrayOperation base class (Kotlin#416)

c1eb3e3

* Add FloatArrayOperation base class * Revert wrong removal of toInt casts in examples

Update documentation (Kotlin#416)

76d90cb

ermolenkodev force-pushed the 416 branch from d7ee7f2 to 76d90cb Compare August 23, 2022 10:27

ermolenkodev changed the title ~~WIP implementation of preprocessing api for Android (#416)~~ Refactoring of the preprocessing DSL (#416) Aug 23, 2022

ermolenkodev requested a review from juliabeliaeva August 23, 2022 10:30

juliabeliaeva approved these changes Aug 23, 2022

View reviewed changes

Fix tests (Kotlin#416)

70cf751

* Fix broken import in OnHeapDataset

ermolenkodev merged commit 5a5d385 into Kotlin:master Aug 24, 2022

This was referenced Sep 21, 2022

Add Support For Loading Image ByteArray During Preprocessing #321

Closed

Allow to invoke Preprocessing on a single BufferedImage #387

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactoring of the preprocessing DSL (#416) #425

Refactoring of the preprocessing DSL (#416) #425

ermolenkodev commented Aug 16, 2022

juliabeliaeva commented Aug 16, 2022 •

edited

Loading

juliabeliaeva commented Aug 16, 2022

juliabeliaeva commented Aug 18, 2022

juliabeliaeva commented Aug 18, 2022 •

edited

Loading

juliabeliaeva Aug 18, 2022

ermolenkodev Aug 23, 2022

juliabeliaeva commented Aug 18, 2022

juliabeliaeva Aug 23, 2022

ermolenkodev Aug 23, 2022

Refactoring of the preprocessing DSL (#416) #425

Refactoring of the preprocessing DSL (#416) #425

Conversation

ermolenkodev commented Aug 16, 2022

juliabeliaeva commented Aug 16, 2022 • edited Loading

juliabeliaeva commented Aug 16, 2022

juliabeliaeva commented Aug 18, 2022

juliabeliaeva commented Aug 18, 2022 • edited Loading

juliabeliaeva Aug 18, 2022

Choose a reason for hiding this comment

ermolenkodev Aug 23, 2022

Choose a reason for hiding this comment

juliabeliaeva commented Aug 18, 2022

juliabeliaeva Aug 23, 2022

Choose a reason for hiding this comment

ermolenkodev Aug 23, 2022

Choose a reason for hiding this comment

juliabeliaeva commented Aug 16, 2022 •

edited

Loading

juliabeliaeva commented Aug 18, 2022 •

edited

Loading