[Bug] Suspiciously slow calls to Sequential.predictSoftly #25

Closed · LostMekka opened this issue Dec 19, 2020 · 7 comments
Labels: bug (Something isn't working) · Milestone: 0.1.1

Comments

@LostMekka (Contributor) commented Dec 19, 2020

Calls to Sequential.predictSoftly start out fairly slow, and each call takes a bit more time than the last one.

My example model is quite small, but the first call already takes about 25 ms on my machine (for comparison: fitting 1,000 random data points for 10 epochs takes 415 ms). After calling predictSoftly 10,000 times, each successive call takes a third of a second.

[chart from timing.csv: per-call prediction time in ms vs. number of calls, rising steadily]

Code to reproduce:

// KotlinDL imports for Sequential, Input, Dense, Adam, Losses and Metrics
// omitted; their package names depend on the library version.
import java.io.File
import kotlin.random.Random
import kotlin.time.Duration
import kotlin.time.ExperimentalTime
import kotlin.time.measureTimedValue

@OptIn(ExperimentalTime::class)
fun main() {
    Sequential.of(
        Input(36),
        Dense(36),
        Dense(36),
        Dense(36),
        Dense(36),
        Dense(36),
        Dense(16),
        Dense(8),
        Dense(3),
    ).use { model ->
        model.compile(
            optimizer = Adam(),
            loss = Losses.SOFT_MAX_CROSS_ENTROPY_WITH_LOGITS,
            metric = Metrics.MSE,
        )
        model.init()
        val features = FloatArray(36) { Random.nextFloat() }
        var predictionCalls = 0
        var predictionTimeOfBatch = Duration.ZERO
        val predictionTimes = mutableListOf<Double>()
        repeat(100_000) {
            val timing = measureTimedValue { model.predictSoftly(features) }
            predictionCalls++
            predictionTimeOfBatch += timing.duration
            predictionTimes += timing.duration.inMilliseconds
            if (predictionCalls % 100 == 0) {
                val csv = predictionTimes
                    .withIndex()
                    .joinToString("\n") { (i, t) -> "${i + 1},$t" }
                File("timing.csv").writeText(csv)
                println("$predictionCalls calls done. (${predictionTimeOfBatch / 100} per call)")
                predictionTimeOfBatch = Duration.ZERO
            }
        }
    }
}
@quickstep24 commented Dec 20, 2020

a.) Sequential.internalPredict does not close the tensors it receives from Session.run.
b.) Dataset.serializeToBuffer can be optimized for the special case where there is exactly one FloatArray: it is faster to wrap that FloatArray in a FloatBuffer than to copy it into a newly allocated buffer. (A sketch of a.) and b.) follows this list.)
c.) There should be a means to call predictSoftly on a batch of inputs. At the moment this is only possible for predict.
d.) Looking at the underlying Java implementation, why are FloatArray values copied to a FloatBuffer at all? It seems to me that FloatArray and Array<FloatArray> are valid inputs for creating a Tensor.
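
A minimal sketch of a.) and b.), assuming the TensorFlow 1.x Java API that KotlinDL wraps (org.tensorflow.Session, org.tensorflow.Tensor); the helper predictOnce and the inputOp/outputOp names are hypothetical, not KotlinDL internals:

import java.nio.FloatBuffer
import org.tensorflow.Session
import org.tensorflow.Tensor

// Hypothetical helper illustrating both fixes; not the library's actual code.
fun predictOnce(
    session: Session,
    features: FloatArray,
    inputOp: String,
    outputOp: String,
): FloatArray =
    // b.) Wrap the caller's FloatArray directly instead of copying it into a
    // newly allocated buffer.
    Tensor.create(longArrayOf(1, features.size.toLong()), FloatBuffer.wrap(features)).use { input ->
        val outputs = session.runner()
            .feed(inputOp, input)
            .fetch(outputOp)
            .run()
        // a.) Close every tensor returned by Session.run; unclosed tensors hold
        // on to native memory and make each successive call slower.
        outputs[0].use { out ->
            val row = Array(1) { FloatArray(out.shape()[1].toInt()) }
            out.copyTo(row)
            row[0]
        }
    }

Tensor implements AutoCloseable, so Kotlin's use blocks give deterministic cleanup of the native memory on both the input and the output side.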

@zaleslaw (Collaborator)

@LostMekka thanks a lot for your performance experiment; it looks great.
Also, it looks like @quickstep24 is right, and his suggestions will be addressed in the next 0.1.1 bug-fix release.

  1. The missed tensor close is a bug, of course.
  2. I like the idea of an optimization for the plain FloatArray case.
  3. Regarding FloatBuffer usage: we need to revisit it (I have some thoughts about small performance improvements related to FloatBuffer usage), measure, and choose the best option here.

@zaleslaw zaleslaw changed the title Suspiciously slow calls to Sequential.predictSoftly [Bug] Suspiciously slow calls to Sequential.predictSoftly Dec 21, 2020
@zaleslaw zaleslaw added the bug Something isn't working label Dec 21, 2020
@zaleslaw zaleslaw added this to the 0.1.1 milestone Dec 21, 2020
@LostMekka (Contributor, Author)

c.)
There should be a means to call predictSoftly on a batch of inputs. At the moment this is only possible for predict.

That would be awesome as well. My current pet project for trying out this library is a board-game AI that very crudely learns through self-play, with the neural net as the search heuristic. The AI plays a semi-random game sequence, and for each pair of successive moves I create a training data point (this needs one predictSoftly call per data point). Building these self-play datasets would probably benefit greatly from batch soft-predicting 😄

Should I add a second issue for this?
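
For illustration only, a batch overload could have roughly this shape; predictSoftlyBatch is hypothetical (it does not exist in the library), and the naive per-row loop is just a placeholder for a real batched Session.run:

// Hypothetical extension; a real implementation would run a single Session.run
// for the whole batch instead of looping over rows.
fun Sequential.predictSoftlyBatch(inputs: Array<FloatArray>): Array<FloatArray> =
    Array(inputs.size) { i -> predictSoftly(inputs[i]) }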

@zaleslaw (Collaborator)

Great idea for a pet project; I hope the library will be helpful. Please create a separate ticket for this case as a feature request.
I will release it in 0.1.1, but that takes time: it will come out in mid-January (not earlier, sorry).

@zaleslaw (Collaborator)

Dear @quickstep24

Looking at the underlying Java implementation, why are FloatArray values copied to a FloatBuffer at all? It seems to me that FloatArray and Array<FloatArray> are valid inputs for creating a Tensor.

The reasons for using it are the following (a sketch of both creation paths follows this list):

  1. There is no Tensor.create method that gives us the ability to pass a plain FloatArray together with a shape.
  2. The shape of the created tensor depends on the input data and can have a different number of dimensions.
  3. Reshaping a plain FloatArray into a multidimensional array would be required before passing it to Tensor.create.
  4. Why not prepare FloatBuffers if the underlying TF Java API requires them?
  5. After the upgrade to the TF 2.x Java API we will have to deal with a totally different system of TensorTypes (I hope the migration is coming soon).
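
A short sketch of the two Tensor-creation paths under discussion, assuming the TF 1.x Java API (org.tensorflow) on the classpath:

import java.nio.FloatBuffer
import org.tensorflow.Tensor

fun main() {
    val flat = FloatArray(36) { it.toFloat() }

    // Tensor.create(Object) infers the shape from the nesting of the array:
    // a flat FloatArray always becomes a 1-D tensor, and any other rank
    // requires building the nested array first.
    Tensor.create(flat).use { t -> println(t.shape().toList()) }   // [36]
    val nested = Array(6) { r -> FloatArray(6) { c -> flat[r * 6 + c] } }
    Tensor.create(nested).use { t -> println(t.shape().toList()) } // [6, 6]

    // Tensor.create(long[], FloatBuffer) takes flat data plus an explicit
    // shape of any rank, which is why going through a FloatBuffer is the
    // generic option when the shape is only known at runtime.
    Tensor.create(longArrayOf(6, 6), FloatBuffer.wrap(flat))
        .use { t -> println(t.shape().toList()) }                  // [6, 6]
}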

@quickstep24

It is probably me misinterpreting the documentation of org.tensorflow.Tensor.create(Object obj, Class<T> type), where the example is given as:

 // Valid: A 3x2 matrix of floats.
 float[][] matrix = new float[3][2];
 Tensor<Float> m = Tensor.create(matrix, Float.class);

I assumed that this factory method would also work on 1-dim float arrays, but I have never tried it.

@zaleslaw (Collaborator)

@quickstep24 let me explain my message. Yes, you can pass a 1-dim FloatArray. But I need to pass a shape (to transform it to 2-D/3-D/4-D), and in that case this method requires, for example, a 4-D float array. That is not useful for me here, because the shape can be unknown in advance and I want more generic code. But I agree that in the future we need to minimize the copying overhead; I will keep such places in mind.
