Update compiler plugin #832

koperagen · 2024-08-21T18:50:36Z

No description provided.

Jolanrensen

Nice to have a lot more functions supported in the plugin :) I just have a few small comments, mostly regarding clarification of the new toDataFrame overload, otherwise, it's great. Could you mention which new functions are now supported in the PR notes? That'll help us find it back later

Jolanrensen · 2024-08-22T09:43:13Z

dataframe-excel/src/main/kotlin/org/jetbrains/kotlinx/dataframe/io/xlsx.kt

@@ -209,7 +213,7 @@ public fun DataFrame.Companion.readExcel(
 * @param range comma separated list of Excel column letters and column ranges (e.g. “A:E” or “A,C,E:F”)
 */
 @JvmInline
-public value class StringColumns(public val range: String)
+public value class StringColumns @Interpretable("StringColumns") constructor(public val range: String)


The linter fails here, it expects it to be like:

@JvmInline public value class StringColumns @Interpretable("StringColumns") constructor(public val range: String)

I recommend using the KtLint plugin (if you run the IDE in K1 mode) or run the ktlint task manually

Jolanrensen · 2024-08-22T09:59:29Z

docs/StardustDocs/topics/createDataFrame.md

@@ -151,6 +151,19 @@ df.add("length") { value.length }

 <!---END-->

+Creates a [`DataFrame`](DataFrame.md) with one column
+made from [`Iterable`](https://kotlinlang.org/api/latest/jvm/stdlib/kotlin.collections/-iterable/) of values:


It took me a while to figure out how this case is different from the other cases. I'd specify a) that this uses non-basic types, since we have overloads for those already, and b) that by specifying the column name, the properties of the given objects aren't unfolded like in the case below. This should probably also be in kdocs next to the function

Creates a DataFrame from Iterable<T> with one column: "columnName: DataColumn<T>"
Is it better?

Yes! But probably also specify that the properties are not unfolded, aka, you get one "value column" and not a "column group".

Jolanrensen · 2024-08-22T10:08:53Z

core/src/main/kotlin/org/jetbrains/kotlinx/dataframe/impl/TypeUtils.kt

@@ -436,6 +436,8 @@ internal fun guessValueType(values: Sequence<Any?>, upperBound: KType? = null, l
                collectionClasses.add(it.javaClass.kotlin)
            }

+            is Function<*> -> classes.add(Function::class)


why not :) Maybe we should change the rendering for functions in dataframes though. After a quick test I found it looks like:

⌌---------------------------------------------------------------------------------------------------------------⌍ | | a:Function<*>| b:Int| |--|-----------------------------------------------------------------------------------------------------|------| | 0| org.jetbrains.kotlinx.dataframe.testSets.person.DataFrameTests$$Lambda$60/0x000000010013a040@64ee819| 2| ⌎---------------------------------------------------------------------------------------------------------------⌏

Sadly for such lambda objects toString is weird. I tried to look at the object in the debugger, but there's literally nothing that hints at signature or anything useful

hmm you'd think there was a way in kotlin to detect it's a () -> Int or something :/

actually...

It already renders correctly often!

might just be a fluke in the tests if the lambda is serialized as interface

what kernel version do you use?

I run the dev version of this PR's branch in the notebook. (so publish to maven local and use v=0.14.0-dev)

Ah, ok, so the fix is needed anyway

Jolanrensen · 2024-08-22T10:10:35Z

plugins/kotlin-dataframe/testData/box/fillNulls.kt

+fun box(): String {
+    val df = dataFrameOf("a", "b")(1, null, null, "")
+    val df1 = df.fillNulls { b }.with { "empty" }
+    val b: DataColumn<String> = df1.b


this is beautiful behavior :) Literally how DataFrame is meant to be! Anatoly would be proud I'm sure

Jolanrensen · 2024-08-22T10:13:00Z

plugins/kotlin-dataframe/src/org/jetbrains/kotlinx/dataframe/plugin/impl/api/flatten.kt

+    val Arguments.separator: String by arg(defaultValue = Present("."))
+
+    override fun Arguments.interpret(): PluginDataFrameSchema {
+        return receiver.asDataFrame().flatten(keepParentNameForColumns, separator).toPluginDataFrameSchema()


we can use the actual flatten function :D awesome!

Jolanrensen · 2024-08-22T10:36:55Z

oh btw, can we already update the kotlin version of the compiler plugin to 2.0.20-RC2? That's the latest now, the beta version is not available anymore

Jolanrensen · 2024-08-22T10:55:56Z

core/src/main/kotlin/org/jetbrains/kotlinx/dataframe/api/update.kt

-public fun <T, C> Update<T, C>.with(expression: UpdateExpression<T, C, C?>): DataFrame<T> =
+@Refine
+@Interpretable("UpdateWith0")
+public fun <T, C, R : C?> Update<T, C>.with(expression: UpdateExpression<T, C, R>): DataFrame<T> =


In my testrun of the compiler plugin I now cannot use update {}.with {} anymore, just fillNulls {}.with {}. It gives

[NONE_APPLICABLE] None of the following candidates is applicable: val DataRow<Into_93I>.age: Int? val ColumnsContainer<Into_93I>.age: DataColumn<Int?>

when trying to access the updated column. Is this intended for now?

For now yes, there's such issue because plugin fails to interpret update { }.with { } (update not supported) and fallback to an empty schema. Will fix

`with` used to have C? in UpdateExpression return type position, and so it was always inferred as nullable. Even for fillNulls { }.with { 123 }

…data-agnostic operations

From my quick research, reflection doesn't know anything about these values. They don't have invoke methods, nor any supertypes. So for now i decided to simply fix NPE by falling back to generic Function type for such columns. It will then at least work together with compiler plugin

github-actions · 2024-08-22T12:31:04Z

Generated sources will be updated after merging this PR.
Please inspect the changes in here.

[Compiler plugin] Support dataFrameOf(header)(values)

6b892d1

koperagen requested a review from Jolanrensen August 21, 2024 18:50

koperagen added the Compiler plugin Anything related to the DataFrame Compiler Plugin label Aug 21, 2024

Jolanrensen approved these changes Aug 22, 2024

View reviewed changes

Jolanrensen reviewed Aug 22, 2024

View reviewed changes

koperagen force-pushed the compiler-plugin-member-functions branch from 327ec74 to 525d6b1 Compare August 22, 2024 11:15

koperagen added 8 commits August 22, 2024 15:25

Add toDataFrame(columnName) overload

7e6825d

[Compiler plugin] Support more parameters in read* functions

874abff

[Compiler plugin] fillNulls { }.with { }

2d31cd6

`with` used to have C? in UpdateExpression return type position, and so it was always inferred as nullable. Even for fillNulls { }.with { 123 }

[Compiler plugin] Support flatten via adapter for structure-related, …

01d99f6

…data-agnostic operations

[Compiler plugin] df.convert { }.to<Type>()

6318036

[Compiler plugin] Consider column name annotation when extracting schema

b989c63

Update compiler plugin to 2.0.20-RC2

d86d589

koperagen force-pushed the compiler-plugin-member-functions branch from 525d6b1 to d86d589 Compare August 22, 2024 12:25

koperagen self-assigned this Aug 22, 2024

koperagen merged commit a268c40 into master Aug 22, 2024
3 checks passed

koperagen deleted the compiler-plugin-member-functions branch August 26, 2024 13:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update compiler plugin #832

Update compiler plugin #832

koperagen commented Aug 21, 2024

Jolanrensen left a comment

Jolanrensen Aug 22, 2024

Jolanrensen Aug 22, 2024

koperagen Aug 22, 2024 •

edited

Loading

Jolanrensen Aug 22, 2024

Jolanrensen Aug 22, 2024

koperagen Aug 22, 2024

Jolanrensen Aug 22, 2024

Jolanrensen Aug 22, 2024

Jolanrensen Aug 22, 2024

koperagen Aug 22, 2024

Jolanrensen Aug 22, 2024 •

edited

Loading

koperagen Aug 22, 2024

Jolanrensen Aug 22, 2024

Jolanrensen Aug 22, 2024

Jolanrensen commented Aug 22, 2024

Jolanrensen Aug 22, 2024 •

edited

Loading

koperagen Aug 22, 2024

github-actions bot commented Aug 22, 2024

Update compiler plugin #832

Update compiler plugin #832

Conversation

koperagen commented Aug 21, 2024

Jolanrensen left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

koperagen Aug 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Jolanrensen Aug 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Jolanrensen commented Aug 22, 2024

Jolanrensen Aug 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented Aug 22, 2024

koperagen Aug 22, 2024 •

edited

Loading

Jolanrensen Aug 22, 2024 •

edited

Loading

Jolanrensen Aug 22, 2024 •

edited

Loading