Command line tools for building and deploying packages #391

akbertram · 2018-03-15T20:38:29Z

No description provided.

* Use common packager module that is also used by maven * Implement dependency resolution for package dependencies

* Record assumptions we make about the runtime state * If subsequent executions satistify those assumptions, reuse the compiled loop body

Interestingly, it's not even neccessary to add our own compiler specialization for the case of x^-1. We can achieve the same effect by adding a branch to the Ops.pow() function, which will be inlined by the JVM, and, if the exponent is constant, work its way into the machine code.

We want more freedom to be able to use different strategies for storing SEXPs on the stack. For example, for atomic vectors of known type with constant attributes, we can store them as primitive arrays. To make this possible, we use a similar pattern to GExpr in gcc-bridge, so that each type of storage is responsible for converting to another kind.

* Use liveness analysis to determine which arrays/SEXPs can be safely mutated in place to safely avoid duplicating the vector * Improve subset/replace value inference in order to delegate to specialized operators

This step actually turns out to be essental, as we otherwise can end up with phi-statements that rely on uninitialized values and cause the byte-code validation to fail.

When compiling a loop body or other SEXP, changes to local variables may still be visible in the environment after the compiled code terminates. For example: s <- 0 for(i in 1:1000) { s <- s + i } print(s) If we compile only the `for` expression, then the value of s must be updated in the enclosing Environment instance so that it is visible when print(s) is evaluated. We had been updating all local variables at each exit path, but this actually doesn't work for more complex control flows. For example: for(i in 1:1000) { for(j in seq_along(x)) { s <- s + 1 } } If the value of x is not known at compile time, then it is possible that x has a length of zero and the inner loop never executes. In this case, the values of s and j should *not* be updated in the enclosing environment (nor _could_ we, as they have no value) What we can do is insert UpdateEnv() statements by doing a depth-first search of the Reverse Dominance Tree.

The command line builder will now look for an extra 'Dependencies' field in the DESCRIPTION file, which should contain fully qualified package names, such as 'org.renjin:readxl' or 'com.acme:mypackage'

When building, packages will include a pointer at META-INF/org.renjin.package/{packageName} that allows the ClasspathPackageLoader to lookup packages on the classpath by its unqualified name. This also delegates resolution of unqualified package names to the PackageLoader implementation. This allows the AetherPackageLoader to query packages.renjin.org so that we don't have to cycle through org.renjin.cran, org.renjin.bioconductor group ids. This does withdraw support for loading packages in the format '{groupId}.{packageName}', which is ambiguous: only {groupId}:{packageName} will be accepted.

lapply() actually can be described to have non-standard evaluation of arguments as it relies on match.fun(). This is a first step towards compiling lapply() calls.

Argument need only be provided to the Specializer interface, they shouldn't have to be matched again when calling Specialization.getCompiledExpr().

* Improved new sapply() implementation * Moved as.vector() to R code so the compiler can better infer types/attributes * Moved tests in Java related to as.vector and types to R code * Retain original SEXP of arguments so that we can implement substitute() at compile time

akbertram added 30 commits March 15, 2018 17:20

Finishing command line builder implementation

4d0bcd3

* Use common packager module that is also used by maven * Implement dependency resolution for package dependencies

Implement package resolution via packages.renjin.org

eb59bb0

Updated license headers

b8436c5

Fix compilation error

6858aa1

Fixed stray import

1e09721

Fixed regression in maven plugin

93b6059

Log cause of installation failure

fdab121

Do not assume cli tool is running from package directory

85b1b3e

Faster DoubleVector constructor for scalars

9e107aa

Fix bug in length() compiler specialization

fb6bd71

Cache compiled loop bodies when possible

bf9bac4

* Record assumptions we make about the runtime state * If subsequent executions satistify those assumptions, reuse the compiled loop body

Checking format tests generator script

71b802c

wip

76bc5ff

Type inference logic for matrix subscripts and sum()

73e8b47

Further refactoring of bytecode generation

acfa3fc

Optimize vector/matrix updating in loops

b547ba7

* Use liveness analysis to determine which arrays/SEXPs can be safely mutated in place to safely avoid duplicating the vector * Improve subset/replace value inference in order to delegate to specialized operators

Implemented basic dead-code elimination

c9967a8

This step actually turns out to be essental, as we otherwise can end up with phi-statements that rely on uninitialized values and cause the byte-code validation to fail.

Optimizations for updating matrices in loops

d82789c

Specializations for double binary/unary operators

6986db9

Added support for switch() statements to compiler

69c43a3

More subset specializations

21ef2ba

Updated license headers

155dba9

Refactored handling of attribute bounds in compiler specialization

1163cf0

Fix checkstyle issues

f21a771

More work on compiler: rep, subsets, logical operations

75a1045

Fixed dim() specialization

c8ac68c

Specializations for attributes(x) <- y and fixes for sum()

0ea35f9

akbertram added 29 commits September 4, 2018 17:18

Update solve() and det() from GNU R 3.4.0

e02035c

Add compiler support for missing and ... args

ab433e8

wip stop and warning

bcf2648

Merge branch 'master' into wip-tooling

aedb285

Updated package builder to changes to the packager api

dfd5745

Add support for 'Dependencies' field in DESCRIPTION

42fd4e7

The command line builder will now look for an extra 'Dependencies' field in the DESCRIPTION file, which should contain fully qualified package names, such as 'org.renjin:readxl' or 'com.acme:mypackage'

Stub out mclapply

98bf267

Fix handling of escapes in sprintf()

353928d

Fix handling of whitespace when parsing dependencies in DESCRIPTION

65cec8c

Merge branch 'wip-more-compilation' into wip-tooling

a5cbd8d

Compiler specializations to support compiling functions on data frames

1d18447

Compiler specializations on the basis of list 'shape'

493edcc

Made lapply() a special function

13e31a3

lapply() actually can be described to have non-standard evaluation of arguments as it relies on match.fun(). This is a first step towards compiling lapply() calls.

Add support for exporting active bindings in namespaces

7e7e597

wip lapply work

8b37259

Implement faster hash-based merge routine

69d9f4a

Implement faster hash-based merge routine

014ccef

Initial implementation of lapply() in compiled code

815008a

Implemented lapply() call compilation

2f8edb0

Make sapply() special function

f8a3db5

Basic (incomplete) compilation of sapply()

9e7e2b9

Refactor Specializer/Specialization interfaces

ee59303

Argument need only be provided to the Specializer interface, they shouldn't have to be matched again when calling Specialization.getCompiledExpr().

Improved support for sapply() compilation

36204d3

Promoted warning() from .Internal to builtin

c2d27ea

Compiling: inherits(), is.numeric(), list(), range(), mean(), etc

8c154cc

Simplify data.frame() to workaround compiler for the momennt

b0359c4

Removed MicrobenchmarkHarness

52727b6

akbertram force-pushed the master branch from dd8608c to 4e7b43b Compare October 29, 2018 15:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Command line tools for building and deploying packages #391

Command line tools for building and deploying packages #391

akbertram commented Mar 15, 2018

Command line tools for building and deploying packages #391

Are you sure you want to change the base?

Command line tools for building and deploying packages #391

Conversation

akbertram commented Mar 15, 2018