Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change to use resolver v2, test more feature flag combinations in CI, fix errors (#1630) #1822

Merged
merged 11 commits into from
Jun 9, 2022

Conversation

tustvold
Copy link
Contributor

@tustvold tustvold commented Jun 8, 2022

Which issue does this PR close?

Closes #1630

Rationale for this change

We continue to have issues with various feature flag combinations resulting in compile errors. A particularly pernicious variant of this occurs you need to enable a feature of an optional dependency for tests, for example, arrow prettyprint within parquet. To do this you add the dependency as a dev-dependency, with the feature enabled.

However, the way feature resolution traditionally works, is that even when building the library crate on its own, it would take into account features enabled by the dev-dependencies. This would mask issues with feature flags. The previous solution to this has been to have test/dependency crates that simply depend on the library, without the dev-dependencies.

What changes are included in this PR?

rust-lang/cargo#7916 added an option to skip using dev-dependencies when resolving features, and this was stabilised in feature resolver v2 which was released in Rust 1.51. In particular with the new feature resolver

Dev-dependencies do not activate features unless building a target that needs them (like tests or examples).

This PR therefore:

  • Switches to using the new feature resolver
  • Uses this to test different feature flag combinations in CI
  • Removes the old test/dependency workaround
  • Fixes the bugs this turned up

Thanks to @carols10cents for the pointer ❤️

Are there any user-facing changes?

No

@github-actions github-actions bot added arrow Changes to the arrow crate parquet Changes to the parquet crate parquet-derive labels Jun 8, 2022
@tustvold tustvold force-pushed the feature-flag-consistency branch from 376f38e to 75f9b49 Compare June 8, 2022 21:35
cargo test --no-default-features

# re-run tests on arrow crate with all supported features
cargo test -p arrow --features=force_validate,prettyprint
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I merged these two test runs together as I couldn't see an obvious reason to separate them, and this will make CI slightly faster

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the original rationale was to try and test without the default features (mostly I think to try and catch build errors, which this PR does in another way)

cargo run --example builders
cargo run --example dynamic_types
cargo run --example read_csv
cargo run --example read_csv_infer_schema
cargo check --no-default-features

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I moved the cargo check logic into here as opposed to a separate job so it can benefit from the compilation already performed above.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe it would be worth making a separate named run step (as is done https://github.com/apache/arrow-rs/pull/1822/files#diff-73e17259d77e5fbef83b2bdbbe4dc40a912f807472287f7f45b77e0cbf78792dR188)

 - name: Check compilation with simd features

So when/if this check fails it will be easier to figure out what is wrong

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Working on this, running into nonsense with environment variables and caching

Cargo.toml Show resolved Hide resolved
@@ -135,6 +135,7 @@ macro_rules! eof_err {
($fmt:expr, $($args:expr),*) => (ParquetError::EOF(format!($fmt, $($args),*)));
}

#[cfg(any(feature = "arrow", test))]
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This removes a warning

@@ -27,7 +27,7 @@ use crate::data_type::{ByteArray, Decimal, Int96};
use crate::errors::{ParquetError, Result};
use crate::schema::types::ColumnDescPtr;

#[cfg(feature = "cli")]
#[cfg(any(feature = "cli", test))]
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Drive by refactor to make consistent with rest of crate which tests everything

cargo check -p parquet --no-default-features --features arrow --all-targets

# Test compilation of parquet_derive macro with different feature combinations
cargo check -p parquet_derive
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To be completely honest I'm not sure why we were testing this, parquet_derive doesn't have any dev-dependencies or feature flags, but I guess it can't hurt to be thorough

cargo check -p arrow
cargo check -p arrow --no-default-features

# Test compilation of arrow targets with different feature combinations
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would theoretically be more correct to test each target individually, so that feature resolution is not impacted by other targets, in practice this is probably good enough

@@ -39,38 +39,44 @@ brotli = { version = "3.3", optional = true }
flate2 = { version = "1.0", optional = true }
lz4 = { version = "1.23", optional = true }
zstd = { version = "0.11.1", optional = true, default-features = false }
chrono = { version = "0.4", default-features = false }
chrono = { version = "0.4", default-features = false, features = ["alloc"] }
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the actual fix for #1630

Unfortunately this is required by the record API which is tightly coupled with the file APIs, and so there isn't an easy way to hide this behind a feature flag. Perhaps something for another day, I'm a bit feature flagged out 😆

@codecov-commenter
Copy link

codecov-commenter commented Jun 9, 2022

Codecov Report

Merging #1822 (86dfef5) into master (ba38ebe) will increase coverage by 0.10%.
The diff coverage is 95.89%.

❗ Current head 86dfef5 differs from pull request most recent head 89c0429. Consider uploading reports for the commit 89c0429 to get more accurate results

@@            Coverage Diff             @@
##           master    #1822      +/-   ##
==========================================
+ Coverage   83.44%   83.54%   +0.10%     
==========================================
  Files         199      200       +1     
  Lines       56651    56798     +147     
==========================================
+ Hits        47272    47452     +180     
+ Misses       9379     9346      -33     
Impacted Files Coverage Δ
arrow/src/buffer/mod.rs 73.33% <ø> (ø)
parquet/src/errors.rs 29.62% <ø> (ø)
arrow/src/buffer/scalar.rs 93.47% <93.47%> (ø)
arrow/src/array/equal/list.rs 96.47% <100.00%> (+0.08%) ⬆️
arrow/src/array/equal/mod.rs 96.20% <100.00%> (+0.10%) ⬆️
parquet/src/record/api.rs 96.82% <100.00%> (+4.79%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ba38ebe...89c0429. Read the comment docs.

@alamb alamb changed the title Test more feature flag combinations in CI (#1630) Test more feature flag combinations in CI, fix errors (#1630) Jun 9, 2022
@alamb alamb changed the title Test more feature flag combinations in CI, fix errors (#1630) Change to use resolver v2, test more feature flag combinations in CI, fix errors (#1630) Jun 9, 2022
Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The idea looks great - thanks @tustvold

I want to review the actual test output before I approve this PR and it still appears to be running

cargo test --no-default-features

# re-run tests on arrow crate with all supported features
cargo test -p arrow --features=force_validate,prettyprint
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the original rationale was to try and test without the default features (mostly I think to try and catch build errors, which this PR does in another way)

Cargo.toml Show resolved Hide resolved
cargo run --example builders
cargo run --example dynamic_types
cargo run --example read_csv
cargo run --example read_csv_infer_schema
cargo check --no-default-features

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe it would be worth making a separate named run step (as is done https://github.com/apache/arrow-rs/pull/1822/files#diff-73e17259d77e5fbef83b2bdbbe4dc40a912f807472287f7f45b77e0cbf78792dR188)

 - name: Check compilation with simd features

So when/if this check fails it will be easier to figure out what is wrong

@tustvold tustvold marked this pull request as draft June 9, 2022 14:51
@tustvold
Copy link
Contributor Author

tustvold commented Jun 9, 2022

Setting as draft whilst I try to get CI happy

@tustvold tustvold force-pushed the feature-flag-consistency branch 8 times, most recently from 9b0324a to b701d2b Compare June 9, 2022 16:11
@tustvold tustvold force-pushed the feature-flag-consistency branch from b701d2b to ed413af Compare June 9, 2022 16:33
@tustvold
Copy link
Contributor Author

tustvold commented Jun 9, 2022

I had to rework the caching to allow splitting up the stages into smaller steps, without having to duplicate "export CARGO_HOME". into every step. Setting these variable at the top-level caused rustup to fail to install correctly, and so rather than using CARGO_HOME to move cargo's location, instead we now just cache the relevant cargo paths. I moved this logic into the "Prepare Rust Build Environment" action to reduce duplication.

@tustvold tustvold force-pushed the feature-flag-consistency branch 2 times, most recently from 68d1441 to 7142dff Compare June 9, 2022 17:02
Don't install unused components
@tustvold tustvold force-pushed the feature-flag-consistency branch from 7142dff to 89c0429 Compare June 9, 2022 17:04
@tustvold
Copy link
Contributor Author

tustvold commented Jun 9, 2022

Here is an example of it restoring from a cache key (without lockfile suffix), and then publishing the lockfile specific cache key 🎉

This then gets picked up by https://github.com/apache/arrow-rs/runs/6817593817?check_suite_focus=true

@tustvold tustvold marked this pull request as ready for review June 9, 2022 17:39
@tustvold
Copy link
Contributor Author

tustvold commented Jun 9, 2022

MIRI appears to be unhappy... 👀

Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Beautiful

@tustvold tustvold mentioned this pull request Jun 9, 2022
@tustvold
Copy link
Contributor Author

tustvold commented Jun 9, 2022

Appears MIRI version is too old to supposed namespaced deps - rust-lang/cargo#5565

Attempting to update it in - #1828

@alamb
Copy link
Contributor

alamb commented Jun 10, 2022

Thank you @tustvold

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arrow Changes to the arrow crate parquet Changes to the parquet crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

parquet does not compile with features=["zstd"]
3 participants