Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Feature / Reworked data pipeline framework (#188)
* Initial interface for DataPipeline * New codec base classes to work with data pipelines * Move arrow schema util into the main data package (it is used by all data components, not just the arrow codec) * Update arrow codec to work with the data pipeline framework * Update CSV codec to work with the data pipeline framework * Update JSON codec to work with the data pipeline framework * First implementation of data pipeline and supporting classes * Update the codec test suite to work with data pipelines * Update data round trip test in the orchestrator after data framework changes * Update storage interface to use the new data framework, and use it in the main data service * Remove DataBlock class which was part of the old data implementation * Use a flow helper to handle mkdir before writing file in flat data storage * Add some comments on the rationale for data pipelines * Remove debug logging * Try to flush arrow content stream in RT test * Remove bad char sequence from doc comment that prevents compile in Java 11. * Fix handling of arrow data encode/decode in end-to-end integration test
- Loading branch information