Currently, the IngressDescriptor configuration in our system does not allow users to specify which columns to select from tables during the data extraction process. By default, all columns from the startTable and related tables are included, which can lead to unnecessary data being processed and transferred.
Proposed Enhancement:
We propose adding a select option to the IngressDescriptor YAML configuration that allows users to explicitly define the columns they want to extract from each table. This feature should apply to both the startTable and any related tables defined under relations.
Proposed Configuration Example:
    version: v1
    IngressDescriptor:
      startTable: public.customer
      select: ["customer_id", "first_name"] # Select only specific columns from the start table
      relations:
        - name: film_original_language_id_fkey
          parent:
            name: public.film
            lookup: false
          child:
            name: public.language
            lookup: true
            where: "creation_date >= '01/01/2023'"
            select: ["language_id", "title"] # Select specific columns from the related table
Benefits:
Optimized Data Transfer: Reducing the amount of data transferred by selecting only necessary columns.
Improved Performance: Potentially faster query execution and data processing by limiting the scope of the data extracted.
Greater Flexibility: Users gain more control over the data extraction process, tailoring it to their specific needs.
Impact and Dependencies:
This enhancement will require modifications to the YAML parsing logic and the underlying SQL query generation to accommodate the new select attribute for each table involved in the extraction process. It may also require updates to documentation and examples provided to users.
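To make the SQL generation change concrete, here is a minimal sketch of how an optional column list could feed into query construction. It is illustrative only: `quote_ident` and `build_query` are hypothetical names, not functions from the project, and the identifier quoting assumes a PostgreSQL-style dialect.

```python
from typing import Optional, Sequence


def quote_ident(name: str) -> str:
    """Quote an SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def build_query(
    table: str,
    select: Optional[Sequence[str]] = None,
    where: Optional[str] = None,
) -> str:
    """Build a SELECT statement for one table (hypothetical helper).

    When `select` is missing or empty, fall back to `*`, which mirrors
    the current behaviour of extracting every column.
    """
    columns = ", ".join(quote_ident(c) for c in select) if select else "*"
    # Quote each part of a schema-qualified name such as public.customer.
    qualified = ".".join(quote_ident(part) for part in table.split("."))
    sql = f"SELECT {columns} FROM {qualified}"
    if where:
        # The where clause is passed through verbatim, as in the descriptor.
        sql += f" WHERE {where}"
    return sql
```

One question the sketch leaves open is whether key columns needed to follow a relation should still be fetched when they are not listed in select.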
Acceptance Criteria:
Users can specify a select option for both startTable and each related table in relations.
The system correctly generates SQL queries that only include the specified columns.
Extensive testing is performed to ensure backward compatibility with configurations that do not include the select option.
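The criteria above could be checked with something like the following sketch, which reads the select option from an already-parsed descriptor (a plain dictionary, as a YAML loader would produce). The function name `collect_selects` is hypothetical, and the convention that `None` means "all columns" is an assumption made here for illustration.

```python
from typing import Dict, List, Optional


def collect_selects(descriptor: dict) -> Dict[str, Optional[List[str]]]:
    """Map each table name to its selected columns (hypothetical helper).

    A value of None means no select option was given, so all columns
    would be extracted, as they are today.
    """
    ingress = descriptor["IngressDescriptor"]
    selects: Dict[str, Optional[List[str]]] = {
        ingress["startTable"]: ingress.get("select")
    }
    for relation in ingress.get("relations", []):
        for side in ("parent", "child"):
            table = relation.get(side)
            if not table:
                continue
            name = table["name"]
            # Record the table once; an explicit select replaces a missing one.
            if name not in selects or table.get("select") is not None:
                selects[name] = table.get("select")
    return selects
```

A descriptor written before this feature has no select keys, so every table maps to `None` and the extraction behaves as before.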
youen linked a pull request on Sep 3, 2024 that will close this issue