-
Notifications
You must be signed in to change notification settings - Fork 14.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Status of testing Providers that were prepared on December 23, 2023 #36384
Comments
We've tested the following provider RCs with our example DAGs without encountering issues. apache-airflow-providers-microsoft-azure==8.5.0rc1 |
I have tested the following Kubernetes provider fixes. They are working as
expected. I am eagerly waiting for the release that has a critical
performance bug.
- Kubernetes executor running slots leak fix (#36240)
<#36240>: @dirrao
<https://github.com/dirrao>
- list pods performance optimization (#36092)
<#36092>: @dirrao
<https://github.com/dirrao>
…On Sat, Dec 23, 2023 at 7:53 AM Jarek Potiuk ***@***.***> wrote:
Body
I have a kind request for all the contributors to the latest provider
packages release.
Could you please help us to test the RC versions of the providers?
The guidelines on how to test providers can be found in
Verify providers by contributors
<https://github.com/apache/airflow/blob/main/dev/README_RELEASE_PROVIDER_PACKAGES.md#verify-the-release-candidate-by-contributors>
Let us know in the comment, whether the issue is addressed.
Those are providers that require testing as there were some substantial
changes introduced:
Provider airbyte: 3.5.1rc1
<https://pypi.org/project/apache-airflow-providers-airbyte/3.5.1rc1>
- Cancel airbyte job when timeout exceeded to prevent inconsistency
amo… (#36241) <#36241>: @shohamy7
<https://github.com/shohamy7>
Provider amazon: 8.14.0rc1
<https://pypi.org/project/apache-airflow-providers-amazon/8.14.0rc1>
- Add jsonpath_ng.ext.parse support for SqsSensor (#36170)
<#36170>: @cliebBS
<https://github.com/cliebBS>
- Increase ConflictException retries to 4 total (#36337)
<#36337>: @o-nikolas
<https://github.com/o-nikolas>
- Add more parameter support for the copy_object function in
airflow.providers.amazon.aws.hooks.s3 (#36304)
<#36304>: @samezapple
<https://github.com/samezapple>
- Allow storage options to be passed (#35820)
<#35820>: @bolkedebruin
<https://github.com/bolkedebruin>
- Remove is_authorized_cluster_activity from auth manager (#36175)
<#36175>: @vincbeck
<https://github.com/vincbeck>
- Add code snippet formatting in docstrings via Ruff (#36262)
<#36262>: @josh-fell
<https://github.com/josh-fell>
- Remove remaining Airflow 2.5 backcompat code from Amazon Provider
(#36324) <#36324>: @Taragolis
<https://github.com/Taragolis>
Provider apache.beam: 5.5.0rc1
<https://pypi.org/project/apache-airflow-providers-apache-beam/5.5.0rc1>
- Add ability to run streaming Job for BeamRunPythonPipelineOperator
in non deferrable mode (#36108)
<#36108>: @MaksYermak
<https://github.com/MaksYermak>
- Implement deferrable mode for BeamRunJavaPipelineOperator (#36122)
<#36122>: @moiseenkov
<https://github.com/moiseenkov>
Provider apache.cassandra: 3.4.1rc1
<https://pypi.org/project/apache-airflow-providers-apache-cassandra/3.4.1rc1>
- Sanitize keyspace and table in the cassandra query to avoid sql
injection (#36111) <#36111>:
@hussein-awala <https://github.com/hussein-awala>
- Add code snippet formatting in docstrings via Ruff (#36262)
<#36262>: @josh-fell
<https://github.com/josh-fell>
Provider apache.hdfs: 4.3.1rc1
<https://pypi.org/project/apache-airflow-providers-apache-hdfs/4.3.1rc1>
- fix connection type webhdfs (#36145)
<#36145>: @ankurbajaj9
<https://github.com/ankurbajaj9>
Provider apache.hive: 6.4.0rc1
<https://pypi.org/project/apache-airflow-providers-apache-hive/6.4.0rc1>
- add param proxy user for hive (#36221)
<#36221>: @romsharon98
<https://github.com/romsharon98>
- Add code snippet formatting in docstrings via Ruff (#36262)
<#36262>: @josh-fell
<https://github.com/josh-fell>
Provider apache.kafka: 1.3.1rc1
<https://pypi.org/project/apache-airflow-providers-apache-kafka/1.3.1rc1>
- Provide the consumed message to consumer.commit in
AwaitMessageTrigger (#36272)
<#36272>: @hussein-awala
<https://github.com/hussein-awala>
Provider apache.spark: 4.6.0rc1
<https://pypi.org/project/apache-airflow-providers-apache-spark/4.6.0rc1>
- SparkSubmit: Adding propertyfiles option (#36164)
<#36164>: @pateash
<https://github.com/pateash>
- SparkSubmit Connection Extras can be overridden (#36151)
<#36151>: @pateash
<https://github.com/pateash>
Provider celery: 3.5.1rc1
<https://pypi.org/project/apache-airflow-providers-celery/3.5.1rc1>
- Fix sentinel_kwargs load from ENV (#36318)
<#36318>: @vaaalik
<https://github.com/vaaalik>
Provider cncf.kubernetes: 7.12.0rc1
<https://pypi.org/project/apache-airflow-providers-cncf-kubernetes/7.12.0rc1>
- Add _request_timeout to KPO log fetch calls (#36297)
<#36297>: @dstandish
<https://github.com/dstandish>
- Add pod_template_dict field to KubernetesPodOperator (#33174)
<#33174>: @sonmezonur
<https://github.com/sonmezonur>
- KubernetesPodTrigger: add exception stack trace in TriggerEvent
(#35716) <#35716>: @functicons
<https://github.com/functicons>
- Limit pod_name length to HOST_NAME_MAX (#36332)
<#36332>: @csp33
<https://github.com/csp33>
- Move KubernetesPodTrigger hook to a cached property (#36290)
<#36290>: @hussein-awala
<https://github.com/hussein-awala>
- Kubernetes executor running slots leak fix (#36240)
<#36240>: @dirrao
<https://github.com/dirrao>
- list pods performance optimization (#36092)
<#36092>: @dirrao
<https://github.com/dirrao>
Provider common.sql: 1.10.0rc1
<https://pypi.org/project/apache-airflow-providers-common-sql/1.10.0rc1>
- Make "placeholder" of ODBC configurable in UI (#36000)
<#36000>: @jarnedemunter1
<https://github.com/jarnedemunter1>
- SQLCheckOperator fails if it returns dict with any Falsy values
(#36273) <#36273>:
@spencertollefson <https://github.com/spencertollefson>
Provider databricks: 6.0.0rc1
<https://pypi.org/project/apache-airflow-providers-databricks/6.0.0rc1>
- Make databricks sql hook return a namedtuple (#36205)
<#36205>: @Joffreybvn
<https://github.com/Joffreybvn>
- Fix: Implement support for fetchone() in the ODBCHook and the
Databricks SQL Hook (#36161)
<#36161>: @Joffreybvn
<https://github.com/Joffreybvn>
- Bump minimum Airflow version in providers to Airflow 2.6.0 (#36017)
<#36017>: @eladkal
<https://github.com/eladkal>
- fix typos in DatabricksSubmitRunOperator (#36248)
<#36248>: @adam133
<https://github.com/adam133>
- Add code snippet formatting in docstrings via Ruff (#36262)
<#36262>: @josh-fell
<https://github.com/josh-fell>
- Prepare docs 1st wave of Providers December 2023 (#36112)
<#36112>: @eladkal
<https://github.com/eladkal>
- Prepare docs 1st wave of Providers December 2023 RC2 (#36190)
<#36190>: @eladkal
<https://github.com/eladkal>
- Re-apply updated version numbers to 2nd wave of providers in
December (#36380) <#36380>:
@potiuk <https://github.com/potiuk>
- Prepare 2nd wave of providers in December (#36373)
<#36373>: @potiuk
<https://github.com/potiuk>
Provider docker: 3.9.1rc1
<https://pypi.org/project/apache-airflow-providers-docker/3.9.1rc1>
- Allow DockerOperator.skip_on_exit_code to be zero (#36360)
<#36360>: @dolfinus
<https://github.com/dolfinus>
- Remove remaining Airflow 2.5 backcompat code from Docker Provider
(#36325) <#36325>: @Taragolis
<https://github.com/Taragolis>
Provider google: 10.13.0rc3
<https://pypi.org/project/apache-airflow-providers-google/10.13.0rc3>
- GCP Secrets Backend Impersonation (#36072)
<#36072>: @nathadfield
<https://github.com/nathadfield>
- Add OpenLineage support to GCS operators (#35838)
<#35838>: @kacpermuda
<https://github.com/kacpermuda>
- Add support for service account impersonation with
computeEngineSSHHook (google provider) and IAP tunnel (#35136)
<#35136>: @ginolegigot
<https://github.com/ginolegigot>
- Added Datascan Profiling (#35696)
<#35696>: @shourya116
<https://github.com/shourya116>
- Add overrides to template fields of Google Cloud Run Jobs Execute
Operator (#36133) <#36133>:
@guillaumeblaquiere <https://github.com/guillaumeblaquiere>
- Implement deferrable mode for BeamRunJavaPipelineOperator (#36122)
<#36122>: @moiseenkov
<https://github.com/moiseenkov>
- Add ability to run streaming Job for BeamRunPythonPipelineOperator
in non deferrable mode (#36108)
<#36108>: @MaksYermak
<https://github.com/MaksYermak>
- Add use_glob to GCSObjectExistenceSensor (#34137)
<#34137>: @A-Costa
<https://github.com/A-Costa>
- Fix DataprocSubmitJobOperator to retrieve failed job error message
(#36053) <#36053>:
@VladaZakharova <https://github.com/VladaZakharova>
- Fix CloudRunExecuteJobOperator in deferrable mode (#36012)
<#36012>: @VladaZakharova
<https://github.com/VladaZakharova>
- Fix gcs listing - ensure blobs are loaded (#34919)
<#34919>: @atrbgithub
<https://github.com/atrbgithub>
- allow multiple elements in impersonation chain (#35694)
<#35694>: @melugoyal
<https://github.com/melugoyal>
- Change retry type for Google Clients to async ones where needed
(#36141) <#36141>: @potiuk
<https://github.com/potiuk>
- Minor fix to DataprocCreateClusterOperator operator docs. (#36322)
<#36322>: @bjankie1
<https://github.com/bjankie1>
- fix(bigquery.py): pass correct project_id to triggerer (#35200)
<#35200>: @mokshasoul
<https://github.com/mokshasoul>
- iterate through blobs before checking prefixes (#36202)
<#36202>: @Lee-W
<https://github.com/Lee-W>
- Fix incompatibility with google-cloud-monitoring 2.18.0 (#36200)
<#36200>: @potiuk
<https://github.com/potiuk>
- Update retry param typing in PubSubAsyncHook (#36198)
<#36198>: @josh-fell
<https://github.com/josh-fell>
- Bump minimum Airflow version in providers to Airflow 2.6.0 (#36017)
<#36017>: @eladkal
<https://github.com/eladkal>
- Deprecate CloudComposerEnvironmentSensor in favor of
CloudComposerCreateEnvironmentOperator with defer mode (#35775)
<#35775>: @VladaZakharova
<https://github.com/VladaZakharova>
- Allow storage options to be passed (#35820)
<#35820>: @bolkedebruin
<https://github.com/bolkedebruin>
- Add feature to build "chicken-egg" packages from sources (#35890)
<#35890>: @potiuk
<https://github.com/potiuk>
- Remove remaining Airflow 2.5 backcompat code from Google Provider
(#36366) <#36366>: @Taragolis
<https://github.com/Taragolis>
- Move KubernetesPodTrigger hook to a cached property (#36290)
<#36290>: @hussein-awala
<https://github.com/hussein-awala>
- Add code snippet formatting in docstrings via Ruff (#36262)
<#36262>: @josh-fell
<https://github.com/josh-fell>
Provider microsoft.azure: 8.5.0rc1
<https://pypi.org/project/apache-airflow-providers-microsoft-azure/8.5.0rc1>
- Allow storage options to be passed (#35820)
<#35820>: @bolkedebruin
<https://github.com/bolkedebruin>
- AzureFileShareHook fix with connection type "Azure" (#36309)
<#36309>: @abhishekbhakat
<https://github.com/abhishekbhakat>
- Add code snippet formatting in docstrings via Ruff (#36262)
<#36262>: @josh-fell
<https://github.com/josh-fell>
Provider odbc: 4.4.0rc1
<https://pypi.org/project/apache-airflow-providers-odbc/4.4.0rc1>
- Make "placeholder" of ODBC configurable in UI (#36000)
<#36000>: @jarnedemunter1
<https://github.com/jarnedemunter1>
Provider openlineage: 1.3.1rc1
<https://pypi.org/project/apache-airflow-providers-openlineage/1.3.1rc1>
- openlineage: set extractors None in config (#36362)
<#36362>: @JDarDagran
<https://github.com/JDarDagran>
Provider postgres: 5.10.0rc1
<https://pypi.org/project/apache-airflow-providers-postgres/5.10.0rc1>
- Make "placeholder" of ODBC configurable in UI (#36000)
<#36000>: @jarnedemunter1
<https://github.com/jarnedemunter1>
Provider presto: 5.4.0rc1
<https://pypi.org/project/apache-airflow-providers-presto/5.4.0rc1>
- Make "placeholder" of ODBC configurable in UI (#36000)
<#36000>: @jarnedemunter1
<https://github.com/jarnedemunter1>
Provider sftp: 4.8.1rc1
<https://pypi.org/project/apache-airflow-providers-sftp/4.8.1rc1>
- Fix sftp warrning log (#36148)
<#36148>: @romsharon98
<https://github.com/romsharon98>
- Add code snippet formatting in docstrings via Ruff (#36262)
<#36262>: @josh-fell
<https://github.com/josh-fell>
Provider slack: 8.5.1rc1
<https://pypi.org/project/apache-airflow-providers-slack/8.5.1rc1>
- Slack: Remove parameter token in SlackAPIPostOperator's docstring
(#36121) <#36121>: @brienzb
<https://github.com/brienzb>
Provider smtp: 1.6.0rc1
<https://pypi.org/project/apache-airflow-providers-smtp/1.6.0rc1>
- Modify SmtpNotifier to accept template with defaults (#36226)
<#36226>: @vchiapaikeo
<https://github.com/vchiapaikeo>
Provider sqlite: 3.7.0rc1
<https://pypi.org/project/apache-airflow-providers-sqlite/3.7.0rc1>
- Make "placeholder" of ODBC configurable in UI (#36000)
<#36000>: @jarnedemunter1
<https://github.com/jarnedemunter1>
Provider ssh: 3.10.0rc1
<https://pypi.org/project/apache-airflow-providers-ssh/3.10.0rc1>
- Add skip_on_exit_code to SSHOperator (#36303)
<#36303>: @dolfinus
<https://github.com/dolfinus>
- Allow SSHOperator.skip_on_exit_code to be zero (#36358)
<#36358>: @dolfinus
<https://github.com/dolfinus>
- Review and mark found potential SSH security issues by bandit
(#36162) <#36162>: @potiuk
<https://github.com/potiuk>
Provider trino: 5.6.0rc1
<https://pypi.org/project/apache-airflow-providers-trino/5.6.0rc1>
- Make "placeholder" of ODBC configurable in UI (#36000)
<#36000>: @jarnedemunter1
<https://github.com/jarnedemunter1>
Provider weaviate: 1.2.0rc1
<https://pypi.org/project/apache-airflow-providers-weaviate/1.2.0rc1>
- Add helper function for CRUD operations on weaviate's schema and
class objects (#35919) <#35919>:
@utkarsharma2 <https://github.com/utkarsharma2>
- Add retry mechanism and dataframe support for WeaviateIngestOperator
(#36085) <#36085>: @utkarsharma2
<https://github.com/utkarsharma2>
- Fixing template_fields for WeaviateIngestOperator (#36359)
<#36359>: @vatsrahul1001
<https://github.com/vatsrahul1001>
Committer
- I acknowledge that I am a maintainer/committer of the Apache Airflow
project.
—
Reply to this email directly, view it on GitHub
<#36384>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AJPTQJWWCMFL3PLTC7RXVADYKY6DNAVCNFSM6AAAAABBAO623CVHI2DSMVQWIX3LMV43ASLTON2WKOZSGA2TINRSGM4DSOA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
--
Thanks and regards
Dirisala Gopala Rao
Ph: +918861444868
+919494447656
alt. email: ***@***.***
|
Hey Jarek,
Would it be possible to have an RC2 to include this commits
<75d74b1> and <ff3b8da>
for the Weavaite Provider?
Thanks,
Utkarsh Sharma
On Sat, Dec 23, 2023 at 10:18 AM Gopal Dirisala ***@***.***>
wrote:
… I have tested the following Kubernetes provider fixes. They are working as
expected. I am eagerly waiting for the release that has a critical
performance bug.
- Kubernetes executor running slots leak fix (#36240)
<#36240>: @dirrao
<https://github.com/dirrao>
- list pods performance optimization (#36092)
<#36092>: @dirrao
<https://github.com/dirrao>
On Sat, Dec 23, 2023 at 7:53 AM Jarek Potiuk ***@***.***>
wrote:
> Body
>
> I have a kind request for all the contributors to the latest provider
> packages release.
> Could you please help us to test the RC versions of the providers?
>
> The guidelines on how to test providers can be found in
>
> Verify providers by contributors
> <
https://github.com/apache/airflow/blob/main/dev/README_RELEASE_PROVIDER_PACKAGES.md#verify-the-release-candidate-by-contributors>
>
> Let us know in the comment, whether the issue is addressed.
>
> Those are providers that require testing as there were some substantial
> changes introduced:
> Provider airbyte: 3.5.1rc1
> <https://pypi.org/project/apache-airflow-providers-airbyte/3.5.1rc1>
>
> - Cancel airbyte job when timeout exceeded to prevent inconsistency
> amo… (#36241) <#36241>: @shohamy7
> <https://github.com/shohamy7>
>
> Provider amazon: 8.14.0rc1
> <https://pypi.org/project/apache-airflow-providers-amazon/8.14.0rc1>
>
> - Add jsonpath_ng.ext.parse support for SqsSensor (#36170)
> <#36170>: @cliebBS
> <https://github.com/cliebBS>
> - Increase ConflictException retries to 4 total (#36337)
> <#36337>: @o-nikolas
> <https://github.com/o-nikolas>
> - Add more parameter support for the copy_object function in
> airflow.providers.amazon.aws.hooks.s3 (#36304)
> <#36304>: @samezapple
> <https://github.com/samezapple>
> - Allow storage options to be passed (#35820)
> <#35820>: @bolkedebruin
> <https://github.com/bolkedebruin>
> - Remove is_authorized_cluster_activity from auth manager (#36175)
> <#36175>: @vincbeck
> <https://github.com/vincbeck>
> - Add code snippet formatting in docstrings via Ruff (#36262)
> <#36262>: @josh-fell
> <https://github.com/josh-fell>
> - Remove remaining Airflow 2.5 backcompat code from Amazon Provider
> (#36324) <#36324>: @Taragolis
> <https://github.com/Taragolis>
>
> Provider apache.beam: 5.5.0rc1
> <https://pypi.org/project/apache-airflow-providers-apache-beam/5.5.0rc1>
>
> - Add ability to run streaming Job for BeamRunPythonPipelineOperator
> in non deferrable mode (#36108)
> <#36108>: @MaksYermak
> <https://github.com/MaksYermak>
> - Implement deferrable mode for BeamRunJavaPipelineOperator (#36122)
> <#36122>: @moiseenkov
> <https://github.com/moiseenkov>
>
> Provider apache.cassandra: 3.4.1rc1
> <
https://pypi.org/project/apache-airflow-providers-apache-cassandra/3.4.1rc1>
>
> - Sanitize keyspace and table in the cassandra query to avoid sql
> injection (#36111) <#36111>:
> @hussein-awala <https://github.com/hussein-awala>
> - Add code snippet formatting in docstrings via Ruff (#36262)
> <#36262>: @josh-fell
> <https://github.com/josh-fell>
>
> Provider apache.hdfs: 4.3.1rc1
> <https://pypi.org/project/apache-airflow-providers-apache-hdfs/4.3.1rc1>
>
> - fix connection type webhdfs (#36145)
> <#36145>: @ankurbajaj9
> <https://github.com/ankurbajaj9>
>
> Provider apache.hive: 6.4.0rc1
> <https://pypi.org/project/apache-airflow-providers-apache-hive/6.4.0rc1>
>
> - add param proxy user for hive (#36221)
> <#36221>: @romsharon98
> <https://github.com/romsharon98>
> - Add code snippet formatting in docstrings via Ruff (#36262)
> <#36262>: @josh-fell
> <https://github.com/josh-fell>
>
> Provider apache.kafka: 1.3.1rc1
> <https://pypi.org/project/apache-airflow-providers-apache-kafka/1.3.1rc1>
>
> - Provide the consumed message to consumer.commit in
> AwaitMessageTrigger (#36272)
> <#36272>: @hussein-awala
> <https://github.com/hussein-awala>
>
> Provider apache.spark: 4.6.0rc1
> <https://pypi.org/project/apache-airflow-providers-apache-spark/4.6.0rc1>
>
> - SparkSubmit: Adding propertyfiles option (#36164)
> <#36164>: @pateash
> <https://github.com/pateash>
> - SparkSubmit Connection Extras can be overridden (#36151)
> <#36151>: @pateash
> <https://github.com/pateash>
>
> Provider celery: 3.5.1rc1
> <https://pypi.org/project/apache-airflow-providers-celery/3.5.1rc1>
>
> - Fix sentinel_kwargs load from ENV (#36318)
> <#36318>: @vaaalik
> <https://github.com/vaaalik>
>
> Provider cncf.kubernetes: 7.12.0rc1
> <
https://pypi.org/project/apache-airflow-providers-cncf-kubernetes/7.12.0rc1>
>
> - Add _request_timeout to KPO log fetch calls (#36297)
> <#36297>: @dstandish
> <https://github.com/dstandish>
> - Add pod_template_dict field to KubernetesPodOperator (#33174)
> <#33174>: @sonmezonur
> <https://github.com/sonmezonur>
> - KubernetesPodTrigger: add exception stack trace in TriggerEvent
> (#35716) <#35716>: @functicons
> <https://github.com/functicons>
> - Limit pod_name length to HOST_NAME_MAX (#36332)
> <#36332>: @csp33
> <https://github.com/csp33>
> - Move KubernetesPodTrigger hook to a cached property (#36290)
> <#36290>: @hussein-awala
> <https://github.com/hussein-awala>
> - Kubernetes executor running slots leak fix (#36240)
> <#36240>: @dirrao
> <https://github.com/dirrao>
> - list pods performance optimization (#36092)
> <#36092>: @dirrao
> <https://github.com/dirrao>
>
> Provider common.sql: 1.10.0rc1
> <https://pypi.org/project/apache-airflow-providers-common-sql/1.10.0rc1>
>
> - Make "placeholder" of ODBC configurable in UI (#36000)
> <#36000>: @jarnedemunter1
> <https://github.com/jarnedemunter1>
> - SQLCheckOperator fails if it returns dict with any Falsy values
> (#36273) <#36273>:
> @spencertollefson <https://github.com/spencertollefson>
>
> Provider databricks: 6.0.0rc1
> <https://pypi.org/project/apache-airflow-providers-databricks/6.0.0rc1>
>
> - Make databricks sql hook return a namedtuple (#36205)
> <#36205>: @Joffreybvn
> <https://github.com/Joffreybvn>
> - Fix: Implement support for fetchone() in the ODBCHook and the
> Databricks SQL Hook (#36161)
> <#36161>: @Joffreybvn
> <https://github.com/Joffreybvn>
> - Bump minimum Airflow version in providers to Airflow 2.6.0 (#36017)
> <#36017>: @eladkal
> <https://github.com/eladkal>
> - fix typos in DatabricksSubmitRunOperator (#36248)
> <#36248>: @adam133
> <https://github.com/adam133>
> - Add code snippet formatting in docstrings via Ruff (#36262)
> <#36262>: @josh-fell
> <https://github.com/josh-fell>
> - Prepare docs 1st wave of Providers December 2023 (#36112)
> <#36112>: @eladkal
> <https://github.com/eladkal>
> - Prepare docs 1st wave of Providers December 2023 RC2 (#36190)
> <#36190>: @eladkal
> <https://github.com/eladkal>
> - Re-apply updated version numbers to 2nd wave of providers in
> December (#36380) <#36380>:
> @potiuk <https://github.com/potiuk>
> - Prepare 2nd wave of providers in December (#36373)
> <#36373>: @potiuk
> <https://github.com/potiuk>
>
> Provider docker: 3.9.1rc1
> <https://pypi.org/project/apache-airflow-providers-docker/3.9.1rc1>
>
> - Allow DockerOperator.skip_on_exit_code to be zero (#36360)
> <#36360>: @dolfinus
> <https://github.com/dolfinus>
> - Remove remaining Airflow 2.5 backcompat code from Docker Provider
> (#36325) <#36325>: @Taragolis
> <https://github.com/Taragolis>
>
> Provider google: 10.13.0rc3
> <https://pypi.org/project/apache-airflow-providers-google/10.13.0rc3>
>
> - GCP Secrets Backend Impersonation (#36072)
> <#36072>: @nathadfield
> <https://github.com/nathadfield>
> - Add OpenLineage support to GCS operators (#35838)
> <#35838>: @kacpermuda
> <https://github.com/kacpermuda>
> - Add support for service account impersonation with
> computeEngineSSHHook (google provider) and IAP tunnel (#35136)
> <#35136>: @ginolegigot
> <https://github.com/ginolegigot>
> - Added Datascan Profiling (#35696)
> <#35696>: @shourya116
> <https://github.com/shourya116>
> - Add overrides to template fields of Google Cloud Run Jobs Execute
> Operator (#36133) <#36133>:
> @guillaumeblaquiere <https://github.com/guillaumeblaquiere>
> - Implement deferrable mode for BeamRunJavaPipelineOperator (#36122)
> <#36122>: @moiseenkov
> <https://github.com/moiseenkov>
> - Add ability to run streaming Job for BeamRunPythonPipelineOperator
> in non deferrable mode (#36108)
> <#36108>: @MaksYermak
> <https://github.com/MaksYermak>
> - Add use_glob to GCSObjectExistenceSensor (#34137)
> <#34137>: @A-Costa
> <https://github.com/A-Costa>
> - Fix DataprocSubmitJobOperator to retrieve failed job error message
> (#36053) <#36053>:
> @VladaZakharova <https://github.com/VladaZakharova>
> - Fix CloudRunExecuteJobOperator in deferrable mode (#36012)
> <#36012>: @VladaZakharova
> <https://github.com/VladaZakharova>
> - Fix gcs listing - ensure blobs are loaded (#34919)
> <#34919>: @atrbgithub
> <https://github.com/atrbgithub>
> - allow multiple elements in impersonation chain (#35694)
> <#35694>: @melugoyal
> <https://github.com/melugoyal>
> - Change retry type for Google Clients to async ones where needed
> (#36141) <#36141>: @potiuk
> <https://github.com/potiuk>
> - Minor fix to DataprocCreateClusterOperator operator docs. (#36322)
> <#36322>: @bjankie1
> <https://github.com/bjankie1>
> - fix(bigquery.py): pass correct project_id to triggerer (#35200)
> <#35200>: @mokshasoul
> <https://github.com/mokshasoul>
> - iterate through blobs before checking prefixes (#36202)
> <#36202>: @Lee-W
> <https://github.com/Lee-W>
> - Fix incompatibility with google-cloud-monitoring 2.18.0 (#36200)
> <#36200>: @potiuk
> <https://github.com/potiuk>
> - Update retry param typing in PubSubAsyncHook (#36198)
> <#36198>: @josh-fell
> <https://github.com/josh-fell>
> - Bump minimum Airflow version in providers to Airflow 2.6.0 (#36017)
> <#36017>: @eladkal
> <https://github.com/eladkal>
> - Deprecate CloudComposerEnvironmentSensor in favor of
> CloudComposerCreateEnvironmentOperator with defer mode (#35775)
> <#35775>: @VladaZakharova
> <https://github.com/VladaZakharova>
> - Allow storage options to be passed (#35820)
> <#35820>: @bolkedebruin
> <https://github.com/bolkedebruin>
> - Add feature to build "chicken-egg" packages from sources (#35890)
> <#35890>: @potiuk
> <https://github.com/potiuk>
> - Remove remaining Airflow 2.5 backcompat code from Google Provider
> (#36366) <#36366>: @Taragolis
> <https://github.com/Taragolis>
> - Move KubernetesPodTrigger hook to a cached property (#36290)
> <#36290>: @hussein-awala
> <https://github.com/hussein-awala>
> - Add code snippet formatting in docstrings via Ruff (#36262)
> <#36262>: @josh-fell
> <https://github.com/josh-fell>
>
> Provider microsoft.azure: 8.5.0rc1
> <
https://pypi.org/project/apache-airflow-providers-microsoft-azure/8.5.0rc1>
>
> - Allow storage options to be passed (#35820)
> <#35820>: @bolkedebruin
> <https://github.com/bolkedebruin>
> - AzureFileShareHook fix with connection type "Azure" (#36309)
> <#36309>: @abhishekbhakat
> <https://github.com/abhishekbhakat>
> - Add code snippet formatting in docstrings via Ruff (#36262)
> <#36262>: @josh-fell
> <https://github.com/josh-fell>
>
> Provider odbc: 4.4.0rc1
> <https://pypi.org/project/apache-airflow-providers-odbc/4.4.0rc1>
>
> - Make "placeholder" of ODBC configurable in UI (#36000)
> <#36000>: @jarnedemunter1
> <https://github.com/jarnedemunter1>
>
> Provider openlineage: 1.3.1rc1
> <https://pypi.org/project/apache-airflow-providers-openlineage/1.3.1rc1>
>
> - openlineage: set extractors None in config (#36362)
> <#36362>: @JDarDagran
> <https://github.com/JDarDagran>
>
> Provider postgres: 5.10.0rc1
> <https://pypi.org/project/apache-airflow-providers-postgres/5.10.0rc1>
>
> - Make "placeholder" of ODBC configurable in UI (#36000)
> <#36000>: @jarnedemunter1
> <https://github.com/jarnedemunter1>
>
> Provider presto: 5.4.0rc1
> <https://pypi.org/project/apache-airflow-providers-presto/5.4.0rc1>
>
> - Make "placeholder" of ODBC configurable in UI (#36000)
> <#36000>: @jarnedemunter1
> <https://github.com/jarnedemunter1>
>
> Provider sftp: 4.8.1rc1
> <https://pypi.org/project/apache-airflow-providers-sftp/4.8.1rc1>
>
> - Fix sftp warrning log (#36148)
> <#36148>: @romsharon98
> <https://github.com/romsharon98>
> - Add code snippet formatting in docstrings via Ruff (#36262)
> <#36262>: @josh-fell
> <https://github.com/josh-fell>
>
> Provider slack: 8.5.1rc1
> <https://pypi.org/project/apache-airflow-providers-slack/8.5.1rc1>
>
> - Slack: Remove parameter token in SlackAPIPostOperator's docstring
> (#36121) <#36121>: @brienzb
> <https://github.com/brienzb>
>
> Provider smtp: 1.6.0rc1
> <https://pypi.org/project/apache-airflow-providers-smtp/1.6.0rc1>
>
> - Modify SmtpNotifier to accept template with defaults (#36226)
> <#36226>: @vchiapaikeo
> <https://github.com/vchiapaikeo>
>
> Provider sqlite: 3.7.0rc1
> <https://pypi.org/project/apache-airflow-providers-sqlite/3.7.0rc1>
>
> - Make "placeholder" of ODBC configurable in UI (#36000)
> <#36000>: @jarnedemunter1
> <https://github.com/jarnedemunter1>
>
> Provider ssh: 3.10.0rc1
> <https://pypi.org/project/apache-airflow-providers-ssh/3.10.0rc1>
>
> - Add skip_on_exit_code to SSHOperator (#36303)
> <#36303>: @dolfinus
> <https://github.com/dolfinus>
> - Allow SSHOperator.skip_on_exit_code to be zero (#36358)
> <#36358>: @dolfinus
> <https://github.com/dolfinus>
> - Review and mark found potential SSH security issues by bandit
> (#36162) <#36162>: @potiuk
> <https://github.com/potiuk>
>
> Provider trino: 5.6.0rc1
> <https://pypi.org/project/apache-airflow-providers-trino/5.6.0rc1>
>
> - Make "placeholder" of ODBC configurable in UI (#36000)
> <#36000>: @jarnedemunter1
> <https://github.com/jarnedemunter1>
>
> Provider weaviate: 1.2.0rc1
> <https://pypi.org/project/apache-airflow-providers-weaviate/1.2.0rc1>
>
> - Add helper function for CRUD operations on weaviate's schema and
> class objects (#35919) <#35919>:
> @utkarsharma2 <https://github.com/utkarsharma2>
> - Add retry mechanism and dataframe support for WeaviateIngestOperator
> (#36085) <#36085>: @utkarsharma2
> <https://github.com/utkarsharma2>
> - Fixing template_fields for WeaviateIngestOperator (#36359)
> <#36359>: @vatsrahul1001
> <https://github.com/vatsrahul1001>
>
> Committer
>
> - I acknowledge that I am a maintainer/committer of the Apache Airflow
> project.
>
> —
> Reply to this email directly, view it on GitHub
> <#36384>, or unsubscribe
> <
https://github.com/notifications/unsubscribe-auth/AJPTQJWWCMFL3PLTC7RXVADYKY6DNAVCNFSM6AAAAABBAO623CVHI2DSMVQWIX3LMV43ASLTON2WKOZSGA2TINRSGM4DSOA>
> .
> You are receiving this because you were mentioned.Message ID:
> ***@***.***>
>
--
Thanks and regards
Dirisala Gopala Rao
Ph: +918861444868
+919494447656
alt. email: ***@***.***
—
Reply to this email directly, view it on GitHub
<#36384 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADDLAHIJQIVE7E4OTRHLQUTYKZPATAVCNFSM6AAAAABBAO623CVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNRYGIYDIOJZGQ>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
I am not sure if there will be RC2 for Waeviate, but we can release a new version right after this one gets released as well. I've been slightly improving the release process to make it possible (and easy) to have a more-or-less continuous release process. I have one more small things to fix but as I explained to @eladkal in a separate discussion, basically we should be able to start a new release process right after we release previous wave - and continue doing it all the time - even if some providers get removed from the previouse wave. This should streamline our release process, where we should be able to make provider releases even more frequently - maybe even once a week - especially if we also introduce more "rotating" position of release manager, so maybe that's a good thing to try it right away :). |
As you can see - for example - in the past wave we had to withdraw google and databricks, so while we are releasing new changes in other providers, in this wave the google and databricks providers are also included (google with RC3 and databricks with new - breaking - 6.0.0 version rc1). All this is going to be pretty much fully automated after one more small thing I will need to add and we should be able just carry-over such previous providers to new wave, which means that no matter what the result of voting is, we will be able to start new wave right after we release the previous one. That will also make our release waves smaller. |
Ok for #36133 |
I've tested #36241, all looks good |
I tested #36273. All good. |
I tested all my changes, and all work as expected. |
I won't get a chance to test the RC until late next week, but #36248 worked when I ran the changes locally. |
I am not sure if there will be RC2 for Waeviate, but we can release a new
version right after this one gets released as well.
Thanks for your consideration, releasing a new version after this release
also works for me.
I've been slightly improving the release process to make it possible (and
easy) to have a more-or-less continuous release process. I have one more
small things to fix but as I explained to @eladkal
<https://github.com/eladkal> in a separate discussion, basically we
should be able to start a new release process right after we release
previous wave - and continue doing it all the time - even if some providers
get removed from the previouse wave. This should streamline our release
process, where we should be able to make provider releases even more
frequently - maybe even once a week - especially if we also introduce more
"rotating" position of release manager, so maybe that's a good thing to try
it right away :).
As you can see - for example - in the past wave we had to withdraw google
and databricks, so while we are releasing new changes in other providers,
in this wave the google and databricks providers are also included (google
with RC3 and databricks with new - breaking - 6.0.0 version rc1). All this
is going to be pretty much fully automated after one more small thing I
will need to add and we should be able just carry-over such previous
providers to new wave, which means that no matter what the result of voting
is, we will be able to start new wave right after we release the previous
one.
That will also make our release waves smaller.
Considering providers are the most changed code in airflow's codebase, this
sounds like a real time saver for getting the code out quickly. :)
…On Sun, Dec 24, 2023 at 4:15 AM Adam B ***@***.***> wrote:
I won't get a chance to test the RC until late next week, but #36248
<#36248> worked when I ran the
changes locally.
—
Reply to this email directly, view it on GitHub
<#36384 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADDLAHMSXM2MMUCRLFOFZ7DYK5NKPAVCNFSM6AAAAABBAO623CVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQNRYGM4DGMRQGA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
This probably also needs changing the connection type in default connections #36145 . but in itself it is ok |
Has anyone noticed unusual behavior with the DagFileProcessor lately? I've observed some unusual behavior where a relatively simple dag is continually getting removed and added back. I went ahead and reverted my change locally to see if I could still repro this bug and I could so do not think it is related. However, it is hard to test my change because this simple dag seems to be continually getting removed and re-added. Logs below - Scheduler Logs
Dag File Processor Logs
my_dag.py dag file import datetime
from airflow import DAG
from airflow.operators.bash import BashOperator
# from airflow.providers.smtp.notifications.smtp import SmtpNotifier
from datetime import timedelta
EMAIL = "xyz@gmail.com"
with DAG(
dag_id="my_dag",
start_date=datetime.datetime(2021, 1, 1),
schedule="*/5 * * * *",
max_active_tasks=1,
max_active_runs=1,
# sla_miss_callback=SmtpNotifier(from_email=EMAIL, to=EMAIL),
):
BashOperator(
task_id="task",
bash_command="sleep 120",
retries=2,
sla=timedelta(seconds=30),
# on_success_callback=SmtpNotifier(from_email=EMAIL, to=EMAIL),
# on_failure_callback=SmtpNotifier(from_email=EMAIL, to=EMAIL),
# on_retry_callback=SmtpNotifier(from_email=EMAIL, to=EMAIL),
) |
I think there are some main changes that influence that - I already saw one change while I was testing Python Client #36377 - but it works fine with 2.8.0 - so maybe you can test it with 2.8.0 ? |
BTW. Happy Holidays everyone ! - this one will continue after 26th of December 🎄 |
Back after holidays :) |
All good on #36072 |
Thank you everyone. Providers are released I invite everyone to help improve providers for the next release, a list of open issues can be found here |
Hello! |
Released constraints are almost never updated (the only case they are updated is when for some reason airflow is not installable with them). This is explained here https://airflow.apache.org/docs/apache-airflow/stable/installation/installing-from-pypi.html#constraints-files : constraints are there to provide reproducible installation of Airflow - with THE SAME packages that it was released with originally (including providers). If you want to upgrade to latest providers, you should install them without constraints. This is explained in https://airflow.apache.org/docs/apache-airflow/stable/installation/installing-from-pypi.html#installation-and-upgrade-scenarios and in this talk from Airflow Summit 2023: https://airflowsummit.org/sessions/2023/mastering-dependencies-the-airflow-way/ (I recommend reading it / watching the talk). The constraints are updated (automatically, when all unit and integration tests pass in related branch):
In case of https://github.com/apache/airflow/blob/constraints-v2-8-test/constraints-3.10.txt - when we release 2.8.1, those constraints will become 2.8.1 and (as well as 2.8.0) will be frozen and will not be updated any more. |
Many thanks! |
Body
I have a kind request for all the contributors to the latest provider packages release.
Could you please help us to test the RC versions of the providers?
The guidelines on how to test providers can be found in
Verify providers by contributors
Let us know in the comment, whether the issue is addressed.
Those are providers that require testing as there were some substantial changes introduced:
Provider airbyte: 3.5.1rc1
Provider amazon: 8.14.0rc1
is_authorized_cluster_activity
from auth manager (#36175): @vincbeckProvider apache.beam: 5.5.0rc1
Provider apache.cassandra: 3.4.1rc1
Provider apache.hdfs: 4.3.1rc1
Provider apache.hive: 6.4.0rc1
Provider apache.kafka: 1.3.1rc1
Provider apache.spark: 4.6.0rc1
Provider celery: 3.5.1rc1
sentinel_kwargs
load from ENV (#36318): @vaaalikProvider cncf.kubernetes: 7.12.0rc1
pod_template_dict
field toKubernetesPodOperator
(#33174): @sonmezonurProvider common.sql: 1.10.0rc1
Provider databricks: 6.0.0rc1
fetchone()
in the ODBCHook and the Databricks SQL Hook (#36161): @JoffreybvnProvider docker: 3.9.1rc1
Provider google: 10.13.0rc3
retry
param typing in PubSubAsyncHook (#36198): @josh-fellCloudComposerEnvironmentSensor
in favor ofCloudComposerCreateEnvironmentOperator
with defer mode (#35775): @VladaZakharovaProvider microsoft.azure: 8.5.0rc1
Provider odbc: 4.4.0rc1
Provider openlineage: 1.3.1rc1
Provider postgres: 5.10.0rc1
Provider presto: 5.4.0rc1
Provider sftp: 4.8.1rc1
Provider slack: 8.5.1rc1
token
in SlackAPIPostOperator's docstring (#36121): @brienzbProvider smtp: 1.6.0rc1
Provider sqlite: 3.7.0rc1
Provider ssh: 3.10.0rc1
Provider trino: 5.6.0rc1
Provider weaviate: 1.2.0rc1
Committer
The text was updated successfully, but these errors were encountered: