You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With Druid 24.0.0 the team included the possibility to start an ingestion using SQL. This uses a new SQL engine called multi-stage-query. Changelog here.
This seems to not be supported in pydruid (using SqlAlchemy). If i run
importsqlalchemyassqlaquery="""INSERT INTO w000SELECT TIME_PARSE("timestamp") AS __time, isRobot, channel, flags, isUnpatrolled, page, diffUrl, added, comment, commentLength, isNew, isMinor, delta, isAnonymous, user, deltaBucket, deleted, namespace, cityName, countryName, regionIsoCode, metroCode, countryIsoCode, regionNameFROM TABLE( EXTERN( '{"type":"http","uris":["https://druid.apache.org/data/wikipedia.json.gz"]}', '{"type":"json"}', '[{"name":"isRobot","type":"string"},{"name":"channel","type":"string"},{"name":"timestamp","type":"string"},{"name":"flags","type":"string"},{"name":"isUnpatrolled","type":"string"},{"name":"page","type":"string"},{"name":"diffUrl","type":"string"},{"name":"added","type":"long"},{"name":"comment","type":"string"},{"name":"commentLength","type":"long"},{"name":"isNew","type":"string"},{"name":"isMinor","type":"string"},{"name":"delta","type":"long"},{"name":"isAnonymous","type":"string"},{"name":"user","type":"string"},{"name":"deltaBucket","type":"long"},{"name":"deleted","type":"long"},{"name":"namespace","type":"string"},{"name":"cityName","type":"string"},{"name":"countryName","type":"string"},{"name":"regionIsoCode","type":"string"},{"name":"metroCode","type":"long"},{"name":"countryIsoCode","type":"string"},{"name":"regionName","type":"string"}]' ))PARTITIONED BY HOURCLUSTERED BY channel"""engine=sqla.create_engine("druid://localhost:8888/druid/v2/sql/")
engine.execute(query)
I'm not entirely sure about it but checked how the UI console does the engine switch and basically the MSQ is a separate endpoint. Normal SQL queries are sent to /druid/v2/sql while MSQ queries go to /druid/v2/sql/task. Unfortunately just changing the URL didn't work for me; possibly the MSQ endpoint isn't quite compatible.
Hi everyone.
With Druid 24.0.0 the team included the possibility to start an ingestion using SQL. This uses a new SQL engine called
multi-stage-query
. Changelog here.This seems to not be supported in
pydruid
(using SqlAlchemy). If i runi get
Using:
pydruid[sqlalchemy]==0.6.5
sqlalchemy==1.4.46
Is there a way to specify a different engine? it will be great to use
sql-msq-task
instead ofnative
.The text was updated successfully, but these errors were encountered: