bug: multiple reads in source's internal table #5590

Closed
tabVersion opened this issue Sep 27, 2022 · 11 comments · Fixed by #6081
tabVersion (Contributor) commented Sep 27, 2022

Describe the bug

The source writes one row to its state table, but select * from the internal table returns multiple identical rows.

This issue was introduced in #5433.

To Reproduce

  1. Start a cluster with Kafka (./risedev d full & ./scripts/source/prepare_ci_kafka.sh).
  2. Create a table with a Kafka source: create table s2 (v1 int, v2 varchar) with (connector = 'kafka', topic = 'kafka_2_partition_topic', properties.bootstrap.server = '127.0.0.1:29092', scan.startup.mode = 'earliest') row format json;
  3. Get the internal table name from the dashboard, e.g. __internal_s2_2_sourceinternaltable_1003.
  4. Run select * from __internal_s2_2_sourceinternaltable_1003;
dev=> select * from __internal_s2_2_sourceinternaltable_1003 ;
 partition_id |                                               offset
--------------+-----------------------------------------------------------------------------------------------------
 0            |                                                                                                    +
              | \x05kafka\x12U{"topic":"kafka_2_partition_topic","partition":0,"start_offset":3,"stop_offset":null}
 0            |                                                                                                    +
              | \x05kafka\x12U{"topic":"kafka_2_partition_topic","partition":0,"start_offset":3,"stop_offset":null}
 0            |                                                                                                    +
              | \x05kafka\x12U{"topic":"kafka_2_partition_topic","partition":0,"start_offset":3,"stop_offset":null}
 0            |                                                                                                    +
              | \x05kafka\x12U{"topic":"kafka_2_partition_topic","partition":0,"start_offset":3,"stop_offset":null}
(4 rows)
The commands used:
./risedev d full
./scripts/source/prepare_ci_kafka.sh
psql -h localhost -p 4566 -d dev -U root -c "create table s2 (v1 int, v2 varchar) with ( connector = 'kafka', topic = 'kafka_2_partition_topic', properties.bootstrap.server = '127.0.0.1:29092', scan.startup.mode = 'earliest') row format json;"

Expected behavior

1 row

Additional context

None

tabVersion added the type/bug label Sep 27, 2022
github-actions bot added this to the release-0.1.14 milestone Sep 27, 2022
st1page (Contributor) commented Sep 28, 2022

"parallelism": 4 in batch plan.
explain (distsql) select * from "__internal_s2_2_sourceinternaltable_1003";
 {
   "root_stage_id": 0,
   "stages": {
     "1": {
       "root": {
         "plan_node_id": 20,
         "plan_node_type": "BatchSeqScan",
         "schema": [
           {
             "dataType": {
               "typeName": "VARCHAR",
               "isNullable": true
             },
             "name": "__internal_s2_2_sourceinternaltable_1003.partition_id"
           },
           {
             "dataType": {
               "typeName": "VARCHAR",
               "isNullable": true
             },
             "name": "__internal_s2_2_sourceinternaltable_1003.offset"
           }
         ],
         "children": [],
         "source_stage_id": null
       },
       "parallelism": 4,
       "exchange_info": {
         "mode": "SINGLE"
       }
     },
     "0": {
       "root": {
         "plan_node_id": 21,
         "plan_node_type": "BatchExchange",
         "schema": [
           {
             "dataType": {
               "typeName": "VARCHAR",
               "isNullable": true
             },
             "name": "__internal_s2_2_sourceinternaltable_1003.partition_id"
           },
           {
             "dataType": {
               "typeName": "VARCHAR",
               "isNullable": true
             },
             "name": "__internal_s2_2_sourceinternaltable_1003.offset"
           }
         ],
         "children": [],
         "source_stage_id": 1
       },
       "parallelism": 1,
       "exchange_info": {
         "mode": "SINGLE"
       }
     }
   },
   "child_edges": {
     "1": [],
     "0": [
       1
     ]
   },
   "parent_edges": {
     "1": [
       0
     ],

BugenZhao (Member) commented
After #5907, we will panic with "The stage has single distribution, but contains a table scan node with multiple partitions." 😄

Actually, it's a little bit tricky to get the correct result. If we schedule multiple tasks, we'll get duplicated rows as there's no way to prune the distribution. If we schedule a single task, then it may behave strangely with #5850.
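To make the first case concrete, here is a minimal standalone sketch in plain Rust (not RisingWave code; the row contents are made up): every task runs an unpruned full scan of the state table and the single gather stage just concatenates their outputs, so the one stored row comes back once per task, matching the four identical rows above.

```rust
fn main() {
    // One row in the source's state table (contents are illustrative).
    let table = vec![("0", "kafka partition 0 offset state")];
    let parallelism: usize = 4;

    // Each scheduled task scans the whole table, because nothing prunes
    // the rows each task is responsible for.
    let gathered: Vec<_> = (0..parallelism)
        .flat_map(|_task| table.iter().cloned())
        .collect();

    // The SINGLE-mode gather stage just concatenates the task outputs:
    // 4 identical rows instead of 1.
    assert_eq!(gathered.len(), parallelism * table.len());
}
```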

tabVersion (Contributor, Author) commented
Reopening this issue because the case is still there.

Just checked on the latest main (25f6655)

[screenshot]

BugenZhao (Member) commented
There seems to be something wrong with the distribution in BatchSeqScan::to_local: we should make it a singleton for the source's internal tables. cc @kwannoel, would you please help take a look?
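A minimal sketch of that suggestion, under the assumption that the scan node can tell whether it targets a source's internal state table (the names and the Distribution enum below are illustrative, not the actual RisingWave types): report a singleton distribution for such tables so only one task is scheduled, and keep the shard-based distribution otherwise.

```rust
// Illustrative only; these names and types are not from the RisingWave codebase.
enum Distribution {
    Single,
    SomeShard,
}

fn scan_distribution_in_local_plan(is_source_internal_table: bool) -> Distribution {
    if is_source_internal_table {
        // One task scans the whole state table, so each row is read exactly once.
        Distribution::Single
    } else {
        // Regular tables keep a shard-based distribution so the scan can be
        // split across parallel tasks.
        Distribution::SomeShard
    }
}

fn main() {
    // A scan over a source's internal state table would be scheduled as a single task.
    assert!(matches!(
        scan_distribution_in_local_plan(true),
        Distribution::Single
    ));
}
```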

@kwannoel kwannoel self-assigned this Jan 30, 2023
kwannoel (Contributor) commented Jan 30, 2023

> There seems to be something wrong with the distribution in BatchSeqScan::to_local: we should make it a singleton for the source's internal tables. cc @kwannoel, would you please help take a look?

To provide some context, the distribution for BatchSeqScan on non-system tables is SomeShard after this PR: #7240

} else {
    // NOTE(kwannoel): This is a hack to force an exchange to always be inserted before scan.
    Distribution::SomeShard
};

By making the distribution SomeShard, an Exchange would be inserted.
This forces BatchSeqScan to be executed on the compute node instead of the frontend.
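A rough sketch of that mechanism with made-up names (not the real planner API): when a child's distribution does not satisfy what its parent requires, the planner inserts an Exchange between them, so a SomeShard scan under a Single root always ends up below an Exchange and therefore runs on the compute node.

```rust
// Illustrative only; not the actual RisingWave planner API.
#[derive(Clone, Copy, PartialEq, Debug)]
enum Distribution {
    Single,
    SomeShard,
}

#[derive(Debug)]
enum Plan {
    Scan(Distribution),
    Exchange(Distribution, Box<Plan>),
}

// Wrap `child` in an Exchange whenever its distribution does not already
// satisfy the distribution required by the parent.
fn satisfy(required: Distribution, child: Plan) -> Plan {
    let child_dist = match &child {
        Plan::Scan(d) | Plan::Exchange(d, _) => *d,
    };
    if child_dist == required {
        child
    } else {
        Plan::Exchange(required, Box::new(child))
    }
}

fn main() {
    // The query root requires a Single distribution, so a SomeShard scan
    // always ends up under an Exchange.
    let plan = satisfy(Distribution::Single, Plan::Scan(Distribution::SomeShard));
    println!("{plan:?}"); // Exchange(Single, Scan(SomeShard))
}
```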


If BatchSeqScan uses SomeShard, it will just defer to table_scan_info to get the number of partitions and infer the parallelism from there:

if let Some(table_scan_info) = &table_scan_info {
    table_scan_info
        .partitions
        .as_ref()
        .map(|m| m.len())
        .unwrap_or(1)

Is it correct to assume that each worker thread should scan independent partitions?

And if so, there should be no duplicated results, even with Distribution::SomeShard? 🤔
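For what it's worth, a toy sketch of that expectation (data and names are illustrative, not the actual vnode mapping code): if every vnode is owned by exactly one task, concatenating the per-task scans returns each state row exactly once even with four tasks; duplicates appear only when several tasks are handed overlapping or full vnode sets.

```rust
use std::collections::HashMap;

fn main() {
    // One state row, keyed by the vnode it hashes to (values are illustrative).
    let rows: Vec<(u8, &str)> = vec![(0, "kafka partition 0 offset state")];

    // A correct mapping: each vnode is owned by exactly one of the four tasks.
    let owned_vnodes: HashMap<usize, Vec<u8>> =
        HashMap::from([(0, vec![0]), (1, vec![]), (2, vec![]), (3, vec![])]);

    // Each task scans only the vnodes it owns; the gather stage concatenates.
    let mut gathered = Vec::new();
    for vnodes in owned_vnodes.values() {
        for (vnode, row) in &rows {
            if vnodes.contains(vnode) {
                gathered.push(*row);
            }
        }
    }

    // With disjoint vnode ownership the row is read exactly once,
    // even though four tasks were scheduled.
    assert_eq!(gathered.len(), 1);
}
```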

liurenjie1024 changed the title from "multiple reads in source's internal table" to "bug: multiple reads in source's internal table" Feb 1, 2023
liurenjie1024 (Contributor) commented
Yes, I think this is caused by an incorrect vnode mapping for the internal table; I'll take a look at it.

tabVersion (Contributor, Author) commented
Any updates?

liurenjie1024 (Contributor) commented
I'll look into this soon.

fuyufjh (Member) commented Sep 11, 2023

Any updates?

fuyufjh modified the milestones: release-1.2, release-1.3 Sep 11, 2023
liurenjie1024 (Contributor) commented
Will look into it later.

liurenjie1024 modified the milestones: release-1.4, release-1.5 Nov 8, 2023
liurenjie1024 removed this from the release-1.5 milestone Dec 4, 2023
BugenZhao added this to the release-1.6 milestone Jan 2, 2024
liurenjie1024 modified the milestones: release-1.6, release-1.7 Jan 9, 2024
tabVersion (Contributor, Author) commented
Closing as there are no updates.

tabVersion closed this as not planned (won't fix, can't repro, duplicate, stale) Mar 6, 2024