Take component size into account for Q13(a) and Q14(a) variants #197

szarnyasg · 2022-10-13T18:29:28Z

To make the queries really challenging, in the paramgen we could select components that are roughly similar in size -- and reasonably large, if possible.

szarnyasg · 2022-10-13T21:58:17Z

Getting two large components may not be possible... E.g. in the SF10,000 data set, most people are in a single large CC while the rest are all isolated nodes.

D create or replace table cc as select * from read_parquet('factors-sf10000/parquet/raw/composite-merged-fk/personKnowsPersonConnected/*.parquet');
D select distinct component, count from cc order by count desc limit 5;
┌────────────────┬──────────┐
│   Component    │  count   │
├────────────────┼──────────┤
│ 14             │ 26519261 │
│ 10995121104613 │ 1        │
│ 10995121125221 │ 1        │
│ 10995121148539 │ 1        │
│ 10995121161903 │ 1        │
└────────────────┴──────────┘

szarnyasg · 2023-03-28T17:40:28Z

See the related issue for the BI workload.

szarnyasg self-assigned this Oct 13, 2022

szarnyasg changed the title ~~Take component size into account Q13(a) and Q14(a) variants~~ Take component size into account for Q13(a) and Q14(a) variants Oct 13, 2022

szarnyasg added the enhancement label Nov 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Take component size into account for Q13(a) and Q14(a) variants #197

Take component size into account for Q13(a) and Q14(a) variants #197

szarnyasg commented Oct 13, 2022 •

edited

Loading

szarnyasg commented Oct 13, 2022 •

edited

Loading

szarnyasg commented Mar 28, 2023

Take component size into account for Q13(a) and Q14(a) variants #197

Take component size into account for Q13(a) and Q14(a) variants #197

Comments

szarnyasg commented Oct 13, 2022 • edited Loading

szarnyasg commented Oct 13, 2022 • edited Loading

szarnyasg commented Mar 28, 2023

szarnyasg commented Oct 13, 2022 •

edited

Loading

szarnyasg commented Oct 13, 2022 •

edited

Loading