[kv] Support index lookup for primary key table #222

swuferhong · 2024-12-18T13:52:50Z

Purpose

Linked issue: #65

Index lookup is a feature that exposes lookup capabilities built on top of secondary indexes. By using secondary indexes, the required data can be located quickly, which can be utilized in conjunction with Flink to implement delta joins.
The purpose of this PR is to provide index lookup for kv tables. The implementation approach is to define the primary key of the kv storage as "secondary keys + primary key", and set the bucket key to the secondary keys. This way, when looking up data through the secondary keys, the corresponding bucket and server can be quickly identified, providing efficient point query capabilities.

Tests

API and Format

Documentation

wuchong

I think our current index is not a general index, it is just a prefix of primary key index. So, actually, it is just a prefix scan/lookup for the prefix of primary key (the prefix should include bucket key). I don't want to call this indexLookup because it occupies the API for future possible index (index on arbitrary columns).

How about changing the API into prefixLookup? The parameter key should be the prefix of primary key and must include bucket key. For DDL, we don't need to introduce new options table.index.keys, we can just continue to use bucket.key.

As we don't have force checks for bucket key is a prefix of primary key. We have to add some best practices for Delta Join cases in the future documentation. For tables used for DeltaJoin queries, the best practice is putting columns of bucket key before other columns in the definition of primary key. Otherwise, the prefixLookup doesn't work when the parameter key only contains bucket join. For example, given a primary key table orders with schema user_id, item_id, order_id, col1, col2, col3 (order_id can be used as primary key as it is unique). If the join key is (user_id, item_id), the primary key of the table must be set to user_id, item_id, order_id and bucket key to user_id, item_id. The prefixLookup will not work if the primary key is set to order_id, user_id, item_id, because the join key is not a prefix of primary key.

wuchong · 2024-12-21T03:00:50Z

website/docs/maintenance/monitor-metrics.md

@@ -490,6 +490,26 @@ Some metrics might not be exposed when using other JVM implementations (e.g. IBM
      <td>The number of failed lookup requests to lookup value by key from this table per second.</td>
      <td>Meter</td>
    </tr>
+    <tr>
+      <td>totalLimitScanRequestsPerSecond</td>
+      <td>The number of limit scn requests to scan records with limit from this table per second.</td>


scn -> scan?

wuchong · 2024-12-21T03:00:57Z

website/docs/maintenance/monitor-metrics.md

+    </tr>
+    <tr>
+      <td>failedLimitScanRequestsPerSecond</td>
+      <td>The number of failed limit scn requests to scan records with limit from this table per second.</td>


wuchong · 2024-12-21T07:35:13Z

fluss-client/src/main/java/com/alibaba/fluss/client/lookup/AbstractLookup.java

+        return key;
+    }
+
+    public abstract LookupType lookupType();


Class 'LookupType' is exposed outside its defined visibility scope

Make LookupType public.

wuchong · 2024-12-21T07:35:43Z

fluss-client/src/main/java/com/alibaba/fluss/client/lookup/AbstractLookupBatch.java

+
+/** An abstract lookup batch. */
+@Internal
+public abstract class AbstractLookupBatch {


This is only used by IndexLookupBatch, we don't need this abstraction.

wuchong · 2024-12-21T07:38:34Z

fluss-client/src/main/java/com/alibaba/fluss/client/lookup/AbstractLookup.java

+
+    public abstract LookupType lookupType();
+
+    public abstract CompletableFuture<List<byte[]>> future();


This is strange and inefficient that Lookup returns a List of result. We can introduce a generic type T to AbstractLookup and allows Lookup and IndexLookup to define their own return type.

public abstract class AbstractLookup<T> { ... public abstract CompletableFuture<T> future(); } public class Lookup extends AbstractLookup<byte[]> { ... } public class IndexLookup extends AbstractLookup<List<byte[]>> { ... }

wuchong · 2024-12-21T12:51:19Z

fluss-server/src/main/java/com/alibaba/fluss/server/replica/ReplicaManager.java

+        for (Map.Entry<TableBucket, List<byte[]>> entry : entriesPerBucket.entrySet()) {
+            TableBucket tb = entry.getKey();
+            PbIndexLookupRespForBucket respForBucket = new PbIndexLookupRespForBucket();
+            respForBucket.setBucketId(tb.getBucket());


set partition id as well .

wuchong · 2024-12-21T12:53:30Z

fluss-server/src/main/java/com/alibaba/fluss/server/replica/ReplicaManager.java

@@ -456,6 +460,50 @@ public void multiLookupValues(
        responseCallback.accept(lookupResultForBucketMap);
    }

+    /** Lookup by index keys on kv store. */
+    public void indexLookup(


indexLookup -> indexLookups

and rename multiLookupValues to lookups

wuchong · 2024-12-21T12:56:36Z

fluss-server/src/test/java/com/alibaba/fluss/server/replica/ReplicaManagerTest.java

+        TableBucket tb = new TableBucket(tableId, 0);
+        makeKvTableAsLeader(tb.getBucket());
+
+        // TODO


missing implementation

wuchong · 2024-12-21T13:06:28Z

fluss-server/src/test/java/com/alibaba/fluss/server/testutils/KvTestUtils.java

+
+    public static void assertIndexLookupResponse(
+            IndexLookupResponse indexLookupResponse, List<List<byte[]>> expectedValues) {
+        checkArgument(indexLookupResponse.getBucketsRespsCount() == 1);


do not use checkArgument for assertion. Use assertThat instead!

wuchong · 2024-12-21T13:07:14Z

fluss-server/src/test/java/com/alibaba/fluss/server/tablet/TabletServiceITCase.java

+                                                primaryKeyType,
+                                                rowType,
+                                                Arrays.asList(
+                                                        Tuple2.of(
+                                                                new Object[] {1, "a", 1L},
+                                                                new Object[] {
+                                                                    1, "a", 1L, "value1"
+                                                                }),
+                                                        Tuple2.of(
+                                                                new Object[] {1, "a", 2L},
+                                                                new Object[] {
+                                                                    1, "a", 2L, "value2"
+                                                                }),
+                                                        Tuple2.of(
+                                                                new Object[] {1, "a", 3L},
+                                                                new Object[] {
+                                                                    1, "a", 3L, "value3"
+                                                                }),
+                                                        Tuple2.of(
+                                                                new Object[] {2, "a", 4L},
+                                                                new Object[] {


code is not readable, reformat it.

swuferhong requested review from wuchong and luoyuxia December 18, 2024 13:52

wuchong linked an issue Dec 18, 2024 that may be closed by this pull request

[Feature] Fluss support index lookup for primary key table #65

Open

2 tasks

swuferhong force-pushed the index-lookup-1216 branch 2 times, most recently from 572a477 to b95540f Compare December 20, 2024 09:37

[kv] Support index lookup for primary key table

90f6295

swuferhong force-pushed the index-lookup-1216 branch from b95540f to 90f6295 Compare December 20, 2024 09:55

wuchong requested changes Dec 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[kv] Support index lookup for primary key table #222

[kv] Support index lookup for primary key table #222

swuferhong commented Dec 18, 2024

wuchong left a comment

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024

wuchong Dec 21, 2024


		public abstract LookupType lookupType();

		public abstract CompletableFuture<List<byte[]>> future();

[kv] Support index lookup for primary key table #222

Are you sure you want to change the base?

[kv] Support index lookup for primary key table #222

Conversation

swuferhong commented Dec 18, 2024

Purpose

Tests

API and Format

Documentation

wuchong left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment