Feature/235 get flow partitionings #267

salamonpavel · 2024-09-10T12:23:45Z

Introduces GET endpoint for retrieval of partitionings of a given flow
Closes #235

Release notes:

Introduces GET endpoint for retrieval of partitionings of a given flow

salamonpavel · 2024-09-10T12:24:10Z

Release notes:

Introduces GET endpoint for retrieval of partitionings of a given flow

github-actions · 2024-09-10T12:26:35Z

JaCoCo model module code coverage report - scala 2.13.11

Overall Project	63.64%	🍏

There is no coverage information present for the Files changed

github-actions · 2024-09-10T12:26:36Z

JaCoCo agent module code coverage report - scala 2.13.11

Overall Project	84.63%	🍏

There is no coverage information present for the Files changed

github-actions · 2024-09-10T12:26:37Z

JaCoCo reader module code coverage report - scala 2.13.11

Overall Project	100%	🍏

There is no coverage information present for the Files changed

github-actions · 2024-09-10T12:26:38Z

JaCoCo server module code coverage report - scala 2.13.11

Overall Project	75.29% `-7.88%`	🍏
Files changed	62.57%	❌

File	Coverage
PartitioningRepositoryImpl.scala	100%	🍏
PartitioningServiceImpl.scala	100%	🍏
PartitioningControllerImpl.scala	89.62%	🍏
GetFlowPartitionings.scala	70.48%	❌

# Conflicts: # server/src/main/scala/za/co/absa/atum/server/api/repository/PartitioningRepositoryImpl.scala # server/src/test/scala/za/co/absa/atum/server/api/controller/PartitioningControllerUnitTests.scala # server/src/test/scala/za/co/absa/atum/server/api/repository/PartitioningRepositoryUnitTests.scala

benedeki · 2024-09-18T09:55:22Z

database/src/main/postgres/flows/V1.9.8__get_flow_partitionings.sql

+        WITH limited_partitionings AS (
+            SELECT P.id_partitioning
+            FROM flows.partitioning_to_flow PF
+            JOIN runs.partitionings P ON PF.fk_partitioning = P.id_partitioning
+            WHERE PF.fk_flow = i_flow_id
+            ORDER BY P.created_at DESC, P.id_partitioning
+            LIMIT i_limit OFFSET i_offset
+        )
+        SELECT
+            11 AS status,
+            'OK' AS status_text,
+            P.id_partitioning AS id,
+            P.partitioning,
+            P.created_by AS author,
+            _has_more AS has_more
+        FROM
+            runs.partitionings P
+        WHERE
+            P.id_partitioning IN (SELECT LP.id_partitioning FROM limited_partitionings LP)
+        ORDER BY
+            P.created_at DESC,
+            P.id_partitioning;


Suggested change

WITH limited_partitionings AS (

SELECT P.id_partitioning

FROM flows.partitioning_to_flow PF

JOIN runs.partitionings P ON PF.fk_partitioning = P.id_partitioning

WHERE PF.fk_flow = i_flow_id

ORDER BY P.created_at DESC, P.id_partitioning

LIMIT i_limit OFFSET i_offset

)

SELECT

11 AS status,

'OK' AS status_text,

P.id_partitioning AS id,

P.partitioning,

P.created_by AS author,

_has_more AS has_more

FROM

runs.partitionings P

WHERE

P.id_partitioning IN (SELECT LP.id_partitioning FROM limited_partitionings LP)

ORDER BY

P.created_at DESC,

P.id_partitioning;

SELECT

11 AS status,

'OK' AS status_text,

P.id_partitioning,

P.partitioning,

P.created_by,

has_more

FROM

runs.partitionings P INNER JOIN

flows.partitioning_to_flow PF ON PF.fk_partitioning = P.id_partitioning

WHERE

PF.fk_flow = i_flow

ORDER BY

P.id_partitioning,

P.created_at DESC

LIMIT i_limit OFFSET i_offset;

Several things:

You don't need the WITH here. Generally the order of approaches to use:

use JOIN

use subquery - there are cases where this can be better then JOIN, but we don't have that much data yet to run a meaningful comparison

use WITH

use temporary tables

in the RETURN QUERY statement it's not required the output column names to be the same as the OUT ones. Only the position matters.

I switched the order of ORDER BY columns, reasons described here

benedeki · 2024-09-18T09:58:30Z

database/src/main/postgres/flows/V1.9.8__get_flow_partitionings.sql

+            P.created_at DESC,
+            P.id_partitioning;
+
+    IF NOT FOUND THEN


Same comment as here. Do we return an error?

lsulak · 2024-09-18T13:34:12Z

database/src/main/postgres/flows/V1.9.8__get_flow_partitionings.sql

+    OUT id                     BIGINT,
+    OUT partitioning           JSONB,
+    OUT author                 TEXT,
+    OUT has_more               BOOLEAN


do we also want to care whether a given partitioning was primary for the given flow?

Little value now, and adds a table to the JOIN.
IMHO not worth it.
Btw, we could add the main flow id to partitioning table (for easier search and one less index).

lsulak

Review finished, apart from 1 q from me and 1 note from David it looks good

benedeki · 2024-09-20T10:14:55Z

database/src/main/postgres/flows/V1.9.8__get_flow_partitionings.sql

+    IF NOT FOUND THEN
+        status := 12;
+        status_text := 'OK with no partitionings found';
+        RETURN NEXT;
+    END IF;


Suggested change

IF NOT FOUND THEN

status := 12;

status_text := 'OK with no partitionings found';

RETURN NEXT;

END IF;

database/src/main/postgres/flows/V1.9.8__get_flow_partitionings.sql

…s.sql Co-authored-by: David Benedeki <14905969+benedeki@users.noreply.github.com>

# Conflicts: # server/src/main/scala/za/co/absa/atum/server/api/repository/PartitioningRepository.scala # server/src/main/scala/za/co/absa/atum/server/api/service/PartitioningServiceImpl.scala

salamonpavel added 3 commits September 10, 2024 12:22

endpoint

a1e822c

Merge branch 'master' into feature/235-get-flow-partitionings

bc18a03

get_flow_partitionings

649f969

salamonpavel self-assigned this Sep 10, 2024

salamonpavel added the work in progress Work on this item is not yet finished (mainly intended for PRs) label Sep 10, 2024

salamonpavel added 12 commits September 10, 2024 15:33

test unfinished

165c3dd

test sql

ad08738

test sql

3fb80b6

test sql

b6e4038

_add_to_parent_flows fix

dcdc93e

tmp

bb2cbf9

implementation without tests

44ab165

GetFlowPartitioningsIntegrationTests

6a2be40

repository tests

10ff0b6

service and controller tests

aa5c39f

GetFlowPartitioningsV2EndpointUnitTests

d2b8e30

input validation, sql pagination changed

01fd90a

salamonpavel marked this pull request as ready for review September 16, 2024 13:04

salamonpavel requested review from benedeki, lsulak, TebaleloS, Zejnilovic and dk1844 as code owners September 16, 2024 13:04

salamonpavel removed the work in progress Work on this item is not yet finished (mainly intended for PRs) label Sep 16, 2024

benedeki reviewed Sep 18, 2024

View reviewed changes

lsulak reviewed Sep 18, 2024

View reviewed changes

salamonpavel added 2 commits September 19, 2024 10:58

pr comments addressed

06b18de

Merge branch 'master' into feature/235-get-flow-partitionings

bdcc00e

benedeki reviewed Sep 20, 2024

View reviewed changes

no status when no results

baa526e

benedeki mentioned this pull request Sep 25, 2024

#262: Full Flyway integration #276

Merged

benedeki reviewed Sep 26, 2024

View reviewed changes

database/src/main/postgres/flows/V1.9.8__get_flow_partitionings.sql Outdated Show resolved Hide resolved

salamonpavel and others added 3 commits September 26, 2024 16:31

Update database/src/main/postgres/flows/V1.9.8__get_flow_partitioning…

819c05f

…s.sql Co-authored-by: David Benedeki <14905969+benedeki@users.noreply.github.com>

Merge branch 'master' into feature/235-get-flow-partitionings

f9f016c

# Conflicts: # server/src/main/scala/za/co/absa/atum/server/api/repository/PartitioningRepository.scala # server/src/main/scala/za/co/absa/atum/server/api/service/PartitioningServiceImpl.scala

conflicts resolved

56adaa1

benedeki approved these changes Sep 26, 2024

View reviewed changes

salamonpavel merged commit db4a867 into master Sep 26, 2024
10 checks passed

salamonpavel deleted the feature/235-get-flow-partitionings branch September 26, 2024 14:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/235 get flow partitionings #267

Feature/235 get flow partitionings #267

salamonpavel commented Sep 10, 2024 •

edited by miroslavpojer

Loading

salamonpavel commented Sep 10, 2024

github-actions bot commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

benedeki Sep 18, 2024

salamonpavel Sep 19, 2024

benedeki Sep 18, 2024

salamonpavel Sep 19, 2024

lsulak Sep 18, 2024

benedeki Sep 19, 2024

lsulak left a comment

benedeki Sep 20, 2024

Feature/235 get flow partitionings #267

Feature/235 get flow partitionings #267

Conversation

salamonpavel commented Sep 10, 2024 • edited by miroslavpojer Loading

salamonpavel commented Sep 10, 2024

github-actions bot commented Sep 10, 2024 • edited Loading

JaCoCo model module code coverage report - scala 2.13.11

github-actions bot commented Sep 10, 2024 • edited Loading

JaCoCo agent module code coverage report - scala 2.13.11

github-actions bot commented Sep 10, 2024 • edited Loading

JaCoCo reader module code coverage report - scala 2.13.11

github-actions bot commented Sep 10, 2024 • edited Loading

JaCoCo server module code coverage report - scala 2.13.11

benedeki Sep 18, 2024

Choose a reason for hiding this comment

salamonpavel Sep 19, 2024

Choose a reason for hiding this comment

benedeki Sep 18, 2024

Choose a reason for hiding this comment

salamonpavel Sep 19, 2024

Choose a reason for hiding this comment

lsulak Sep 18, 2024

Choose a reason for hiding this comment

benedeki Sep 19, 2024

Choose a reason for hiding this comment

lsulak left a comment

Choose a reason for hiding this comment

benedeki Sep 20, 2024

Choose a reason for hiding this comment

salamonpavel commented Sep 10, 2024 •

edited by miroslavpojer

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading