Load balancer #425

Philip-NLnetLabs · 2024-10-28T13:22:16Z

Load balancer for client transports

bal-e

This is a really cool feature! My main concern is the complexity of the most important functions. I'd also like to see more explicit .expect() messages that explain underlying assumptions.

src/net/client/load_balancer.rs

bal-e · 2024-11-11T10:32:13Z

src/net/client/load_balancer.rs

+        // probe upstreams with a queue length of zero. If the queue length
+        // is non-zero then the upstream recently got work and does not need
+        // to be probed.
+        if conn_rt_len > 1 && random::<f64>() < PROBE_P {


Prefer to use rand::Rng::gen_bool()?

gen_bool is a method. Is it worth first selecting a random number generator to be able to invoke the method? Or is there a shortcut?

I think the shortest is

use rand::{thread_rng, Rng}; thread_rng().gen_bool(PROBE_P)

Is that better than the existing code?

That's for you and @bal-e to fight over :)

I don't mind either way. I just wanted to see what it would become.

gen_bool() does the same thing you're doing right now, @Philip-NLnetLabs, it's just more explicit. There's no performance change or anything.

If gen_bool would be a function, just like random I would agree. But now suddenly we introduce a lot more machinery. It doesn't actually improve the code. But now the reader has to know what thread_rng is and if that is the right choice.

If is even worse, the reader now has to read the description of gen_bool to figure out what is does and when it panics.

I want code to be as simple as possible. And in this case the change seems increase code complexity not decrease it.

I disagree with "the reader now has to read the description of gen_bool to figure out what is does and when it panics." gen_bool is a far more descriptive name ("generate boolean") than the current expression. You are implying that readers would understand random better than they would understand gen_bool. What about random's panics? Which random number generator does it use? What range of numbers does it produce? Those are all non-obvious questions that the use of random raises here.

src/net/client/load_balancer.rs

bal-e · 2024-11-11T10:50:17Z

src/net/client/load_balancer.rs

+                    let mut tmp_conn_rt = conn_rt.clone();
+
+                    // Remove entries that exceed the QPS limit. Loop
+                    // backward to efficiently remove them.
+                    for i in (0..tmp_conn_rt.len()).rev() {
+                        // Fill-in current queue length.
+                        tmp_conn_rt[i].queue_length = Arc::strong_count(
+                            &conn_stats[i].queue_length_plus_one,
+                        ) - 1;
+                        if let Some(max_burst) = conn_stats[i].max_burst {
+                            if conn_stats[i].burst_start.elapsed()
+                                > conn_stats[i].burst_interval
+                            {
+                                conn_stats[i].burst_start = Instant::now();
+                                conn_stats[i].burst = 0;
+                            }
+                            if conn_stats[i].burst > max_burst {
+                                tmp_conn_rt.swap_remove(i);
+                            }
+                        } else {
+                            // No limit.
+                        }
+                    }


Instead of cloning the whole Vec and then filtering out of it, why not use .filter().collect()?

That would require two loops: one to update queue_length, burst_start and burst, and another for the filter. Do you think it is worth it?

Updating those parameters can be done in forward order too, right? Then you can do them using .map() or .inspect() in the same iterator chain.

I'm confused. The current loop updates conn_stats. How would that work with map?

Well, it only updates conn_stats at corresponding indices. You can .zip(&mut conn_stats) or use .enumerate() to get the right index to use.

Here's the full code:

let tmp_conn_rt: Vec<_> = conn_rt .iter() .cloned() .zip(&mut conn_stats) .filter_map(|(mut conn, stats)| { // Fill-in current queue length. conn.queue_length = Arc::strong_count(&stats.queue_length_plus_one) - 1; if let Some(max_burst) = stats.max_burst { if stats.burst_start.elapsed() > stats.burst_interval { stats.burst_start = Instant::now(); stats.burst = 0; } if stats.burst > max_burst { // Filter out the connection. return None; } } Some(conn) }) .collect();

I don't see how this is more readable.

It's more efficient and more idiomatic. Readers don't need to consider how entries get filtered using swap_remove(), nor do they have to think about the backward iteration. Instead of indexing conn_stats, we just naturally get the right element from it to work with.

src/net/client/load_balancer.rs

Philip-NLnetLabs added 30 commits September 4, 2024 16:45

Prototype version of a query router.

749eded

Clippy

a62225c

Features for query-routing.

9772ae0

Remove MyService

af1ee53

Duplicate mock_instant

0e6d67d

QnameRouter instead of QueryRouter.

27689d2

SrService becomes SingleService

7243cde

Cargo fmt

65095d3

Changes for service-layering

67fe2f5

Adapt to changes in net::client.

c3fae7a

Adapts to changes in Service.

1e791a8

Changes for proxy.

0460739

Fmt

9cab8db

Docs

2b7e949

Fix example.

1e77887

Docs and cleanup

2731da8

Cleanup use.

21c9304

Document RequestNG

fb2f7ec

Remove some dead code.

2c779d4

Improvements by Ximon

c2bac03

WIP

58b3cb8

WIP

1375b27

wip

7ce5ff5

wip

398d954

wip

3b5d62a

wip

ab04564

wip

0a29c52

Use Arc for queue length.

aa677cf

Docs and examples

f696269

Docs

9d4ae49

Philip-NLnetLabs added 19 commits October 14, 2024 15:36

Merge branch 'main' into qname-router

25d796e

Avoid futures

26df1a7

Remove Unpin bounds

d71c766

Switch to ServiceError, include EDE to report error.

6420956

Remove unwraps

5094638

Remove unwraps

e371c28

Merge branch 'main' into qname-router

0b3bc88

Clippy

6df7ace

Merge branch 'qname-router' into load-balancer

794eeec

Small fix

304bf41

Add tracing and more dnsi query examples

d513f78

Some comments from @bal-e and @tertsdiepraam.

9568c0a

Small fix.

1729ff3

Clippy

b159484

Move boilerplate out of the way

20558da

Merge branch 'qname-router' into load-balancer

165339f

Merge branch 'main' into load-balancer

85506e9

Merge branch 'client-transport-improvements' into load-balancer

ca9e458

Remove debug output

e926198

Philip-NLnetLabs requested a review from a team October 28, 2024 13:22

Clippy

7efde2e

Philip-NLnetLabs added the enhancement New feature or request label Oct 28, 2024

Philip-NLnetLabs added 3 commits October 28, 2024 14:29

Fmt

331e833

Merge branch 'client-transport-improvements' into load-balancer

1580844

Merge branch 'client-transport-improvements' into load-balancer

d8420ba

bal-e requested changes Nov 11, 2024

View reviewed changes

Philip-NLnetLabs added 4 commits November 13, 2024 14:53

Review feedback.

ba1efb0

Clippy.

c772cb8

Allow value zero for max_burst.

5591175

Some more review changes

82fffc5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load balancer #425

Load balancer #425

Philip-NLnetLabs commented Oct 28, 2024

bal-e left a comment

bal-e Nov 11, 2024

Philip-NLnetLabs Nov 13, 2024 •

edited

Loading

tertsdiepraam Nov 14, 2024

Philip-NLnetLabs Nov 14, 2024

tertsdiepraam Nov 14, 2024

bal-e Nov 18, 2024

Philip-NLnetLabs Nov 18, 2024

bal-e Nov 19, 2024

bal-e Nov 11, 2024

Philip-NLnetLabs Nov 13, 2024

bal-e Nov 18, 2024

Philip-NLnetLabs Nov 18, 2024

bal-e Nov 19, 2024

bal-e Nov 19, 2024

Philip-NLnetLabs Nov 20, 2024

bal-e Nov 20, 2024

Load balancer #425

Are you sure you want to change the base?

Load balancer #425

Conversation

Philip-NLnetLabs commented Oct 28, 2024

bal-e left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Philip-NLnetLabs Nov 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Philip-NLnetLabs Nov 13, 2024 •

edited

Loading