[Concurrent Low-Code] ConcurrentDeclarativeSource class that low-code connectors can inherit from to uptake Concurrent CDK #46662
Conversation
connector_state_converter = CustomOutputFormatConcurrentStreamStateConverter(
    datetime_format=declarative_cursor_attributes.datetime_format,
    is_sequential_state=False,
By disabling is_sequential_state, we will automatically move connectors from their sequential state to per-partition concurrent state. This is the ideal end goal, but if we were to revert the connector to a previous version that doesn't use the concurrent CDK, the state message would not be compatible. To de-risk the early release a little, it feels like we should set this to true so that we accept sequential state (i.e. {"updated_at": "2024-12-12"}) and also emit it back to the platform as such.
do you think we could add a mechanism to handle this situation by using the lowest value to set the sequential state if we revert the changes?
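A minimal sketch of the rollback mechanism suggested here, assuming per-partition state shaped as a list of slices. The field names and state shapes are illustrative, not the CDK's actual format:

```python
# Hypothetical state shapes for illustration only.
sequential_state = {"updated_at": "2024-12-12"}
concurrent_state = {
    "state_type": "date-range",
    "slices": [
        {"start": "2024-06-01", "end": "2024-12-12"},
        {"start": "2024-01-01", "end": "2024-06-01"},
    ],
}

def to_sequential(state: dict, cursor_field: str = "updated_at") -> dict:
    """Collapse per-partition state to a single sequential cursor by taking
    the lowest slice start, so a reverted connector re-reads any gaps."""
    if "slices" not in state:
        return state  # already sequential
    lowest = min(s["start"] for s in state["slices"])
    return {cursor_field: lowest}
```

Taking the lowest start is conservative (the reverted connector may re-read already-synced windows), but it never skips data.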
@@ -32,3 +33,6 @@ def check_connection(self, logger: logging.Logger, config: Mapping[str, Any]) ->
        The error object will be cast to string to display the problem to the user.
        """
        return self.connection_checker.check_connection(self, logger, config)

+    def all_streams(self, config: Mapping[str, Any]) -> List[Stream]:
+        return self.streams(config=config)
For non-concurrent low-code sources, these are equivalent, but we override this implementation in ConcurrentDeclarativeSource to create a single list of both concurrent and synchronous streams so that we properly generate catalogs and other artifacts.
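A sketch of the relationship described, with placeholder stream-name lists standing in for real Stream objects (class internals are illustrative, not the PR's actual implementation):

```python
from typing import Any, List, Mapping

class DeclarativeSourceSketch:
    """Base behavior: all_streams is equivalent to streams."""

    def streams(self, config: Mapping[str, Any]) -> List[str]:
        return ["sync_stream_a", "sync_stream_b"]  # placeholder stream names

    def all_streams(self, config: Mapping[str, Any]) -> List[str]:
        return self.streams(config=config)

class ConcurrentDeclarativeSourceSketch(DeclarativeSourceSketch):
    """Override: one combined list so catalog generation sees every stream."""

    _concurrent_streams = ["concurrent_stream_c"]

    def all_streams(self, config: Mapping[str, Any]) -> List[str]:
        return self.streams(config=config) + self._concurrent_streams
```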
cursor_field=[declarative_cursor_attributes.cursor_field]
if declarative_cursor_attributes.cursor_field is not None
else None,
Suggested change:

cursor_field=(
    [declarative_cursor_attributes.cursor_field] if declarative_cursor_attributes.cursor_field is not None else None
),
    catalog: ConfiguredAirbyteCatalog,
    concurrent_stream_names: set[str],
) -> ConfiguredAirbyteCatalog:
    return ConfiguredAirbyteCatalog(streams=[stream for stream in catalog.streams if stream.stream.name not in concurrent_stream_names])
Suggested change:

catalog.streams = [stream for stream in catalog.streams if stream.stream.name not in concurrent_stream_names]
return catalog
Got distracted, just posting this so comments don't get lost, will read thoroughly later
@@ -28,7 +28,7 @@ def __post_init__(self, parameters: Mapping[str, Any]) -> None:
        self._parameters = parameters

    def check_connection(self, source: Source, logger: logging.Logger, config: Mapping[str, Any]) -> Tuple[bool, Any]:
-        streams = source.streams(config)  # type: ignore # source is always a DeclarativeSource, but this parameter type adheres to the outer interface
+        streams = source.all_streams(config)  # type: ignore # source is always a DeclarativeSource, but this parameter type adheres to the outer interface
Teach me your ways: what is the difference between streams and all_streams? (ignore if it's answered in this PR, still reading through it)
good question, I mention it in a comment: https://github.com/airbytehq/airbyte/pull/46662/files#r1792816384

But the reason we can't just rewrite the streams() method is that within the existing Python CDK core.py, when processing synchronous streams, we invoke the streams() method, and in that context we don't want to return the concurrent streams, which aren't compatible in that area of code.
As we discussed earlier, it's preferable to use the streams method and condition its behavior accordingly. This approach adds some complexity, but the tradeoff is simpler modifications later: once the core is able to handle concurrent streams, we'll get a stream generation interface for free.
yep this is addressed in my latest commit using the optional param include_concurrent_streams
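A hedged sketch of that optional-parameter approach (stream names and internals are illustrative placeholders, not the actual CDK implementation):

```python
from typing import Any, List, Mapping

class SourceSketch:
    """A single streams() entry point, conditioned on an optional flag."""

    _synchronous_streams = ["sync_a"]
    _concurrent_streams = ["concurrent_b"]

    def streams(self, config: Mapping[str, Any], include_concurrent_streams: bool = False) -> List[str]:
        if include_concurrent_streams:
            return self._synchronous_streams + self._concurrent_streams
        # Existing CDK call sites use the default and keep seeing only
        # synchronous streams, preserving backward compatibility.
        return self._synchronous_streams
```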
@@ -67,6 +67,7 @@ def check_availability(self, logger: logging.Logger) -> StreamAvailability:
        """

+@deprecated("This class is experimental. Use at your own risk.", category=ExperimentalClassWarning)
@brianjlai probably better to call out that it should not be used at all, if we're ripping out availability strategies over mid-long term?
ah yeah i think i carried that over from a merge of serhii's work which deprecated availability strategy in concurrent. I'll update this to say do not use
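For illustration, a minimal warn-on-instantiation decorator in the spirit of the deprecated decorator shown in the diff. This is a hand-rolled stand-in written for this sketch, not the CDK's or typing_extensions' implementation:

```python
import warnings
from functools import wraps

class ExperimentalClassWarning(DeprecationWarning):
    pass

def deprecated(message: str, category: type = DeprecationWarning):
    """Stand-in decorator: emit a warning when the decorated class is instantiated."""
    def decorate(cls):
        original_init = cls.__init__

        @wraps(original_init)
        def new_init(self, *args, **kwargs):
            warnings.warn(message, category=category, stacklevel=2)
            original_init(self, *args, **kwargs)

        cls.__init__ = new_init
        return cls
    return decorate

@deprecated("Do not use. Availability strategies are slated for removal.", category=ExperimentalClassWarning)
class AvailabilityStrategySketch:
    pass
```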
Overall, the implementation looks great; nice work! However, we still need to make a few updates. After that, I can approve.
# This needs to be revisited as we can't lose precision
if isinstance(obj, datetime):
    return list(obj.timetuple())[0:6]
Should we set 6 as a variable to avoid using magic numbers?
yes, thank you for calling this out. I put this in as a placeholder while I was working through getting this tested the first time around, and it needs to be reinvestigated/fixed
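A sketch of the named-constant fix suggested above (the constant name is illustrative):

```python
from datetime import datetime

# year, month, day, hour, minute, second -- sub-second precision is
# dropped, which is the precision concern flagged in the code comment.
DATETIME_FIELD_COUNT = 6

def serialize_datetime(obj: datetime) -> list:
    """Named-constant version of list(obj.timetuple())[0:6]."""
    return list(obj.timetuple())[:DATETIME_FIELD_COUNT]
```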
    concurrency_level = concurrency_level_component.get_concurrency_level()
    initial_number_of_partitions_to_generate = concurrency_level // 2
else:
    concurrency_level = 1
I think it would be better to move this value from the code into a class variable, to improve readability
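One possible shape for that suggestion, using hypothetical names for the constants:

```python
class ConcurrencyDefaultsSketch:
    """Class-level constants replacing the inline literals."""

    DEFAULT_CONCURRENCY_LEVEL = 1
    # Generate half as many initial partitions as the concurrency level.
    INITIAL_PARTITION_DIVISOR = 2

    @classmethod
    def initial_partitions(cls, concurrency_level: int) -> int:
        return concurrency_level // cls.INITIAL_PARTITION_DIVISOR
```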
def all_streams(self, config: Mapping[str, Any]) -> List[Stream]:
    return self._synchronous_streams + self._concurrent_streams  # type: ignore # Although AbstractStream doesn't inherit Stream, they were designed to fit the same interface when called from streams()

def _separate_streams(self, config: Mapping[str, Any]) -> Tuple[List[AbstractStream], List[Stream]]:
Do you think the name _group_streams would be more informative?
i'm fine with _group_streams
for slice_start, slice_end in self._cursor.generate_slices():
    stream_slice = StreamSlice(partition={}, cursor_slice={"start": slice_start, "end": slice_end})

    start_boundary = self._slice_boundary_fields[self._START_BOUNDARY] if self._slice_boundary_fields else "start"
For the current implementation of the datetime cursor, self._slice_boundary_fields never has a None value
from source_amplitude import SourceAmplitude


def _get_source(args: List[str]):
I think we need to clearly mention in the PR description that this will be a breaking change, and add info about it to the CDK migration file.
yep that is the plan, I wrote up a migration guide that will be included in the next commit I push, explaining what needs to change in run.py and source.py.
A couple minor comments. I'll defer to Serhii for approval.
Also can you run regression tests w/ one/two of the test connectors?
state_manager = ConnectorStateManager(state=self._state)  # type: ignore # state is always in the form of List[AirbyteStateMessage]. The ConnectorStateManager should use generics, but this can be done later

self.logger.info(f"what is config: {config}")
Is this a personal debugging log?
it is. i'm removing this thank you
    declarative_cursor.get_partition_field_end().eval(config=config),
)

interpolated_state_date = declarative_cursor.get_start_datetime()
nit: interpolated_state_date - typo?
good eyes!
end_boundary = self._slice_boundary_fields[self._END_BOUNDARY] if self._slice_boundary_fields else "end"

wam = list(self._cursor.generate_slices())
for slice_start, slice_end in wam:
nit: It seems like it would be more memory efficient to directly iterate over the generated slices, is there a specific reason for saving to a list?
correct. I had originally added this for debugging to see the entire set of slices easier, but you are correct this should be iterable
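The agreed fix can be sketched as follows; generate_slices here is a stand-in for the cursor's generator, not the CDK method itself:

```python
from typing import Iterable, Tuple

def generate_slices() -> Iterable[Tuple[str, str]]:
    # Stand-in for self._cursor.generate_slices()
    yield ("2024-01-01", "2024-02-01")
    yield ("2024-02-01", "2024-03-01")

def build_stream_slices() -> Iterable[dict]:
    # Iterating the generator directly keeps memory bounded even for very
    # large date ranges; no intermediate list is materialized.
    for slice_start, slice_end in generate_slices():
        yield {"partition": {}, "cursor_slice": {"start": slice_start, "end": slice_end}}
```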
# This needs to be revisited as we can't lose precision
if isinstance(obj, datetime):
    return list(obj.timetuple())[0:6]
I don't follow this -- Is there somewhere we would be serializing a datetime object?
i'll double check, but I think we needed this serializer deeper in our code, potentially in how we emit state back out. I'll reconfirm this as I work through @lazebnyi's comment above.
I'm also curious about this. From my understanding, this would mean that CursorPartitionGenerator would create slices with datetime within them but this is not what I see.
@lazebnyi @pnilan @maxi297 to close the loop on this one: I think what originally happened was that I wrote this in when I was first testing, because we were getting the datetime object from the ConcurrentCursor and it would fail trying to serialize it.
However, later, after cleaning up the code and fixing edge cases, I addressed the serialization by converting the datetime into the correct output string format with the correct precision in the generate() function here: https://github.com/airbytehq/airbyte/pull/46662/files#diff-93127bface0b323fe43b21cdb8fb14493dd465995b085a4f81647f3697930bddR396-R399. And since this is now already a string, we don't need to convert it again.
I'll get rid of this code, as it's not actually used anymore and we're applying the correct precision based on the cursor definition.
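A minimal sketch of the resolution described: format the datetime into the cursor's output string format at slice-generation time, so slices carry plain strings and no custom datetime JSON serializer is needed downstream (the format string below is an assumed example):

```python
from datetime import datetime

def format_cursor_value(value: datetime, datetime_format: str = "%Y-%m-%dT%H:%M:%S") -> str:
    """Render a cursor boundary as a string up front, applying the
    precision implied by the cursor's configured datetime format."""
    return value.strftime(datetime_format)
```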
Closes https://github.com/airbytehq/airbyte-internal-issues/issues/9710
What
Adds the new ConcurrentDeclarativeSource class, which serves as the way we adapt the existing YamlDeclarativeSource used by all low-code connectors into being runnable within the Concurrent CDK framework. This PR combines all the previous units of work so that low-code streams are translated into concurrent DefaultStream instances.

Another big aspect of this review is how we gate the streams that will run concurrently.
How
The overall design is predicated on introducing a new class, ConcurrentDeclarativeSource, which behaves as a kind of adapter between the existing entrypoint.py that all syncs are triggered from and the ConcurrentSource, which is responsible for running certain streams using the Concurrent CDK engine.

The ConcurrentDeclarativeSource inherits from ManifestDeclarativeSource so that we can reuse the logic to parse a manifest into low-code runtime components, and inspect those components to decide whether each stream can be run concurrently or synchronously.

The last big part of the code puts into place the logic to transform a low-code stream's DatetimeBasedCursor into a ConcurrentCursor. We need to do this because the interfaces for a low-code cursor and a concurrent cursor differ in a few specific ways, and trying to make them both fit the same interface created a frankenstein class that proved to be even more unwieldy. In prior PRs (see 45413), it was determined that there was feature parity, so now we perform the transformation and supply the result to the concurrent engine to handle date window partitioning and state management.

Something else important to note is that there are some specific cases where an incremental stream cannot be run as a concurrent source. Since we introduced the language, we've allowed stream_state to be a valid interpolation value for various components. However, because partitions can be run in any order and complete at any time, the stream_state
managed by the ConcurrentCursor is no longer a thread-safe value (versus when it was managed sequentially). I inspected the schema and our repo for its usage. For streams using stream_state in an unsafe way, we make them synchronous streams, but we should fix those connectors to use the thread-safe stream_interval and ultimately get rid of the extra code later.

Short term: how we enable this
I've included two examples of how connectors can uptake concurrent processing. They are the same and will be deleted before merging.
The two things that need to be changed are:
run.py - Our previous design for connectors did not take any arguments passed to the connector from the platform. This is a significant limitation because the concurrent framework is entirely based around instantiating things like cursors up front, before performing a read. I haven't found a great way to avoid changing this, as it is a limitation of the Concurrent CDK.
source.py - Once run.py is updated to pass in the various operation arguments like state, config, and catalog, we need to pass them to the ConcurrentDeclarativeSource constructor.
Review guide
concurrent_declarative_source.py
datetime_based_cursor.py
adapter.py
datetime_stream_state_converter.py
yaml_declarative_source.py
source-sentry or source-amplitude
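As a rough, self-contained illustration of the run.py change described in the short-term section above, here is one way operation arguments could be extracted up front and handed to the source constructor. The helper names, flag parsing, and constructor signature are assumptions for this sketch, not the CDK's actual entrypoint code:

```python
from typing import List, Optional

def _value_after(args: List[str], flag: str) -> Optional[str]:
    """Return the value following a CLI flag like `--config config.json`, if present."""
    return args[args.index(flag) + 1] if flag in args else None

class ConcurrentDeclarativeSourceSketch:
    def __init__(self, catalog: Optional[str], config: Optional[str], state: Optional[str]):
        self.catalog, self.config, self.state = catalog, config, state

def build_source(args: List[str]) -> ConcurrentDeclarativeSourceSketch:
    # Operation arguments are parsed before the read begins, so cursors
    # and other components can be instantiated up front.
    return ConcurrentDeclarativeSourceSketch(
        catalog=_value_after(args, "--catalog"),
        config=_value_after(args, "--config"),
        state=_value_after(args, "--state"),
    )
```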
User Impact
This is considered a breaking CDK change because connectors will need to follow the included migration guide to update a connector's run.py and source.py files.
filesCan this PR be safely reverted and rolled back?
Yes, because this isn't released yet. However, this does pose a risk once we move a connector to concurrent: once we start emitting the new state format, it is much harder to go backward, since the reverted connector cannot process concurrent state. See my comment in the code for more.