[Draft] Proposal for SQLAlchemy Data Layer with ORM #1365

qtangs · 2024-09-22T04:10:53Z

This is a proposal for a different way to implement SQLAlchemy Data Layer using ORM instead of raw SQL statements, taking inspiration from #832
Hopefully it will simplify support for other SQL dialects besides Postgres and SQLite.

This is a rather major change:

Define tables and indices using SQLAlchemy objects
Use SQLAlchemy functions to create statements instead of raw SQL queries

Other changes:

in get_thread and list_threads. only get specific thread(s) for better performance and reduced bandwidth compared to get_all_user_threads (note: reduced bandwidth is important for providers such as Supabase)
in list_threads, filter would use thread name as well as steps input/output texts, this needs careful review.
in create_element, storage_provider is made optional so that elements that don't need content to be stored will still be added to database, example of these elements are external links that can be read and displayed at runtime.

The diff between current SQLAlchemy and this one can be found here: https://gist.github.com/qtangs/71370e6ce5feec78d9586c808804e81c/revisions?diff=split&w=

Some tests have been added based on https://github.com/Chainlit/chainlit/blob/main/backend/tests/data/test_sql_alchemy.py but more might be needed.

…ostgres and others

This reverts commit 0a11293.

dokterbob · 2024-09-23T09:50:24Z

@qtangs Woop woop! :D :D

One important aspect is that we don't break existing applications.

Do you think we can reasonably assure that? Or should we deprecate older installations?

I expect to review this in the 2nd half of this week. Want to take my time to do it well, as it's such a big one.

dokterbob · 2024-09-23T10:27:35Z

Might want to take this into account #1368

qtangs · 2024-09-25T10:44:34Z

Do you think we can reasonably assure that? Or should we deprecate older installations?

@dokterbob I think more testing is needed if we want to replace the old version in-place. I've added a new class to avoid this kind of major impact, anyone needing the new fix can switch to new class in their environment and test accordingly, if something breaks it's easy to switch back to old class.

dokterbob · 2024-09-25T11:06:14Z

Do you think we can reasonably assure that? Or should we deprecate older installations?

@dokterbob I think more testing is needed if we want to replace the old version in-place. I've added a new class to avoid this kind of major impact, anyone needing the new fix can switch to new class in their environment and test accordingly, if something breaks it's easy to switch back to old class.

We're not gonna support both though, and it seems there's a reasonable user base for the current implementation. I imagine we could deprecate the old version. Or we could use unit tests to somewhat assure consistency.

Curious how other community members/users of SQLite/SQLAlchemy reflect on this.

barrel-roll-42 · 2024-10-03T19:29:12Z

@dokterbob This would totally be great for my Databricks implementation of this.

daviddwlee84 · 2024-11-06T10:15:52Z

@dokterbob
Came from #832
I'm looking for this feature to preserve data locally.
SQLite would be one of the easiest ways to do this.

dokterbob · 2024-11-06T12:00:43Z

#1463 is the first step towards the cleanup of data layers. Once that's merged, @qtangs, could you please resolve conflicts on this one?

I want to either have this code merged to main or (preferably) move it directly to our community repo!

Kai Liu and others added 7 commits September 21, 2024 21:45

Add sqlite data layer and a simple sanity check

38eaee7

feat: add new SQLAlchemyORM data layer to support sqlite along with p…

42989dc

…ostgres and others

fix: remove SQLiteDataLayer, use SQLAlchemyORMDataLayer instead

b0e77f9

feat: add tests for SQLAlchemyORMDataLayer

550c6d5

feat: update test for data_layer_sqlite

4b44657

chore: update poetry.lock

b81f82c

fix: correct test db file name

d86654a

dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Sep 22, 2024

qtangs added 3 commits September 23, 2024 06:54

feat: move aiosqlite to custom-data group

0a11293

Revert "feat: move aiosqlite to custom-data group"

013de55

This reverts commit 0a11293.

feat: remove aiosqlite from main dependencies

29ff4d2

dokterbob mentioned this pull request Sep 23, 2024

Support SQLite for custom data layer #832

Closed

dokterbob mentioned this pull request Sep 23, 2024

Handle session ID does not exist in the user_sessions dictionary #1364

Closed

dokterbob added the blocked Awaiting update or feedback from user after initial review/comments. label Nov 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Draft] Proposal for SQLAlchemy Data Layer with ORM #1365

[Draft] Proposal for SQLAlchemy Data Layer with ORM #1365

qtangs commented Sep 22, 2024

dokterbob commented Sep 23, 2024

dokterbob commented Sep 23, 2024

qtangs commented Sep 25, 2024

dokterbob commented Sep 25, 2024

barrel-roll-42 commented Oct 3, 2024 •

edited

Loading

daviddwlee84 commented Nov 6, 2024

dokterbob commented Nov 6, 2024

[Draft] Proposal for SQLAlchemy Data Layer with ORM #1365

Are you sure you want to change the base?

[Draft] Proposal for SQLAlchemy Data Layer with ORM #1365

Conversation

qtangs commented Sep 22, 2024

dokterbob commented Sep 23, 2024

dokterbob commented Sep 23, 2024

qtangs commented Sep 25, 2024

dokterbob commented Sep 25, 2024

barrel-roll-42 commented Oct 3, 2024 • edited Loading

daviddwlee84 commented Nov 6, 2024

dokterbob commented Nov 6, 2024

barrel-roll-42 commented Oct 3, 2024 •

edited

Loading