Showing the tool we implemented using Langium: A DSL to describe datasets for ML #710
Replies: 2 comments 2 replies
-
There was post recently where someone had a similar issue which was resolved by merging the textmate grammar in a separate step (link). @msujew might have a more clever way though. |
Beta Was this translation helpful? Give feedback.
-
Hi @JoanGi, thanks for presenting your tool!
You can import another grammar file to access all its declaration. An example is available here: |
Beta Was this translation helpful? Give feedback.
-
Following the threads in this section, I think it will be a good point to show the work we have done using Langium. I hope this could be useful for anyone who is starting to use this great toolkit to create DSLs.
The tool we developed with Langium is called DescribeML, which implement a DSL to describe dataset for machine-learning. The prototype will be presented in the MODELS 2022 conference and you can take a look at the paper here and at a video presentation here.
If you like it, please, support us with a star in the repo ;)
The DSL goal:
The goal was to implement a declarative DSL of a set of concepts we detected as relevant to describe datasets intended to train machine learning (ML) models. For instance, an image dataset with few dark faces could add a racial bias in a face classifier, so better know this issues before using this data to train your ML model.
An example of a grammar instance:
In addition to the grammar, we have implemented:
Which troubles are we facing?
Which are our plans?
I hope this example of a declarative DSL will help anyone, and if you have a question or an answer to our troubles, please ask here, and we will learn together.
At last, thanks again to the Langium team for this incredible work 👍
Beta Was this translation helpful? Give feedback.
All reactions