
Switch from BiLSTM to the modern attention architecture #32

Open

vmarkovtsev opened this issue Jul 25, 2019 · 3 comments

Comments

@vmarkovtsev
Collaborator

Our current NN splitter is based on a BiLSTM, which suffers from performance problems. We should leverage recent advancements in deep learning and implement a new attention-based (seq2seq-like?) architecture for the model.
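A minimal sketch of what the attention-based replacement could look like, assuming a character-level formulation where the network predicts, for each position, whether a subtoken boundary follows it. All hyperparameters and names below are illustrative assumptions, not values from the project:

```python
# Sketch: self-attention encoder standing in for the BiLSTM splitter.
# Every hyperparameter here is an illustrative assumption.
import tensorflow as tf


class PositionalEmbedding(tf.keras.layers.Layer):
    """Character embedding plus learned positional embedding,
    since self-attention by itself is order-agnostic."""

    def __init__(self, maxlen, vocab, dim, **kwargs):
        super().__init__(**kwargs)
        self.tok = tf.keras.layers.Embedding(vocab, dim)
        self.pos = tf.keras.layers.Embedding(maxlen, dim)

    def call(self, x):
        positions = tf.range(start=0, limit=tf.shape(x)[-1], delta=1)
        return self.tok(x) + self.pos(positions)


def build_attention_splitter(maxlen=40, vocab=128, dim=64, heads=4):
    chars = tf.keras.Input(shape=(maxlen,), dtype="int32")
    x = PositionalEmbedding(maxlen, vocab, dim)(chars)
    # Self-attention block in place of the BiLSTM encoder.
    attn = tf.keras.layers.MultiHeadAttention(
        num_heads=heads, key_dim=dim // heads)(x, x)
    x = tf.keras.layers.LayerNormalization()(x + attn)
    # Position-wise feed-forward sublayer with a residual connection.
    ff = tf.keras.layers.Dense(4 * dim, activation="relu")(x)
    x = tf.keras.layers.LayerNormalization()(x + tf.keras.layers.Dense(dim)(ff))
    # Per-character binary decision: split after this character or not.
    outputs = tf.keras.layers.Dense(1, activation="sigmoid")(x)
    model = tf.keras.Model(chars, outputs)
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```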

Stage 1 - research

Follow the paper, take the same dataset, and design the model. Calculate the metrics.
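For the metric calculation, one straightforward option is to score each inter-character position as a binary split/no-split decision; a sketch, where `y_true` and `y_pred` are hypothetical boundary-label arrays:

```python
# Sketch of the evaluation: flatten the per-position boundary labels and
# report precision/recall/F1 for the positive ("split here") class.
# y_true and y_pred are hypothetical (n_samples, maxlen) arrays.
import numpy as np
from sklearn.metrics import precision_recall_fscore_support


def split_point_metrics(y_true, y_pred, threshold=0.5):
    flat_true = np.asarray(y_true).ravel().astype(int)
    flat_pred = (np.asarray(y_pred).ravel() >= threshold).astype(int)
    precision, recall, f1, _ = precision_recall_fscore_support(
        flat_true, flat_pred, average="binary")
    return {"precision": precision, "recall": recall, "f1": f1}
```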

Stage 2 - production

Package the model, publish it on Modelforge.
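A rough sketch of the packaging step; the `Model` subclass contract used here (`NAME`, `_generate_tree`, `_load_tree`) is an assumption about the modelforge API and should be verified against the version the project depends on:

```python
# Hypothetical Modelforge wrapper for the trained splitter. The assumed
# contract (NAME, _generate_tree, _load_tree) must be checked against the
# actual modelforge release before publishing.
import numpy as np
from modelforge import Model


class AttentionSplitterModel(Model):
    NAME = "attention-splitter"  # assumed model identifier

    def construct(self, keras_model):
        self._model = keras_model
        return self

    def _generate_tree(self):
        # Store the trained weights in the serialized tree.
        return {"weights": [np.asarray(w) for w in self._model.get_weights()]}

    def _load_tree(self, tree):
        # Rebuild the architecture and restore the stored weights
        # (build_attention_splitter is the sketch from Stage 1 above).
        self._model = build_attention_splitter()
        self._model.set_weights(list(tree["weights"]))
```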

@vmarkovtsev
Collaborator Author

Assigning to you, @zurk, because you worked on related solutions and missed interesting tasks.

@Guillemdb
Contributor

@vmarkovtsev I think it's time to close this issue 😉; for some reason, I cannot do it myself.

@vmarkovtsev
Collaborator Author

I'd rather leave these open to indicate what was lacking in the project when we stopped. Thanks for pinging anyway!
