Skip to content

Latest commit

 

History

History
403 lines (359 loc) · 9.83 KB

sequence_based_models.md

File metadata and controls

403 lines (359 loc) · 9.83 KB

title: On the naturalness of software year: 2012 venue: None task: Code Generation model: N-gram dataset: pdf: https://dl.acm.org/doi/pdf/10.1145/2902362 code:

title: On the localness of software year: 2014 venue: FSE/ESEC task: Code Generation model: N-gram dataset: pdf: https://dl.acm.org/doi/pdf/10.1145/2635868.2635875 code:

title: Phrase-Based Statistical Translation of Programming Languages year: 2014 venue:OOPSLA task: Code Generation model: N-gram dataset: pdf:https://files.sri.inf.ethz.ch/website/papers/onward14.pdf
code:

title: A convolutional attention network for extreme summarization of source code year: 2016 venue: ICML task: Code Summarization model: CAN dataset: Java pdf: http://proceedings.mlr.press/v48/allamanis16.html code: https://github.com/mast-group/convolutional-attention

title: Code completion with statistical language models year: 2014 venue:PLDI task: Code Generation model: RNN dataset: pdf:https://dl.acm.org/doi/pdf/10.1145/2594291.2594321
code:

title: Neural Code Comprehension: A Learnable Representation of Code Semantics year: 2018 venue:NuerIPs task: Code representation model: RNN dataset: pdf: https://proceedings.neurips.cc/paper/2018/hash/17c3433fecc21b57000debdf7ad5c930-Abstract.html code:

title: A deep language model for software code year: 2016 venue: None task: Code Generation model: LSTM dataset: pdf: https://arxiv.org/pdf/1608.02715
code:

title: Summarizing Source Code using a Neural Attention Model year: 2016 venue: ACL task: Code Summarization model: LSTM dataset: C# pdf: https://aclanthology.org/P16-1195.pdf code: https://github.com/sriniiyer/codenn

title: Latent Attention For If-Then Program Synthesis year: 2016 venue:NuerIPs task: Code Generation model: Bi-LSTM dataset: pdf: https://proceedings.neurips.cc/paper/2016/file/716e1b8c6cd17b771da77391355749f3-Paper.pdf
code:

title: Abstract Syntax Networks for Code Generation and Semantic Parsing year: 2016 venue:ACL task: Code Generation model: LSTM dataset: pdf: https://arxiv.org/pdf/1704.07535 code:

title: CodeGRU: Context-aware deep learning with gated recurrent unit for source code modeling year: 2020 venue: IST task: Code Generation model: GRU dataset: pdf: https://www.sciencedirect.com/science/article/pii/S0950584920300616?casa_token=mKr3XC1pMD4AAAAA:AiVTPP7wnxInR_g-PFI5Y_XXlk-KpFlnK8DtKoNULlLamBJlMNfDgtplzgYSgiYyCx0qstFjbZE code:

title: A transformer-based approach for source code summarization year: 2020 venue: ACL task: Code Summarization model: Transformer dataset: pdf: https://arxiv.org/abs/2005.00653 code:https://github.com/wasiahmad/NeuralCodeSum

title: CodeBERT: A Pre-Trained Model for Programming and Natural Languages year: 2020 venue:EMNLP task: Pretrain model: Transformer dataset: pdf: https://arxiv.org/pdf/2002.08155.pdf
code: https://github.com/microsoft/CodeBERT

title: Learning and Evaluating Contextual Embedding of Source Code year: 2020 venue:ICML task: Pretrain model: Transformer dataset: pdf: https://proceedings.mlr.press/v119/kanade20a.html
code:

title: Learning and Evaluating Contextual Embedding of Source Code year: 2020 venue:FSE/ESEC task: Pretrain model: Transformer dataset: pdf: https://dl.acm.org/doi/pdf/10.1145/3368089.3417058 code:

title: CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation year: 2021 venue:EMNLP task: Pretrain model: Transformer dataset: pdf: https://arxiv.org/pdf/2109.00859 code:

title: A general path-based representation for predicting programproperties year: 2018 venue: PLDL task: Code Generation model: word2vec,CRF dataset: JavaScript, Java, Python, C# pdf: https://dl.acm.org/doi/pdf/10.1145/3296979.3192412 code:

title: Exploring API embedding for API usages and applications year: 2017 venue: ICSE task: Code Generation model: word2vec dataset: Java, C# pdf: https://ieeexplore.ieee.org/abstract/document/7985683 code:

title: Automatically learning semantic features for defect prediction year: 2016 venue: ICSE task: Safety Analysis model: DBN dataset: pdf: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7886912 code:

title: Deep Semantic Feature Learning for Software Defect Prediction year: 2020 venue: TSE task: Safety Analysis model: DBN dataset: pdf: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8502853 code:

title: Neural Code Completion year: 2018 venue: ICPC task: Code Generation model: LSTM dataset: JS150,PY150 pdf: https://openreview.net/pdf?id=rJbPBt9lg code:

title: Code Completion with Neural Attention and Pointer Networks year: 2018 venue: IJCAI task: Code Generation model: LSTM,pointer network dataset: JS150,PY150 pdf: https://ieeexplore.ieee.org/abstract/document/7985683 code:https://github.com/jack57lee/neuralCodeCompletion

title: Deep code comment generation year: 2018 venue: ICPC task: Code Summarization model: LSTM dataset: pdf: https://ieeexplore.ieee.org/abstract/document/8973050 code:https://github.com/LRNavin/AutoComments

title: Code2vec: learning distributed representations of code year: 2019 venue: POPL task: Code Generation model: LSTM dataset: 10072 Java GitHub repositories pdf: https://arxiv.org/pdf/1803.09473 code: https://github.com/tech-srl/code2vec

title: Seml: A semantic lstm model for software defect prediction year: 2019 venue: None task: Safety Analysis model: LSTM dataset: pdf: https://ieeexplore.ieee.org/abstract/document/8747001 code:

title: Modeling programs hierarchically with stack-augmented LSTM year: 2020 venue: JSS task: Code Generation model: LSTM dataset: C, python pdf: https://www.sciencedirect.com/science/article/pii/S0164121220300297?casa_token=B2mvgbpiwFUAAAAA:kpOAhKMiSEnvJPN0as8qH-_8EMDK-pF5bu_e8TT6_4c6Kae5gMhvi-00_nzSC3Y4VHNzoAFzqQ code:

title: Code2seq: Generating Sequences from Structured Representations of Code year: 2019 venue: ICLR task: Code Generation model: Bi-LSTM dataset: Java, C#(dataset of CodeNN) pdf: https://arxiv.org/pdf/1808.01400 code: https://github.com/tech-srl/code2seq

title: DeepCPDP: Deep Learning Based Cross-Project Defect Prediction year: 2019 venue: task: Safety Analysis model: Bi-LSTM dataset: pdf: https://ieeexplore.ieee.org/abstract/document/8937501/ code:

title: Pythia: AI-assisted Code Completion System year: 2019 venue: SIGKDD task: Code Generation model: Bi-LSTM dataset: Python pdf: https://dl.acm.org/doi/pdf/10.1145/3292500.3330699 code: https://github.com/Microsoft/PTVS

title: A neural model for generating natural language summaries of program subroutines(astted-gru) year: 2019 venue: ICSE task: Code Summarization model: GRU dataset: pdf: https://arxiv.org/pdf/1902.01954v1.pdf code: https://github.com/mcmillco/funcom

title: Deep code comment generation with hybrid lexical and syntactical information year: 2020 venue: FSE/EFEC task: Code Summarization model: GRU dataset: 9714 Java projects from GitHub pdf: https://link.springer.com/article/10.1007/s10664-019-09730-9 code: https://github.com/Rick-Feng-u/Deep-code-comment-generation

title:TreeBERT: A Tree-Based Pre-Trained Model for Programming Language year:2021 venue:UAI task: Pretrain model: TreeBERT dataset: pdf: https://arxiv.org/abs/2105.12485 code: https://github.com/17385/TreeBERT

title: Structural language models of code year: 2020 venue: ICML task: Code Generation model: Transformer dataset: pdf: https://proceedings.mlr.press/v119/alon20a.html code:

title: Code prediction by Feeding Trees to Transfomers year: 2021 venue: ICSE task: Code Generation model: Transformer dataset: pdf: https://dl.acm.org/doi/pdf/10.1145/3387904.3389261

title: A self-attentional neural architecture for code completion with multi-task learning year: 2020 venue: ICPC task: Code Generation model: Transformer dataset: pdf: https://ieeexplore.ieee.org/abstract/document/9402114 code:

title: Retrieval-based Neural Source Code Summarization year: 2020 venue: ICSE task: Code Summarization model: Others dataset: pdf: https://ieeexplore.ieee.org/abstract/document/9284039 code:

title: Retrieval on Source Code: A Neural Code Search year: 2018 venue: PLDI task: Code Search model: word embedding dataset: pdf: https://ieeexplore.ieee.org/abstract/document/9284039 code:

title: Deep code search year: 2018 venue: ICSE task: Code Search model: RNN dataset: pdf: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8453172 code:

title: Improving Code Search with Co-Attentive Representation Learning year: 2020 venue: ICPC task: Code Search model: RNN dataset: pdf: https://dl.acm.org/doi/pdf/10.1145/3387904.3389269 code:

title: Cclearner: A deep learning-based clone detection approach year: 2017 venue: ICSME task: Clone Detection model: DNN dataset: pdf: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8094426 code:

title: Deep learning code fragments for code clone detection year: 2017 venue: ASE task: Clone Detection model: RNN dataset: pdf: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7582748&tag=1 code:

title: Neural Program Repair by Jointly Learning to Localize and Repair year: 2019 venue: ICLR task: Program Repair model: LSTM dataset: DeepFix pdf: https://arxiv.org/pdf/1904.01720 code:https://github.com/mdrafiqulrabin/SIVAND

title: TFix: Learning to Fix Coding Errors with a Text-to-Text Transformer year: 2021 venue: ICML task: Program Repair model: Transformer dataset: TFix's Code Patches Data pdf: https://files.sri.inf.ethz.ch/website/papers/icml21-tfix.pdf code:https://github.com/eth-sri/TFix

title: Embedding Java Classes with code2vec: Improvements from Variable Obfuscation year: 2020 venue: task: Program Classification model: LSTM dataset: pdf: https://files.sri.inf.ethz.ch/website/papers/icml21-tfix.pdf code:https://github.com/eth-sri/TFix

title: SCC: Automatic Classification of Code Snippets year: 2018 venue: task: Program Classification model: Multinomial Naive Bayes (MNB) dataset: pdf: https://arxiv.org/pdf/1809.07945v1.pdf code:https://github.com/mindscan-de/FluentGenesis-Classifier