Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
-
Updated
May 3, 2023 - Python
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке
Dataset and Codes for our EMNLP 2022 Main Conference Long Paper titled "ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts"
Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
Klexikon: A German Dataset for Joint Summarization and Simplification
Code and data for the Dreyer et al (2023) paper on abstractiveness and factuality in abstractive summarization
Thai Crosslingual Summarization Datasets.
This repository contains the implementation of a Transformer-based model for abstractive text summarization and a rule-based approach for extractive text summarization.
This is the official PyTorch codebase for the ACL 2023 paper: "What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization".
[EACL 2021] - Unsupervised Abstractive Summarization of Bengali Text Documents.
[Computer Speech & Language, Elsevier] - Neural Sentence Fusion for Diversity Driven Abstractive Multi-Document Summarization.
M3LS : Multi-lingual Multi-modal summarization dataset
In deep learning NLP, using a model we are trying to summarization the text.
Dataset for abstractive summarization of long multimodal presentations
This repository contains evaluation script for all the LLMs evaluated with iCOPERNICUS for testing In-Context Personalization Learning w.r.t summarization
Specific-Aspect Summarization on News According to Social Sentiments on Twitter
Using T5-Small and fine-tuning it using BBC's article summarization dataset.
Add a description, image, and links to the summarization-dataset topic page so that developers can more easily learn about it.
To associate your repository with the summarization-dataset topic, visit your repo's landing page and select "manage topics."