Skip to content

jpcorb20/google-translate-backtranslation-da

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Google Translate Backtranslation for NLP data augmentation

By Jean-Philippe Corbeil and Hadi Abdi Ghadivel

This script was programmed for data augmentation of NLP copora for the paraphrase identification task. It can easily be adapted for any other NLP task. We use Google API. Thus, you need to provide your own Google API token in your own .env file and activate Google Translate API.

backtranslation data augmentation scheme
Figure 1. Data augmentation in NLP with backtranslation procedure.

No filter is applied in this part of the code. We leave it to further processing steps.

Install depencies

Simply use the requirements.txt file (better in virtual environment):

pip install -r requirements.txt

Last update: June 8th, 2020.

About

Backtranslation for NLP data augmentation done with Google API.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages