A textual unit aligner for comparable corpora using Expectation-Maximization.

accurat-toolkit/EMACC

To do a quick test-run, just issue (from this directory) the command:

perl emacc2.pl --source en --target ro --input test\doclist-en.txt --input test\doclist-ro.txt --output test-alignment.txt

On Windows, be sure to configure the files 'cluster.info' and 'emaccconf.pm' first, especially if you have a lot of documents to align.
On Windows it is also advisable to run EMACC under Cygwin.
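
Before launching the test run, it can help to confirm that the script, the two configuration files and the two document lists are actually in place. The following is only a minimal sketch: `check_emacc_files` is a hypothetical helper that is not part of EMACC, and the commented usage assumes that 'cluster.info' and 'emaccconf.pm' sit in this directory and that paths are written with forward slashes, as is usual under Cygwin.

```shell
#!/bin/sh
# Hypothetical pre-flight helper; NOT part of EMACC itself.
# Prints every missing file to stderr and returns non-zero if any is missing.
check_emacc_files() {
    missing=0
    for f in "$@"; do
        if [ ! -f "$f" ]; then
            echo "missing: $f" >&2
            missing=1
        fi
    done
    return "$missing"
}

# Example usage (assumed file locations, forward-slash paths):
#   check_emacc_files emacc2.pl cluster.info emaccconf.pm \
#       test/doclist-en.txt test/doclist-ro.txt &&
#   perl emacc2.pl --source en --target ro \
#       --input test/doclist-en.txt --input test/doclist-ro.txt \
#       --output test-alignment.txt
```

The helper only checks that the files exist. It does not validate the contents of 'cluster.info' or 'emaccconf.pm'.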

For more details, please read the ACCURAT D2.6 deliverable, Version 2.
