Syllable parser (R implementation)

This README is under construction

Newly updated syllable parser. See the old scripts here.

This set of scripts defines several functions (the foremost of which is syllabify()) that together can syllabify a phonetic transcription using principles taught to students in Phonology 1.

The following functionality has not yet been incorporated into the new version.

Some of the provided functions (namely, transcribe()) interface with a local copy of the Carnegie Mellon University (CMU) Pronouncing Dictionary, which provides phonetic transcriptions (using ARPABET sequences) for over 134,000 words in the English lexicon. Scripts are also provided that check the dictionary for updates, download the most recent version from the dictionary's website, and format it, including converting ARPABET codes to International Phonetic Alphabet (IPA) characters.
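As a rough illustration of the ARPABET-to-IPA conversion step, the sketch below maps a CMU-style entry to IPA. It is not the repository's formatter: the function name `arpabet_to_ipa()` and the lookup table are hypothetical, the table covers only a handful of the dictionary's phonemes, and stress digits are simply discarded.

```r
# Partial, illustrative ARPABET-to-IPA table (assumption: not the repo's table).
arpabet_ipa <- c(
  K = "k", T = "t", S = "s", N = "n",
  IY = "i",
  AE = "\u00e6",  # ae ligature (near-open front unrounded vowel)
  EH = "\u025b"   # open-mid front unrounded vowel
)

# Convert one space-separated ARPABET entry, e.g. "K AE1 T", to an IPA string.
arpabet_to_ipa <- function(entry) {
  codes <- strsplit(trimws(entry), " +")[[1]]
  codes <- sub("[0-9]$", "", codes)  # drop the trailing stress digit on vowels
  ipa <- arpabet_ipa[codes]
  if (anyNA(ipa)) {
    stop("no mapping in this partial table for: ",
         paste(codes[is.na(ipa)], collapse = ", "))
  }
  paste(unname(ipa), collapse = "")
}

cat_ipa <- arpabet_to_ipa("K AE1 T")
```

A full converter would need every ARPABET code and a decision about how to represent stress, which this sketch deliberately leaves out.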

The parser follows a strategy commonly taught to undergraduate phonology students for determining the syllabification of a word:

1. Identify syllable nuclei by looking for the vowels. If there are two adjacent vowels, check if those vowels together are a known diphthong in the language. If they are, consider them part of the same syllable nucleus. Standalone vowels each count as the nucleus of their own syllable.
2. Identify syllable onsets (this occurs before step 3 in accordance with the Onset Principle). Automatically parse consonants preceding the first vowel into the first syllable. Parse a consonant immediately preceding any of the other vowels into the syllable of that vowel. If any remaining unparsed consonants are not after the last vowel in the word and have a consonant following them, check if that consonant and the following are a legal onset in the language (defined as a consonant cluster that can occur at the beginning of a word, as long as that word is not a borrowing). If they're a legal onset, parse that consonant into the syllable of the consonant following it. If they're not a legal onset, leave that consonant unparsed.
3. Identify syllable codas by parsing any remaining unparsed consonants into the syllables preceding them.
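The three steps above can be sketched in base R as follows. This is a minimal illustration under stated assumptions, not the code in this repository: the function name `syllabify_sketch()`, its arguments, and the toy inventories are hypothetical, the input is a vector of single-character segments, and the legal-onset check is interpreted as testing the whole cluster formed by the candidate consonant plus the onset already built to its right.

```r
syllabify_sketch <- function(segs, vowels, diphthongs, legal_onsets) {
  n <- length(segs)
  is_v <- segs %in% vowels
  if (!any(is_v)) stop("no vowel found, so there is no nucleus to build on")
  syl <- rep(NA_integer_, n)  # syllable index per segment; NA = unparsed

  # Step 1: nuclei. Two adjacent vowels share a nucleus if they form a diphthong.
  k <- 0L
  i <- 1L
  while (i <= n) {
    if (is_v[i]) {
      k <- k + 1L
      syl[i] <- k
      if (i < n && is_v[i + 1L] &&
          paste0(segs[i], segs[i + 1L]) %in% diphthongs) {
        syl[i + 1L] <- k
        i <- i + 1L
      }
    }
    i <- i + 1L
  }

  # Step 2: onsets. Word-initial consonants join the first syllable.
  first_v <- which(is_v)[1]
  syl[seq_len(first_v - 1L)] <- 1L
  # Walk right to left so each onset grows leftward from its nucleus.
  for (i in rev(seq_len(n - 1L))) {
    if (!is.na(syl[i])) next
    s <- syl[i + 1L]
    if (is.na(s)) next  # following segment is unparsed, so leave this one too
    if (is_v[i + 1L]) {
      syl[i] <- s       # a consonant right before a vowel is always an onset
    } else {
      onset <- segs[which(syl == s & !is_v)]
      cluster <- paste0(c(segs[i], onset), collapse = "")
      if (cluster %in% legal_onsets) syl[i] <- s
    }
  }

  # Step 3: codas. Remaining consonants join the syllable to their left.
  for (i in seq_len(n)) if (is.na(syl[i])) syl[i] <- syl[i - 1L]

  paste(vapply(split(segs, syl), paste0, character(1), collapse = ""),
        collapse = ".")
}

# Toy inventories (assumptions for illustration, not real English phonology).
toy_vowels <- c("a", "e", "i", "o", "u")
toy_diphthongs <- c("ai", "au", "oi")
toy_onsets <- c("st", "tr", "str", "pl", "pr", "kl", "kr")

syllabify_sketch(strsplit("anstra", "")[[1]],
                 toy_vowels, toy_diphthongs, toy_onsets)
# returns "an.stra"
```

Parsing onsets before codas is what makes "str" attach to the second syllable here; only the "n", which cannot extend a legal onset, falls back to the coda of the first.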

Getting started

Prerequisites

R programming language environment
The following R packages:
- dplyr
- magrittr
- readr
- tidyr
- stringr
Optional packages:
- RCurl (Required for checking dictionary version and downloading updates)
- pbapply (Required by dictionary formatter)
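One way to check which of these packages still need installing is sketched below. The helper name `missing_packages()` is hypothetical; it relies only on base R's `requireNamespace()`, and the install call is left commented out so that nothing is downloaded without an explicit decision.

```r
required <- c("dplyr", "magrittr", "readr", "tidyr", "stringr")
optional <- c("RCurl", "pbapply")

# Return the subset of `pkgs` that cannot be loaded from the current library.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

to_install <- missing_packages(required)
if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # install.packages(to_install)  # uncomment to install from CRAN
}
```

The same helper can be called with `optional` if the dictionary update and formatting scripts are needed.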

Installing

Using Git

1. Clone this repository into a directory of your choosing: `git clone https://github.com/jakewvincent/R-syllable-parser.git`
2. Open an R terminal and set your working directory to the directory where you cloned this repository.
3. Source `0_master.r` by running `source(file = "0_master.r")` in your R terminal.

By downloading .zip file

1. Download this repository as a zip file and unzip it into a directory of your choosing.
2. Open an R terminal and set your working directory to the directory where you unzipped the zipped repository file.
3. Source `0_master.r` by running `source(file = "0_master.r")` in your R terminal.
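With either installation route, the last two steps can be combined into a small helper. This is only a sketch: `source_master()` is a hypothetical name, and the file-name pattern is an assumption that accepts both spellings of the master script used in this README, with or without the `0_` prefix and in either letter case.

```r
# Locate the master script inside `repo_dir` and source it.
source_master <- function(repo_dir) {
  hits <- list.files(repo_dir, pattern = "^(0_)?master\\.r$",
                     ignore.case = TRUE, full.names = TRUE)
  if (length(hits) == 0) stop("no master script found in ", repo_dir)
  # chdir = TRUE lets relative paths inside the script resolve against repo_dir.
  source(hits[1], chdir = TRUE)
  invisible(hits[1])
}

# Example call (the path is a placeholder for wherever the repository lives):
# source_master("path/to/R-syllable-parser")
```

Matching the file name case-insensitively avoids a failed `source()` call on case-sensitive file systems.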

Using the parser (section under construction)

After sourcing `0_master.r` as above, all of the functions defined by these scripts are available for use. Some of these functions (namely, `cvify()` and `sonority()`) are mainly used internally. Here is a description of each function:

`syllabify()`

`cvify()`

`sonority()`

To do

Re-incorporate CMU pronouncing dictionary
- random_word() function
- transcribe() function
Make outputs when verbose = TRUE consistent and predictable
Recognize syllabic consonants (e.g. words that end in [ɪzm] should have the [m] assigned to a nucleus)
