Learn R Programming

alineR (version 1.1.4)

generate.training: Generate training dataset

Description

Generates an output file of training data to be used by a linguist to select the best alignments from a list of the unique set of possible alignments for each given pair of words.

Usage

generate.training(raw.data, search.size=1000,table=TRUE,
                  file.out="candidate_alignments.csv")

Arguments

raw.data

This is a 2*n matrix containing n ipa encoded cognate pairs.

search.size

Number of time to randomize feature parameters while searching for unique alignments.

table

table=TRUE will generate a csv file named by the user containing possible alignments in IPA encodings.

file.out

Name of CSV file for output.

Value

A list containing two elements:

standard_ipa_symbol

A data frame containing input cognate pairs and a list of possible alignemnts. UTF-8 IPA

ALINE_symbol

Same as above, but using ALINE symbol for use in internal funcitons

See Also

optimize.features

Examples

Run this code
# NOT RUN {
# some cognates
data<-data.frame(dog=c('dog','perro'),cat=c('cat','gato'),rat=c('rat','rata'))

# write out a CSV file that can be openned in Excel and used for expert determinations
M<-generate.training(raw.data=data,search.size=100,file="open.with.excel.csv")

# }

Run the code above in your browser using DataLab