get_dependency(annotation, get_token = FALSE)
c("tbl_df", "tbl", "data.frame")
containing one row for every dependency pair in the corpus.The returned data frame includes at a minimum the following columns:
If get_token
is set to true, the following columns will also be included:
Danqi Chen and Christopher D Manning. 2014. A Fast and Accurate Dependency Parser using Neural Networks. In: Proceedings of EMNLP 2014
Spence Green, Marie-Catherine de Marneffe, John Bauer, and Christopher D. Manning. 2010. Multiword Expression Identification with Tree Substitution Grammars: A Parsing tour de force with French. In: EMNLP 2011.
Spence Green and Christopher D. Manning. 2010. Better Arabic Parsing: Baselines, Evaluations, and Analysis. In: COLING 2010.
Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, and Christopher D. Manning. 2009. Discriminative Reordering with Chinese Grammatical Relations Features. In: Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation.
Anna Rafferty and Christopher D. Manning. 2008. Parsing Three German Treebanks: Lexicalized and Unlexicalized Baselines. In: ACL Workshop on Parsing German.
data(obama)
# find the most common noun lemmas that are the syntactic subject of a clause
res <- get_dependency(obama, get_token = TRUE) %>%
filter(relation == "nsubj")
res$lemma_target %>%
table() %>%
sort(decreasing = TRUE) %>%
head(n = 40)
Run the code above in your browser using DataLab