ft_collect grabs full text data from file paths in your ft_data object (the result of a call to ft_get()). ft_text is a convenience function that grabs the nested text data and brings it up in the list for easier access.
ft_collect(x, ...)

ft_text(x, ...)

# S3 method for default
ft_text(x, ...)
# S3 method for ft_data
ft_text(x, ...)
x: Input. An object of class ft_data.
...: Further args, ignored.
An object of class ft_data, but the data slot should contain a character string of text from the XML/plain text/PDF file.
The result of this call is actual text you can read.
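To make the idea of "replacing file pointers with parsed text" concrete, here is a minimal sketch in base R. It does not use fulltext and is not the package's implementation: the nested list layout and the helper names collect_sketch() and text_sketch() are hypothetical, chosen only to mirror the behaviour described above.

```r
# Hypothetical stand-in for an ft_data-like object: a nested list that
# stores a file path but no text yet (names and layout are assumptions).
tmp <- tempfile(fileext = ".txt")
writeLines(c("First line of the article.", "Second line."), tmp)

obj <- list(
  plos = list(
    data = list(path = tmp, data = NULL)
  )
)

# Sketch of the "collect" step: read each file and store its text in the
# data slot, keeping the overall object structure unchanged.
collect_sketch <- function(x) {
  lapply(x, function(src) {
    src$data$data <- vapply(
      src$data$path,
      function(p) paste(readLines(p), collapse = "\n"),
      character(1),
      USE.NAMES = FALSE
    )
    src
  })
}

# Sketch of the "text" step: lift the nested text up one level so that
# each source maps directly to its text.
text_sketch <- function(x) {
  lapply(x, function(src) src$data$data)
}

collected <- collect_sketch(obj)

is.null(obj$plos$data$data)   # TRUE: nothing collected yet
collected$plos$data$data      # the file contents as one string
text_sketch(collected)$plos   # same text, one level up
```

The point of the two-step design is that collecting keeps the object structure intact, while extracting the text flattens it for convenient access.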
# NOT RUN {
# Get some data
library(fulltext)
x <- ft_get('10.1371/journal.pone.0086169')
# note that the data is not in the object, gives NULL
x$plos$data$data
# Collect data from the .xml file
y <- x %>% ft_collect()
# note how the data is now in the object
y$plos$data$data
# Let's get the actual text
## ft_collect() alone replaces file pointers with parsed text,
## maintaining object structure
x %>% ft_collect()
## pulls the text out of the object
x %>% ft_collect() %>% ft_text()
# }