Get information on possibly bad files in your cache
cache_file_info()
A list with three elements:
xml_not_valid: XML files that could not be read in with xml2::read_xml()
xml_abstract_only: XML files that contain only abstracts. You can of course choose to retain these if you like.
pdf_not_valid: PDF files that could not be read in with pdftools::pdf_info()
This function only identifies possibly bad files; it does not remove them. You have to delete them yourself. See the example for how to do so. You can also open your cache folder and delete them by hand.
Other caching-functions: cache, ftxt_cache
# NOT RUN {
# identify likely bad files
res <- cache_file_info()
# you can remove them yourself, e.g.,
# invisible(lapply(res$xml_abstract_only, unlink))
# }
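The removal step can be tried without touching a real cache. The sketch below is only an illustration: it builds a mock list with the same three element names that cache_file_info() is documented to return, pointing at throwaway files in a temporary directory. The directory name and file names are hypothetical stand-ins, and the real function is not called.

```r
# Mock cache directory with stand-in files (hypothetical names, not real cache contents)
cache_dir <- file.path(tempdir(), "mock_ftxt_cache")
dir.create(cache_dir, showWarnings = FALSE)
paths <- file.path(cache_dir, c("a.xml", "b.xml", "c.pdf"))
invisible(file.create(paths))

# Mock result: same element names as the documented return value (an assumption
# about shape only; real entries would be paths inside your actual cache)
res <- list(
  xml_not_valid = paths[1],
  xml_abstract_only = paths[2],
  pdf_not_valid = paths[3]
)

# Choose what to delete; here the abstract-only XML file is deliberately kept
to_remove <- unlist(res[c("xml_not_valid", "pdf_not_valid")], use.names = FALSE)

# unlink() returns 0 on success and 1 on failure
status <- unlink(to_remove)

# List what is left in the mock cache directory
remaining <- list.files(cache_dir)
```

Selecting elements by name before calling unlink() makes it explicit which categories are deleted, which is useful if you want to keep abstract-only files.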