Automated REtrieval from TExt
Description
A Python-based pipeline for extracting species occurrence data from text using large language models (LLMs). It includes validation tools designed to detect and handle model hallucinations, so that LLMs can be used in a scientifically rigorous way. It currently supports GPT models, with support for more models planned, including local and non-proprietary ones. For more details on the methodology, please consult the references listed under each function, such as Kent, A. et al. (1995), van Rijsbergen, C.J. (1979, ISBN:978-0408709293), Levenshtein, V.I. (1966) and Krippendorff, K. (2011).
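As a rough illustration of the kind of validation the cited references point to, the sketch below compares model-extracted records against a human-annotated reference using edit distance and precision/recall. It is a minimal sketch under stated assumptions, not the package's implementation: the function names `levenshtein` and `precision_recall_f1`, the record format (species, locality tuples) and the sample data are all hypothetical.

```python
# Hypothetical sketch: none of these names come from the package itself.

def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]


def precision_recall_f1(extracted: set, reference: set) -> tuple[float, float, float]:
    """Score extracted records against a reference set using exact matching."""
    true_positives = len(extracted & reference)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Assumed record format: (species name, locality) tuples. Sample data is invented.
reference = {("Lycosa tarantula", "Apulia"), ("Argiope bruennichi", "Lisbon")}
extracted = {("Lycosa tarantula", "Apulia"), ("Argiope bruennichi", "Porto")}

precision, recall, f1 = precision_recall_f1(extracted, reference)
spelling_gap = levenshtein("Argiope bruennichi", "Argiope bruenichi")
```

Exact matching penalises a record that is wrong in any field, which makes fabricated localities visible in the precision score, while edit distance helps separate minor spelling variants of a name from genuinely different names.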