fuzzywuzzyR
The fuzzywuzzyR package is an R port of the fuzzywuzzy Python package for fuzzy string matching. It uses the Levenshtein distance to calculate the differences between sequences. More details on the functionality of fuzzywuzzyR can be found in the blog post and in the package vignette.
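As a rough illustration of the Levenshtein distance mentioned above, here is a minimal textbook sketch in Python. The function name `levenshtein` is hypothetical and this is not the code that fuzzywuzzy or python-Levenshtein actually runs; it only shows what the metric counts.

```python
def levenshtein(a: str, b: str) -> int:
    """Smallest number of single-character insertions, deletions and
    substitutions needed to turn string a into string b."""
    # previous[j] = distance between the prefix of a handled so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("kitten", "sitting"))  # 3
```

A smaller distance means the two strings are more alike; a distance of 0 means they are identical.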
System Requirements
Python (>= 3.8)
difflib
fuzzywuzzy (>= 0.15.0)
python-Levenshtein (>= 0.12.0, optional; provides a 4-10x speedup in string matching, though it may produce different results in certain cases)
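To give a feel for what the difflib requirement provides, the sketch below computes a 0-100 similarity score with `difflib.SequenceMatcher` from the Python standard library. The helper name `similarity_score` is hypothetical, and scaling the ratio to an integer percentage is an assumption about how such a score is typically presented, not a copy of the fuzzywuzzy source.

```python
from difflib import SequenceMatcher


def similarity_score(s1: str, s2: str) -> int:
    """Similarity of two strings as an integer between 0 and 100."""
    # ratio() returns 2 * M / T, where M is the number of matching
    # characters and T is the total length of both strings.
    matcher = SequenceMatcher(None, s1, s2)
    return round(100 * matcher.ratio())


print(similarity_score("abcd", "bcde"))  # 75
print(similarity_score("this is a test", "this is a test!"))  # 97
```

When python-Levenshtein is installed, a faster matcher can be used in place of difflib, which is why scores may differ slightly between the two setups.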
Before installing any Python modules, check the Python configuration using:
reticulate::py_config()
All modules should be installed in the default Python configuration (the one that the R session displays as default); otherwise, errors might occur during package installation.
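One way to confirm that the modules are visible to a given interpreter is the short Python check below, run with the same Python that `reticulate::py_config()` reports. The helper name `missing_modules` is hypothetical, and the import name `Levenshtein` for the python-Levenshtein distribution is stated here as an assumption.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that this interpreter cannot find."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# 'Levenshtein' is assumed to be the import name of python-Levenshtein (optional).
required = ["difflib", "fuzzywuzzy", "Levenshtein"]
missing = missing_modules(required)
if missing:
    print("Not found in this Python:", ", ".join(missing))
else:
    print("All modules found.")
```

If a module is reported as not found here but was installed with pip3, the installation most likely went to a different Python than the one the R session uses.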
Linux OS (Debian, Ubuntu, Fedora etc.)
sudo apt-get install python3-pip
sudo pip3 install --upgrade pip
pip3 install fuzzywuzzy
pip3 install python-Levenshtein
Mac OSX and Windows OS
Users of the fuzzywuzzyR package will find the required information on how to configure Python for Mac OSX and Windows OS in the reticulate GitHub repository. The fuzzywuzzyR package depends on the reticulate package.
Installation of the fuzzywuzzyR package
To install the package from CRAN use,
install.packages('fuzzywuzzyR')
and to download the latest version from GitHub use the install_github function of the remotes package,
remotes::install_github(repo = 'mlampros/fuzzywuzzyR')
To report bugs or issues, use the following link: https://github.com/mlampros/fuzzywuzzyR/issues
Citation:
If you use the code of this repository in your paper or research, please cite both fuzzywuzzyR and the original software (see https://CRAN.R-project.org/package=fuzzywuzzyR/citation.html):
@Manual{,
title = {{fuzzywuzzyR}: Fuzzy String Matching in R},
author = {Lampros Mouselimis},
year = {2025},
note = {R package version 1.0.6},
url = {https://CRAN.R-project.org/package=fuzzywuzzyR},
}