Learn R Programming

MaxWiK: Maxima Weighted Isolation Kernel Mapping Method

MaxWiK (Maxima Weighted-isolation Kernel mapping method) is a machine learning method of meta-sampling based on Isolation Kernel and Kernel mean embedding. For more details of the method, please, be kind to read the papers:

Iurii Nagornov, Sampling vs. Metasampling Based on Straightforward Hilbert Representation of Isolation Kernel, In: Arai, K. (eds) Intelligent Systems and Applications. IntelliSys 2024. Lecture Notes in Networks and Systems, vol 1067, pp. 243-258. Springer, Cham, 2024

Iurii Nagornov, Overfitting Problem in the Approximate Bayesian Computation Method Based on Maxima Weighted Isolation Kernel, In: Takama, Y., Yada, K., Satoh, K., Arai, S. (eds) New Frontiers in Artificial Intelligence. JSAI-isAI 2022. Lecture Notes in Computer Science, vol 13859, pp. 267–282, Springer, 2023.

Iurii S. Nagornov, Approximate Bayesian Computation Based on Maxima Weighted Isolation Kernel Mapping, arXiv.2201.12745, 2022

Authors and contributor list:

Iurii (Yuri) Nagornov (Maintainer, Author)

All questions and requests can be sent to nagornov.yuri@gmail.com

Short description of the package:

Motivation: A branching processes model yields an unevenly stochastically distributed dataset that consists of sparse and dense regions. This work addresses the problem of precisely evaluating parameters for such a model. Applying a branching processes model to an area such as cancer cell evolution faces a number of obstacles, including high dimensionality and the rare appearance of a result of interest. We take on the ambitious task of obtaining the coefficients of a model that reflects the relationship of driver gene mutations and cancer hallmarks on the basis of personal data regarding variant allele frequencies.

Method: An approximate Bayesian computation method based on Isolation Kernel is developed. The method involves the transformation of row data to a Hilbert space (mapping) and the measurement of the similarity between simulated points and maxima weighted Isolation Kernel mapping related to the observation point. We also design a meta-sampling algorithm for parameter estimation that requires no gradient calculation and is dimension independent. The advantages of the proposed machine learning method are more clearly can be illustrated using multidimensional data as well as a specific branching processes model like cancer cell evolution.

Package: This software is a package named MaxWiK contains Approximate Bayesian Computation methods to choose a single parameter for a single observation point.

To install from CRAN:

utils::install.packages("MaxWiK")

To install from the archive file 'MaxWiK_1.XX.XX.tar.gz' (1.XX.XX is a release number):

utils::install.packages("./MaxWiK_1.XX.XX.tar.gz", repos = NULL, type = "source")

To see how it works, please, be kind use the templates and read vignettes. To get templates, please, use command:

MaxWiK_templates( dir = './' )   # dir can be any working folder where template will be copied

Cite package MaxWiK

For publication, please, be kind to use next references related to MaxWiK software:

Copy Link

Version

Install

install.packages('MaxWiK')

Monthly Downloads

124

Version

1.0.6

License

GPL (>= 3)

Maintainer

Yuri Nagornov

Last Published

July 7th, 2025

Functions in MaxWiK (1.0.6)

get_inverse_GRAM

The function to get inverse Gram matrix
meta_sampling

Function to get Approximate Bayesian Computation based on Maxima Weighted Isolation Kernel mapping
get_voronoi_feature

The function to get feature representation in RKHS based on Voronoi diagram for WHOLE dataset
get_voronoi_feature_PART_dataset

The function to get feature representation in RKHS based on Voronoi diagram for PART of dataset
iKernel

Function returns the value of similarity or Isolation KERNEL for TWO points
read_file

Function to read file
read_hyperparameters

Function to read hyperparameters and their values from the file
sampler_MaxWiK

Function to generate parameters and simulate a model based on MaxWiK algorithm
restrict_data

Function to restrict data in the size to accelerate the calculations
get_kernel_mean_embedding

The function to calculate Maxima weighted kernel mean mapping for Isolation Kernel in RKHS related to parameters space
get_subset_of_feature_map

The function to get subset of points based on feature mapping
MSE_sim

The function to get the mean square error values for statistics of simulations
apply_range

Function to restrict values of the data according with the range for each dimension
MaxWiK.ggplot.density

Density plot
MaxWiK_templates

Function to copy the templates from extdata folder in the library to /Templates/ folder in the working directory
Data.2D

List of the objects for the 2D example of the MaxWiK methods usage
check_packages

Check the installation of packages and attach them with corresponding functions
MaxWiK-package

MaxWiK: Machine Learning Method Based on Isolation Kernel Mean Embedding
sudoku

The function to get the best tracer bullets related to kernel mean embedding
check_pkg

Check the installation of a package for some functions
GET_SUBSET

The function to get subset with size psi for Voronoi diagram
check_numeric_format

Function to check DATA.FRAME
norm_vec

The norm function for vector