Learn R Programming

r2lh (version 0.6.1)

rthb: ~ Function: R to HTML, Bivariate analysis ~

Description

rthb performs some bivariate analyses, then generates code to be included in a HTML document in order to print out the analyses on a Web page.

Usage

rthb(formula,fileOut="",textBefore="",textAfter="",graphDir="graphBiv",graphName="V",type="png",displayStyle=7,limDiscreteY=10,limDiscreteX=10)

Arguments

formula
[variable~variable] or [variable~data.frame] : contains the data to analyse. In all the following, the left part will be called Y, the right part will be X.
fileOut
[character] name of the output file in which the HTML summary will be saved. If empty, the HTML code is printed on screen.
textBefore
[character] or [vector(character)] : before printing a variable analysis, rthb can write a text. If X is a single variable, textBefore should be of length 1. If X is a d
textAfter
[character] or [vector(character)] : same as textBefore but the text is printed after the variable analysis. See textBefore and examples for details.
graphDir
[character] : directory used to save the graphs generated by the analyses.
graphName
[character] or [vector(character)] : prefix for the graph names. If empty, the graph names are V1 to V length(data.frame)
type
[character] : type of plotting device used to export the graphics. Can be Windows metafile, PNG, JPEG, BMP (Windows bitmap format), TIFF, PostScript or PDF.
displayStyle
[numeric] or [vector(numeric)] : when X is a factor, an ordered factor or a numeric discrete variable (see limDiscreteY for details on discrete variables), rthb proposes two d
limDiscreteY
[numeric] : rthb distinguishes two kinds of numeric : discrete designates numeric variables with only a few modalities, continuous designates numeric var
limDiscreteX
[numeric] or [vector(numeric)]: same as limDiscreteY. If X is a data.frame, limDiscreteX can have the same length as X or can be of length 1 (and is recyc

Value

  • rthb generates HTML code and either prints it on the screen, or saves it in a file. It also generates several graphs, optionally in a different directory.

Classical usage

The use of rthb goes through the following steps: ll{ Step 1. Load the data (usually, a data.frame). Step 2. Optionally, set some variables as ordered. Step 3. Run rthb(Y~dataFrame,"fileOut.html"). } See examples of application.

Author

Christophe Genolini christophe.genolini@free.fr PSIGIAM: Paris Sud Innovation Group in Adolescent Mental Health INSERM U669 / Maison de Solenn / Paris Bernard Desgraupes bernard.desgraupes@u-paris10.fr University of Paris Ouest - Nanterre

English correction

Jean-Marc Chamot jchamot@u-paris10.fr Laboratoire "Sport & Culture" / "Sports & Culture" Laboratory University of Paris 10 / Nanterre

Details

rthb performs some basic analyses, then generates code to be included in a HTML document in order to print out the analyses in a Web page. rthb performs the analyses automatically according to the data class. It considers 5 classes: nominal with 2 modalities, nominal with 3 modalities or more, ordered, discrete and continuous (see the description of limDiscreteY for details on discrete and continuous). The analysis of the variable depends on the class of Y and X wich gives 25 possible combinations. We will not describe all of them here. They can be divided in two categories. First (on the top of the tabular) are descriptive analyses:
  1. table: absolute and relative frequency.
  2. summary: mainly whenYis continuous andXhas few modalities.
  3. graphical representation: barplot or boxplot for each modalities ofX, mosaic plot, scatter plot, density lines according to the type of the variable.
On the second part of the tabular are all the informations related to a potential link between Y and X.
  1. test: khi2, Fisher exact test, Student's T, ANOVA, Wilcoxon, Kruskal & Wallis, Spearman correlation, Pearson correlation, Odds Ratio and Relative Risk, depending on the classes ofYandX. Note that as many tests as possible are run. For example, ifYis nominal andXis ordered,Xcan be considered as a factor (khi2 and Fisher exact test) but also as a discrete variable (Wilcoxon).
  2. graphical diagnostic: the test presented might not be all valid. Some graphical diagnostic (check for normality) are presented to let the user decide which test is more relevant.
The wide display gives : +---+---+---+ | 1 | 2 | 3 | +---+-+-+---+ | 4 | 5 | +-----+-----+ The long display : +-------+ | 1 | +---+---+ | 2 | 3 | +---+---+ | 4 | 5 | +---+---+ If X is a data.frame, rthb runs the analyses on every column.

References

HTML web site http://www.latex-project.org/ Data are available on line: http://christophe.genolini.free.fr/EPO/EPO2007-Fraude.php

See Also

rthMainFile, rthu, r2lUniv-package, examCheating

Examples

Run this code
### Create some data
V1 <- factor(LETTERS[floor(runif(50,1,4))])
V2 <- rnorm(50,1,1)<0
V3 <- ordered(LETTERS[floor(runif(50,1,4))])

### Create a directory for the output
r2lhOutDir <- paste(tempdir(),"rthbExample",sep="/")
if(!file.exists(r2lhOutDir)){dir.create(r2lhOutDir)}
setwd(r2lhOutDir)

### Execute rthb
rthb(V1~V2,fileOut="first.html",textBefore="<H2>Variables V1, V2, V3</H2>",graphName="Gr1",type="png")
rthb(V2~V1,fileOut="second.html",graphName="Gr2",type="png")
rthb(V3~V1,fileOut="third.html",textBefore="This is V3 vs. V1",graphDir="P",graphName="Gr3",type="png",displayStyle=2)
rthMainFile(text="<LU>
<LI><A HREF='first.html'>First example</A></LI>
<LI><A HREF='second.html'>Second example</A></LI>
<LI><A HREF='third.html'>Third example</A></LI>
</LU>
")
setwd("..")

Run the code above in your browser using DataLab