{smcl} {* 20sep2001}{...} {hline} help for {hi:chiplot}{right:experimental version 0.0.2, a Stata v7 program} {hline} {title:Chi-plot: Nonparametric Graphical Assessment of Dependence} {p 4 8}{cmd:chiplot }{it:yvarname xvarname} [{cmd:if} {it:exp}] [{cmd:in} {it:range}] [{cmd:,} {cmdab:h:ilite(}{it:exp}{cmd:)} {cmd:ci(}{it:#}{cmd:)} {cmdab:sam:ple(}{it:%}{cmd:)} graph_options] {title:Description} {p}{cmd:chiplot} is an experimental program implementing the nonparametric graphical assessment of dependence plot described by N. I. Fisher and P. Switzer (see: Graphical assessment of dependence: is a picture worth 100 tests? {it:The American Statistician} 55(3):233-9, 2001). Because this is offered as an experimental program only, no details about interpretation of the plots are given; please see the original article for guidance. Note that the matrix form of the plot is not implemented in this version of the program. {title:Options} {p 0 4}{cmd:hilite(}{it:exp}{cmd:)} allows a logical expression to be defined to highlight a subgroup of data in the plot. (This is an extension to the plot described by Fisher and Switzer.) {p 0 4}{cmd:ci(}{it:#}{cmd:)} sets {it:p} = {it:#} such that approximately 100{it:p} % of the (lambda, chi) pairs will fall between the plotted dashed horizontal control lines when X and Y are independently distributed (the null hypothesis). The only allowed values of {it:#} are .90, .95, and .99 (or, equivalently, 90, 95, and 99). The default is .95. {p 0 4}{cmd:sample(}{it:%}{cmd:)} allows a random subsample of the data to be selected for processing. The {it:%} given to {cmd:sample()} should be a value in the range: 0 < {it:%} < 100. If {cmd:hilite()} is specified, the highlighted and non-highlighted subsets will be sampled separatedly to (approximately) yield the specified percentage of each subgroup. {p 4 4}(Note that this option is offered because processing time for large samples is very long -- for example, exceeding 70 seconds on a 300MHz Pentium for a file of 8000 observations; this despite optimization of the code in version 0.0.2 that reduced the time to one-third that of version 0.0.1.) {p 0 4}{cmd:graph_options} are those of Stata's twoway graph. With the exception of {cmd:saving()}, options are applied equally to both the twoway scatterplot and the chiplot, so caution is warranted. {title:Examples} {p 1 33}{inp:. chiplot mpg weight} {space 21} (chi-plot of mpg against weight){p_end} {p 1 33}{inp:. chiplot mpg weight if foreign} {space 10} (same, but only foreign cars){p_end} {p 1 33}{inp:. chiplot mpg weight, hilite(price<5000)} {space 1} (same, but inexpensive cars highlighted){p_end} {p 1 33}{inp:. chiplot mpg weight, hilite(foreign)} {space 4} (same, but with foreign cars hilighted){p_end} {p 1 33}{inp:. chiplot mpg weight, h(foreign) sam(50)} {space 1} (same, but a 50% sample of each group){p_end} {title:Author} {p}Thomas J. Steichen, RJRT, {browse "mailto:steicht@rjrt.com":steicht@rjrt.com} {title:Thanks to...} {p 0 2}Fred Wolfe for exercising the program and pointing out some useful enhancements now incorporated in this, version 0.0.2; and {p 0 2}Nick Cox and Bill Gould for discussions that led indirectly to notable reductions in processing time.