[Blindmath] (no subject)

Mon Sep 14 21:56:39 UTC 2009

Hi Christine et al,

I have several comments and concerns and then the details below.

First your question about accessibility to information is well 
directed in terms of sending it to this list. My problem is that the 
reason for asking the question is in my opinion misguided on 
statistical grounds.

A quote often attributed to George Box goes something like "All 
models are wrong, but some are useful." It's usually misquoted 
(including here) and was actually first published in a NASA report 
before it appeared in a Box article.

A chi-square test can tell you if a model is useful - well more 
exactly it will tell you if it is not useful. This is a direct 
consequence of the way we do hypothesis tests.

Having decided that two or more models are useful though, the 
chi-square test becomes irrelevant for deciding which model is the 
one to use. You must have other reasons for even considering various 
models and their usefulness to your situation. To go back to the Box 
quote, you might find out that two models are actually valid and 
useful. You then need to ask if they yield differing results in terms 
of what you want to do with them. If they don't differ then it 
probably doesn't matter which one you use. If they do differ then you 
have a problem with choosing the one to apply on the grounds of the 
assumptions made behind each model (and there are always assumptions).
Ultimately we do not want our opinion to depend on the assumptions we make.

Now to remind you of the basics of chi-square testing.

1. Under any model, you should know how many observed values fall 
into well defined classes. In situations where you are counting 
things then this is easy. If the response is continuous then you must 
be careful determining the cutoffs between the classes.
2. You must now consider how many observations you would expect to 
fall into each of the classes defined in step 1.
3. If any of the classes have expected values less than 5, you will 
need to merge classes. This normally occurs at the extremes. Merge 
until all expected values are >=5. Of course you will need to merge 
the observed counts as well to match.
4. The chi-square value is the sum of (O-E)^2/E where O is observed 
(step 1) and E is expected (step 2).
5. This should follow a chi-square distribution with n-p-1 degrees of 
freedom. n is the number of classes after merging, p is the number of 
parameters you estimated under each of your models.
6. Either
6a. Using some software (EXCEL will do) find the p-value for the 
chi-square test statistic found in step 4, compare this to the 
pre-determined level of significance you are happy with - normally 
0.05 is applied. If your p-value is less than 0.05 then your model is 
not useful.
or
6b. determine the critical value for the chi-square distribution with 
the right degrees  of freedom found in step 5. If your test statistic 
is >= critical value then your model is not useful.

Jonathan

At 04:29 a.m. 11/09/2009, you wrote:
>Hi All,
>   Do any of you happen to know where I might be able to obtain 
> speech-software friendly information on running a Chi Square 
> Analysis (e.g., the actual statistics involved). It has been a long 
> time since I have done such an analysis and I need to do so to 
> compare which of a few mathematical models provides a best-fit to 
> some behavioral data I have collected in Cognitive Psychology.
>Many thanks,
>Christine
>Christine M. Szostak
>Graduate Student
>Language Perception Laboratory
>Department of Psychology, Cognitive Area
>The Ohio State University
>Columbus, Ohio
>szostak.1 at osu.edu
>_______________________________________________
>Blindmath mailing list
>Blindmath at nfbnet.org
>http://www.nfbnet.org/mailman/listinfo/blindmath_nfbnet.org
>To unsubscribe, change your list options or get your account info 
>for Blindmath:
>http://www.nfbnet.org/mailman/options/blindmath_nfbnet.org/a.j.godfrey%40massey.ac.nz
>
>No virus found in this incoming message.
>Checked by AVG - www.avg.com
>Version: 8.5.409 / Virus Database: 270.13.90/2361 - Release Date: 
>09/10/09 18:12:00

_____
Dr A. Jonathan R. Godfrey
Lecturer in Statistics
Institute of Fundamental Sciences
Massey University
Palmerston North

Room: AH2.82
Phone: +64-6-356 9099 ext 7705
Mobile: +64-29-538-9814
Home Address: 22 Bond St, Palm. Nth.
Home Phone: +64-6-353 2224 (Just think FLEABAG)