Bug 615920 – CHITEST function

After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.

Bug 615920 - CHITEST function


Summary:	CHITEST function


Status:	RESOLVED FIXED

Product:	Gnumeric
Classification:	Applications
Component:	Analytics
Version:	git master
Hardware:	Other Linux

Importance:	Normal normal
Target Milestone:	---
Assigned To:	Morten Welinder
QA Contact:	Jody Goldberg

URL:
Whiteboard:

Depends on:
Blocks:

Reported:	2010-04-16 04:04 UTC by Andreas J. Guelzow
Modified:	2010-04-17 03:56 UTC

See Also:
GNOME target:	---
GNOME version:	---

Description Andreas J. Guelzow 2010-04-16 04:04:19 UTC

The CHITEST function currently calculates a goodness of fit test if the expected and observed range is an n by 1 or 1 by n range.

If the ranges are m by n with both m and n at least 2, then Excel and OOo appear to use a degree of freedom of (n-1) * (m-1) which is only appropriate if the expected values are calculated from the observed values to perform a test of independence or a test of homogeneity. Of course in this case we should not need to specify the expected values. (Behaviour A)

If the ranges are m by n with both m and n at least 2, one could also make sense out of this function by using degrees of freedom (n*m-1). In that case it would simply be a goodness of fit test (assuming that the expected values were determined apriori (rather than fromthe observed values). (Behaviour B)

Currently in the case of the ranges being m by n with both m and n at least 2, gnumeric uses the degree of freedom (n-1). This does not make any sense at all.

Gnumeric's behaviour should be changed to reflect either Behaviour A or B. 

The strength of Behaviour A is compatibility with Excel (and probably OOo) with the drawback that the results are only meaningful if the expected values are appropriate constructed, 

THe strength of Behaviour B is that it makes the most mathematical sense (unless the expected values are constructed from the observed ones.)

Comment 1 Morten Welinder 2010-04-16 12:36:04 UTC

I would say do A+docs.

There are surely detailed instructions out there for how to do this kind
of test with Excel.  Some of those instructions might even be correct.
Adding to the confusion by behaving differently is not likely to help.

Comment 2 Andreas J. Guelzow 2010-04-17 03:56:09 UTC

This problem has been fixed in the development version. The fix will be available in the next major software release. Thank you for your bug report.