What is this product?
statistiXL 1.8 Released
The final release of statistiXL Version 1.8 is now available. This release is mainly intended to correct some charting layout issues experienced with Excel 2007.
- statistiXL now provides support for Microsoft Office Excel 2007. This overcomes incompatibilities experienced by Version 1.6, particularly with regards to charting. It also includes the ability to work with the new format larger worksheets of 1048576 rows x 16384 columns.
- Version 1.7 of statistiXL now provides support for Microsoft Windows Vista Operating Systems. statistiXL 1.7 will install and run under Microsoft Windows Vista.
- Support for non-English language versions of Windows and Office has been enhanced. In previous versions of statistiXL, checkbox options on dialog forms were not properly restored between tests. Checkboxes would typically be restored in an indeterminate state (i.e. neither checked or cleared). If not manually corrected for each test this would lead to unexpected errors when attempting to perform an analysis. Checkbox settings are now remembered and restored correctly between tests when running statistiXL on non-English versions of Excel and Windows.
- The ability to add Prediction Bands to bivariate regression plots has been added. This supplements the Confidence and Standard Error Bands available in previous version of statistiXL.
- Multiple Linear Regression now tests for Multicollinearity between variables. Variables with a Tolerance of 0, indicating exact multicollinearity, are automatically excluded from the analysis.
- Cluster Analysis now behaves like other tests, outputing results to a new blank worksheet when the Output Range field is left blank.
statistiXL is an extremely powerful, feature rich data analysis package that runs as an add-in to Microsoft’s popular Excel™ spreadsheet program. It provides access to a wide variety of both parametric and non-parametric data analysis tools and statistical tests. By working from within Microsoft Excel™, statistiXL is able to leverage the existing features of Excel™ and provide a host of additional benefits including:
- A familiar and powerful user interface for entering and manipulating data
- A wide variety of formatting options for altering the appearance of results
- The presence of a sophisticated charting package that allows both the manipulation of charts produced by statistiXL and the manual creation of new charts based on statistiXL‘s output.
- The ability to perform further ad hoc analysis using Excel’s™ own built-in functions and calculating abilities.
Factor Analysis is a procedure that seeks to determine a reduced number of variables, called factors, that explain much of the variation present in a larger number of measured variables. statistiXL provides a comprehensive module for Factor Analysis, with a variety of analytical options. Either the correlation or covariance matrix can be used in calculating the factors. The number of factors to be extracted can be established by several different criteria: 1) the number of factors can be chosen to encompass a specified percentage of the total variance in the original data, 2) you can choose to extract factors with eigenvalues greater than a set value, 3) you can extract a specific number of factors. A variety of methods are included for determining the factors including the principal component method (not to be confused with principal component analysis), principal factor method, and maximum likelihood method. The axes of the resulting factors can then be rotated to improve the factor structure using either Varimax, Quartimax, or Equamax procedures.
Results are presented in tabular form. Descriptive statistics and the correlation or covariance matrix are provided, if these options were selected.
Linear regression attempts to explain the variation present in one variable (for example Height) in terms in terms of a linear relationship to variation in one or more predictor variables (for example Age). statistiXL provides comprehensive Model I regression analysis. The module allows the selection of one or more predictor variables for each single dependent variable. Options include forward or backwards stepwise regression (with P level to enter or remove), forcing of the relationship through the origin, and graphical output (normal probability plot, residuals plot, scatterplots).
Results from regression analysis are presented in tabular form and graphical form. Summary statistics are provided if this option is selected. Statistics include the R², the correlation coefficient, the adjusted R², and the standard error of the estimate. An ANOVA table is given, to summarise the significance of the regression relationship.
Principal Component Analysis
As with Factor Analysis, Principal Component Analysis is a technique that attempts to reduce complex data sets consisting of many different variables to a smaller set of new variables that still manage to describe much of the variation in the original data. These new variables, called Principal Components, are chosen to be independent and to maximise the variance found in the original data set. statistiXL provides a number of options for Principal Component Analysis. Either the correlation or covariance matrix between variables can be selected as the basis for analysis. All Principal Components can be extracted or a subset of these based on limits such as the number to extract, the percent of variance to explain or the value of an eigenvalue. Screeplots can be produced to help in the visual determination of the appropriate number of Principal Components to extract.
Results are presented in tabular and graphical form. Descriptive statistics and the correlation or covariance matrix are displayed, if these options were selected. The eigenvalues are then tabulated along with the percent of variance and cumulative percent of total variance evident in the original dataset that each of the extracted Principal Components explains.
Analysis of Variance and Covariance
Analysis of Variance (ANOVA) is used to determine whether there is a difference between three or more categorical sets of values. Analysis of Covariance (ANCOVA) on the other hand, while also used to determine whether there is a difference between categorical sets of values also takes into account the effect of one or more numerical variables called covariates. StatistiXL provides both univariate and multivariate ANOVA and ANCOVA. Factors can be specified as fixed or random and the nesting of factors is also supported. Simplified dialog boxes aid the rapid analysis of full factorial andrepeated measures models, while for more advanced analyses a comprehensive dialog box is available that allows custom models to be specified precisely detailing the factors and interactions to be included in the analysis. Post Hoc Tests are provided so that you can drill down into your dataset and see what, if any, the major differences between groups are.
Results are presented in tabulated form, starting optionally with a table of simple descriptive statistics for each group.
A contingency table is a table of counts or frequencies. It lists the number of times that each of 2 or more variables falls into a variety of different categories. A contingency test simply examines the null hypothesis that the frequencies of observations found for one variable are independent of the frequencies of observations in the other. statistiXL provides a flexible module for the analysis of contingency tables. Two-way and multi-way contingency tables can be analyzed. The statistics available for the frequency test are Chi² and log-likelihood, the latter being a good alternative approach to Chi² if the expected frequencies are small. Yates’ and Cochran’s corrections for continuity are provided for 2×2 contingency where such an adjustment is recommended because of the low degrees of freedom. statistiXL explains how to subdivide a contingency table and warns of the limited statistical value of this approach. A Heterogeneity Chi² can be used in a contingency table analysis to determine if a number of sets of observed frequency data can be combined into a single set. statistiXL explains how to analyze contingency tables for heterogeneity.
Results are presented in tabulated form, starting with an optional table of a summary of the observed and expected frequencies. The test results are then presented, with the Chi² and Log-likelihood values, along with their degrees of Freedom and P values.
The descriptive statistics feature of statistiXL provides a quick and easy summary of the basic parametric and nonparametric statistics that describe a sample of values. Unlike many packages, statistiXL provides modules for both linear data and circular descriptive statistics. The descriptive statistics which can be provided (by user selection) with the linear descriptive statistics module are: mean, median, mode, standard error, standard deviation, variance, coefficient of variation, lower and upper confidence limits, 25th and 75th percentiles, sum, minimum and maximum values, nth smallest and largest values (with user input of n), range, count, skewness (with probability if count >9) and kurtosis (with probability if count >19). If these data are sampled at random from anormal distribution, then the data are best summarized by the parametric statistics such as mean, variance and standard deviation. If the data are sampled from a non-normal distribution, then the non-parametric statistics such as median, mode, percentiles and range may be more appropriate statistics for summarizing these data. Options for graphical output includes a “box and whisker” plot and an error bar plot.
A sample of values collected using a circular scale (e.g. time of day or compass bearing) can be described by a variety of descriptive statistics, the more common being the mean angle, angular variance, and angular standard deviation. Optional graphical output includes a circular plot.
Nonparametric statistical tests are distribution-independent tests that are used to analyse data for which an underlying distribution (such as the normal distribution) is not assumed. Non-parametric statistics have a number of advantages over parametric statistics.
statistiXL provides a diverse array of nonparametric tests. The sign test can be used to examine whether two populations have the same median, and for observations in pairs with one of each pair coming from each population. Various modifications of the sign test can be used for specific tests The Friedman test for blocked data is equivalent to a sign test, but for more than 2 groups. The Mann-Whitney test (U statistic) is a nonparametric test that uses the ranks of two independent samples, from the highest to lowest (or lowest to highest), to calculate the U statistic; it is the nonparametric analog to the parametric two-sample t-test. The Wilcoxon’s signed-rank test ranks the differences between pairs of data (or single data set of a sample) and compares the sum of positive and negative ranks with a critical U value; it is the nonparametric analog to the parametric paired t-test. The Kruskal-Wallis test is a nonparametric test for the comparison of 3 or more treatment groups, which are independent; it is the nonparametric equivalent to analysis of variance (ANOVA). A common nonparametric correlation test is Spearman’s rho rank correlation coefficients; this is analogous to the parametric Pearson’s correlation coefficient. Mood’s Median Test examines whether two or more samples come from a population having the same median. The Wald-Wolfowitz Runs test analyses a sequence of observations, or compares random samples which are mutually independent, for two or more outcomes e.g. two species of antelope, or three brands of automobile. Results are presented in tabular form.
- statistiXL now provides support for Microsoft Office Excel 2007. This overcomes incompatibilities experienced by Version 1.6
- Version 1.7 of statistiXL now provides support for Microsoft Windows Vista Operating Systems.
- statistiXL is compatable with Excel 97, Excel 2000 and Excel 2003.
- To be able to install statistiXL you will require just under 13MB of free disk space.
- Memory and processor requirements are similar to those for Excel (if you can run Excel you can run statistiXL!).
Try before you buy, You can download a trial version to test if it will suit your needs.