Free statistical software

From Citizendium
Revision as of 23:56, 21 March 2009

This editable Main Article has an approved citable version (see its Citable Version subpage). While we have done conscientious work, we cannot guarantee that this Main Article, or its citable version, is wholly free of mistakes. By helping to improve this editable Main Article, you will help the process of generating a new, improved citable version.

Introduction

A wide range of free statistical software is available from governments, NGOs, universities, and individual developers. Most packages are menu driven and fairly easy to learn, while a few are command driven. Many of them have been used in academic research published in peer reviewed journals or in publications from major organizations. Some are very popular, while others are used much less often. In general, though, free statistical software should be seen as a reasonable alternative to commercial packages.

Sources of free statistical software

Some of the free software comes from governmental organizations or NGOs, such as Epi Info[1] from CDC and IDAMS[2] from UNESCO. Other software comes from smaller or independent organizations or universities, such as Instat[3] or Irristat[4]. The great majority of free statistical software, however, comes from individuals. Commonly used packages from individuals include Easyreg[5], MicrOsiris[6], OpenStat[7], and Zelig[8].

Finally, a couple of other packages are being developed by groups of individuals, rather than by single developers or by established institutions such as universities, governments, or NGOs. PSPP[9], from the GNU project, is developing into a free clone of SPSS. The R project[10] is also frequently used.

Reviews of free statistical software

There are a few reviews of free statistical software. Two reviews appeared in journals (but were not peer reviewed): one by Zhu and Kuljaca[11] and one by Grant, which consisted mainly of a brief review of R[12]. Zhu and Kuljaca outlined some useful characteristics of software, such as ease of use, a range of statistical procedures, and the ability to develop new procedures. They reviewed several programs and identified which ones, at that time, had the most functionality. Several of the programs may not then have had all of the desired capabilities for advanced statistics. Grant reviewed some of the programming features of R and briefly mentioned the availability of other programs. Two websites that list software, by StatCon[13] and by Pezzullo[14], also have very brief reviews of each package. These sites mainly offer a brief list of the features available in each package.

There is also a journal specifically for statistical software[15], although the main focus is on commercial software, R and some coding snippets.

These free software packages have been used in a number of scholarly publications, which suggests that at least some journals, NGOs, and other organizations regard the packages as valid. For example, OpenStat was used in a research letter to JAMA[16] and in a genome study[17]. Irristat was used in an agricultural report[18], and WinIdams was used in two papers[19],[20].

Using free statistical software

Before using any statistical package, it is generally a good idea to have a solid background in statistics. The package can then be used to best advantage: for example, to choose the most appropriate test and to make sure all the necessary assumptions are met, so that appropriate conclusions can be drawn.
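
As a small illustration of why assumptions matter when choosing a test, the sketch below compares two groups with a t statistic. It is written in Python using only the standard library and is not taken from any of the packages discussed here. The function names, the sample data, and the variance-ratio cutoff of 4 (a common rule of thumb, not a formal test) are all assumptions made for this example. If the two sample variances are similar, the pooled (equal variance) statistic is used; otherwise Welch's version, which does not assume equal variances, is used.

```python
from math import sqrt
from statistics import mean, variance


def variance_ratio(a, b):
    """Ratio of the larger sample variance to the smaller one.

    Assumes neither sample is constant (a zero variance would divide by zero).
    """
    va, vb = variance(a), variance(b)
    return max(va, vb) / min(va, vb)


def pooled_t(a, b):
    """Two-sample t statistic assuming equal variances."""
    na, nb = len(a), len(b)
    sp2 = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(sp2 * (1 / na + 1 / nb))


def welch_t(a, b):
    """Two-sample t statistic that does not assume equal variances."""
    na, nb = len(a), len(b)
    return (mean(a) - mean(b)) / sqrt(variance(a) / na + variance(b) / nb)


def choose_t_statistic(a, b, max_ratio=4.0):
    """Pick the statistic based on a rule-of-thumb variance check.

    max_ratio is a hypothetical cutoff chosen for illustration only.
    """
    if variance_ratio(a, b) <= max_ratio:
        return "pooled", pooled_t(a, b)
    return "welch", welch_t(a, b)


if __name__ == "__main__":
    group_a = [1, 2, 3, 4, 5]       # made-up data
    group_b = [2, 3, 4, 5, 6]       # similar spread to group_a
    group_c = [0, 10, 20, 30, 40]   # much larger spread
    print(choose_t_statistic(group_a, group_b))
    print(choose_t_statistic(group_a, group_c))
```

The sketch stops at the test statistic. Turning it into a p-value requires the t distribution with the appropriate degrees of freedom, which is the kind of step the packages described in this article handle for the user.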

Once the statistical issues are understood, the next step is to decide which package to use. Most of these packages are menu driven and can be learned in a couple of hours at most. The exceptions are R, which is generally code driven and takes much longer to learn, and to some extent CDC's Epi Info, which also takes some time to learn.

Several of the packages also have tutorials. For example, CDC has tutorials about Epi Info[21],[22]. The CDC page also lists a video slide show tutorial from the University of Nebraska[23], and another site has online training classes[24]. One faculty member from Emory's School of Public Health has a number of introductory manuals[25]. R has a large number of tutorials and manuals in English and other languages[26],[27],[28], an FAQ site[29], and an email list for announcements and help requests[30]. Gene Shackman 05:56, 22 March 2009 (UTC)

References

  1. Epi Info, CDC, 2008 http://www.cdc.gov/epiinfo/index.htm.
  2. IDAMS Statistical Software, http://portal.unesco.org/ci/en/ev.php-URL_ID=2070&URL_DO=DO_TOPIC&URL_SECTION=201.html
  3. Instat - an interactive statistical package, Statistical Services Centre - University of Reading, 2009. http://www.ssc.rdg.ac.uk/software/instat/instat.html
  4. Irristat, International Rice Research Institute, Biometrics and Bioinformatics Unit, http://www.irri.org/science/software/irristat.asp
  5. Easy Reg International, Herman Bierens, Penn State University, 2008 http://econ.la.psu.edu/~hbierens/EASYREG.HTM
  6. MicrOsiris, Neal Van Eck, Van Eck Computer Consulting http://www.microsiris.com/
  7. OpenStat, Bill Miller, 2009 http://www.statpages.org/miller/openstat/
  8. Zelig, Kosuke Imai, Gary King and Olivia Lau, 2009 http://gking.harvard.edu/zelig/
  9. PSPP, 2008 http://www.gnu.org/software/pspp/
  10. The R Project, http://cran.r-project.org/
  11. "A Short Preview of Free Statistical Software Packages for Teaching Statistics to Industrial Technology Majors" Journal of Industrial Technology (Volume 21-2, April 2005), Ms. Xiaoping Zhu and Dr. Ognjen Kuljaca. http://www.nait.org/jit/current.html
  12. Felix Grant, "Free Statistics Software, Yours, Free to keep....", Scientific Computing World, Sept/Oct 2004, http://www.scientific-computing.com/scwsepoct04free_statistics.html
  13. List of free statistical software, Open Source & Public Domain Packages with Source Code. StatCon 2006. http://statistiksoftware.com/free_software.html
  14. Pezzullo, Free Statistical Software, 2009. http://statpages.org/javasta2.html
  15. Journal of Statistical Software, http://www.jstatsoft.org/
  16. Future Salary and US Residency Fill Rate Revisited, Mark Ebell. Research letter in JAMA, September 10, 2008—Vol 300, No. 10, p1131-1132. http://jama.ama-assn.org/cgi/reprint/300/10/1131
  17. Differential gene expression patterns in cyclooxygenase-1 and cyclooxygenase-2 deficient mouse brain. Christopher D Toscano, Vinaykumar V Prabhu, Robert Langenbach, Kevin G Becker, and Francesca Bosetti. Genome Biol. 2007; 8(1): R14. http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1839133
  18. FAO Plant Production and Protection Paper No. 174, Rome, 2003, Genotype x environment interactions. Challenges and opportunities for plant breeding and cultivar recommendations, http://www.fao.org/DOCREP/005/Y4391E/y4391e00.htm
  19. N. S. Sapre, N. Pancholi, and S. Gupta, Computational Modeling of Substitution Effect on HIV–1 Non–Nucleoside Reverse Transcriptase Inhibitors with Kier–Hall Electrotopological State (E–state) Indices, Internet Electron. J. Mol. Des. 2008, 7, 55–67, http://www.biochempress.com/cv07_i03.html
  20. Chawla, Anju. Exploring project selection behavior of academic scientists in India. Research Evaluation, Volume 16, Number 1, March 2007, pp. 35-45(11). http://www.ingentaconnect.com/content/beech/rev/2007/00000016/00000001/art00004
  21. Epi Info™ Community Health Assessment Tutorial. The Epi Info™ Community Health Assessment Tutorial was produced by the collaborative efforts of the Centers for Disease Control and Prevention (CDC), the Assessment Initiative (AI), and the New York State Department of Health (NYSDOH). http://www.cdc.gov/epiinfo/communityhealth.htm
  22. Cholera Outbreak in Rwenshama: Using Epi Info for Windows in an Outbreak Investigation. Coordinating Office for Global Health - DGPHCD, http://www.cdc.gov/cogh/dgphcd/training/softwaretraining.htm
  23. Introduction to EPI2000. GPVEC Great Plains Veterinary Educational Center. University of Nebraska - Lincoln. http://gpvec.unl.edu/videos/epi-stats.asp
  24. The North Carolina Center for Public Health Preparedness Training Website http://nccphp.sph.unc.edu/training/index.html
  25. Kevin's Web Page. Kevin M. Sullivan, PhD, MPH, MHA. Department of Epidemiology and Global Health. Rollins School of Public Health of Emory University. http://www.sph.emory.edu/~cdckms/
  26. Contributed Documentation. http://cran.r-project.org/other-docs.html.
  27. William Revelle, Using R for psychological research: A simple guide to an elegant package, 2008, http://personality-project.org/r/
  28. Dong-Yun Kim, MAT 356 R Tutorial, Spring 2004. http://www.math.ilstu.edu/dhkim/Rstuff/Rtutor.html
  29. R FAQ. Frequently Asked Questions on R. Version 2.8.2009-03-18. ISBN 3-900051-08-9 http://lib.stat.cmu.edu/R/CRAN/doc/FAQ/R-FAQ.html
  30. R-help -- Main R Mailing List: Primary help. https://stat.ethz.ch/mailman/listinfo/r-help