DAP (software)

Last updated
DAP
Developer(s) GNU Project
Initial release1998;26 years ago (1998) [1]
Stable release
3.10 / April 16, 2014;10 years ago (2014-04-16)
Repository
Written in C
Operating system Cross-platform
Type Statistical Analysis
License GNU General Public License
Website www.gnu.org/software/dap

Dap is a statistics and graphics program based on the C programming language that performs data management, analysis, and C-style graphical visualization tasks without requiring complex syntax. Its name is an acronym for Data Analysis and Presentation.

Contents

Dap was written to be a free replacement for SAS, but users are assumed to have a basic familiarity with the C programming language in order to permit greater flexibility.

It has been designed to be used on large data sets and is primarily used in statistical consulting practices.

However, even with its clear benefits, Dap hasn't been updated since 2014 and hasn't seen widespread use when compared to other statistical analysis programs.

Features

Dap is a command line driven program. Below are various features that DAP can perform.

DAP can compute means and percentiles, correlation, & ANOVA from data sets. This includes Unbalanced as well as Crossed, Nested ANOVA. It can also be used to create scatterplots, line graphs and histograms of data. This can include split plots, treatment combinations, as well as latin squares.

DAP can perform linear regression and can utilize regressions to build linear models. In addition to linear regression, DAP can also perform logistic regression analysis as well. There's a variety of other analysis that DAP can do as well including building loglinear models as well as Logit models for linear-by-linear association.

In terms of models, DAP can create mixed balanced and unbalanced models as well as random unbalanced models.

It has been designed so as to cope with very large data sets; even when the size of the data exceeds the size of the computer's memory due to the fact that the program processes files one line at a time rather than reading entire files into memory.

Applications

Industry Uses

Related Research Articles

Analysis of variance (ANOVA) is a collection of statistical models and their associated estimation procedures used to analyze the differences among means. ANOVA was developed by the statistician Ronald Fisher. ANOVA is based on the law of total variance, where the observed variance in a particular variable is partitioned into components attributable to different sources of variation. In its simplest form, ANOVA provides a statistical test of whether two or more population means are equal, and therefore generalizes the t-test beyond two means. In other words, the ANOVA is used to test the difference between two or more means.

<i>F</i>-test Statistical hypothesis test, mostly using multiple restrictions

An F-test is any statistical test used to compare the variances of two samples or the ratio of variances between multiple samples. The test statistic, random variable F, is used to determine if the tested data has an F-distribution under the true null hypothesis, and true customary assumptions about the error term (ε). It is most often used when comparing statistical models that have been fitted to a data set, in order to identify the model that best fits the population from which the data were sampled. Exact "F-tests" mainly arise when the models have been fitted to the data using least squares. The name was coined by George W. Snedecor, in honour of Ronald Fisher. Fisher initially developed the statistic as the variance ratio in the 1920s.

Analysis of covariance (ANCOVA) is a general linear model that blends ANOVA and regression. ANCOVA evaluates whether the means of a dependent variable (DV) are equal across levels of one or more categorical independent variables (IV) and across one or more continuous variables. For example, the categorical variable(s) might describe treatment and the continuous variable(s) might be covariates or nuisance variables; or vice versa. Mathematically, ANCOVA decomposes the variance in the DV into variance explained by the CV(s), variance explained by the categorical IV, and residual variance. Intuitively, ANCOVA can be thought of as 'adjusting' the DV by the group means of the CV(s).

Perl Data Language is a set of free software array programming extensions to the Perl programming language. PDL extends the data structures built into Perl, to include large multidimensional arrays, and adds functionality to manipulate those arrays as vector objects. It also provides tools for image processing, machine learning, computer modeling of physical systems, and graphical plotting and presentation. Simple operations are automatically vectorized across complete arrays, and higher-dimensional operations are supported.

<span class="mw-page-title-main">Georg C. F. Greve</span> German software developer, physicist and author

Georg C. F. Greve is a software developer, physicist, author and currently co-founder and president at Vereign. He has been working on technology politics since he founded the Free Software Foundation Europe (FSFE) in 2001.

In robust statistics, robust regression seeks to overcome some limitations of traditional regression analysis. A regression analysis models the relationship between one or more independent variables and a dependent variable. Standard types of regression, such as ordinary least squares, have favourable properties if their underlying assumptions are true, but can give misleading results otherwise. Robust regression methods are designed to limit the effect that violations of assumptions by the underlying data-generating process have on regression estimates.

<span class="mw-page-title-main">EViews</span>

EViews is a statistical package for Windows, used mainly for time-series oriented econometric analysis. It is developed by Quantitative Micro Software (QMS), now a part of IHS. Version 1.0 was released in March 1994, and replaced MicroTSP. The TSP software and programming language had been originally developed by Robert Hall in 1965. The current version of EViews is 13, released in August 2022.

<span class="mw-page-title-main">MedCalc</span>

MedCalc is a statistical software package designed for the biomedical sciences. It has an integrated spreadsheet for data input and can import files in several formats.

RATS, an abbreviation of Regression Analysis of Time Series, is a statistical package for time series analysis and econometrics. RATS is developed and sold by Estima, Inc., located in Evanston, IL.

The following tables compare general and technical information for a number of statistical analysis packages.

<span class="mw-page-title-main">PSPP</span> Data analysis software

PSPP is a free software application for analysis of sampled data, intended as a free alternative for IBM SPSS Statistics. It has a graphical user interface and conventional command-line interface. It is written in C and uses GNU Scientific Library for its mathematical routines. The name has "no official acronymic expansion".

PSI-Plot is a scientific and engineering data analysis and technical plotting software developed by Poly Software International, Inc. The software can read multiple formats and perform mathematical transforms and statistical analyses. PSI-Plot is maintained on Microsoft Windows operating systems, and the current version is 9.5 for Windows 98 to Windows 7.

Free statistical software is a practical alternative to commercial packages. Many of the free to use programs aim to be similar in function to commercial packages, in that they are general statistical packages that perform a variety of statistical analyses. Many other free to use programs were designed specifically for particular functions, like factor analysis, power analysis in sample size calculations, classification and regression trees, or analysis of missing data.

One application of multilevel modeling (MLM) is the analysis of repeated measures data. Multilevel modeling for repeated measures data is most often discussed in the context of modeling change over time ; however, it may also be used for repeated measures data in which time is not a factor.

<span class="mw-page-title-main">SegReg</span>

In statistics and data analysis, the application software SegReg is a free and user-friendly tool for linear segmented regression analysis to determine the breakpoint where the relation between the dependent variable and the independent variable changes abruptly.

<span class="mw-page-title-main">JASP</span> Free and open-source statistical program

JASP is a free and open-source program for statistical analysis supported by the University of Amsterdam. It is designed to be easy to use, and familiar to users of SPSS. It offers standard analysis procedures in both their classical and Bayesian form. JASP generally produces APA style results tables and plots to ease publication. It promotes open science via integration with the Open Science Framework and reproducibility by integrating the analysis settings into the results. The development of JASP is financially supported by several universities and research funds. As the JASP GUI is developed in C++ using Qt framework, some of the team left to make a notable fork which is Jamovi which has its GUI developed in JavaScript and HTML5.

<span class="mw-page-title-main">Jamovi</span> Graphical user interface for R programming language

jamovi is a free and open-source computer program for data analysis and performing statistical tests. The core developers of jamovi are Jonathon Love, Damian Dropmann, and Ravi Selker, who are developers for the JASP project. jamovi is a fork of JASP.

References

Sources

See also