Developer(s) | SAS Institute |
---|---|
Initial release | 1972 |
Stable release | 9.4M8 / January 31, 2023 |
Written in | C |
Operating system | Windows, IBM mainframe, Unix/Linux, OpenVMS Alpha |
Type | Data analysis |
License | Proprietary |
Website | www |
SAS (previously "Statistical Analysis System") [1] is a statistical software suite developed by SAS Institute for data management, advanced analytics, multivariate analysis, business intelligence, criminal investigation, [2] and predictive analytics.
SAS was developed at North Carolina State University from 1966 until 1976, when SAS Institute was incorporated. SAS was further developed in the 1980s and 1990s with the addition of new statistical procedures, additional components and the introduction of JMP. A point-and-click interface was added in version 9 in 2004. A social media analytics product was added in 2010.
SAS is a software suite that can mine, alter, manage and retrieve data from a variety of sources and perform statistical analysis on it. [3] SAS provides a graphical point-and-click user interface for non-technical users and more advanced options through the SAS language. [3]
SAS programs have DATA steps, which retrieve and manipulate data, and PROC (procedure) steps, which analyze the data; they may also have functions. [4] Each step consists of a series of statements. [5]
The DATA step has executable statements that result in the software taking an action, and declarative statements that provide instructions to read a data set or alter the data's appearance. [4] The DATA step has two phases: compilation and execution. In the compilation phase, declarative statements are processed and syntax errors are identified. Afterwards, the execution phase processes each executable statement sequentially. [6] Data sets are organized into tables with rows called "observations" and columns called "variables". Additionally, each piece of data has a descriptor and a value. [4] [7]
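The two phases and the observation/variable layout described above can be modeled with a short Python sketch. It is an illustration only: it is not SAS code and makes no claim about how SAS is implemented, and every name in it (`compile_step`, `execute_step`, `heights`, `height_m`) is hypothetical.

```python
# Illustrative model only: this is Python, not SAS, and all names are hypothetical.

def compile_step(statements):
    """'Compilation' phase: check every statement before any data is processed."""
    compiled = []
    for target, expression in statements:
        # compile() raises SyntaxError here, before the execution phase starts.
        compiled.append((target, compile(expression, "<step>", "eval")))
    return compiled


def execute_step(compiled, observations):
    """'Execution' phase: run each statement, in order, once per observation."""
    output = []
    for obs in observations:
        row = dict(obs)  # one observation: variable name -> value
        for target, code in compiled:
            # Evaluate the expression using the current observation's variables.
            row[target] = eval(code, {"__builtins__": {}}, row)
        output.append(row)
    return output


# A data set: rows are observations, columns are variables.
heights = [
    {"name": "Ann", "height_cm": 160},
    {"name": "Bob", "height_cm": 175},
]

# One assignment statement: derive a new variable from an existing one.
program = [("height_m", "height_cm / 100")]

result = execute_step(compile_step(program), heights)
print(result)
```

In this model a malformed expression is rejected by `compile_step` before any observation is read, which mirrors the point that syntax errors are identified in the compilation phase.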
PROC statements call upon named procedures. Procedures perform analysis and reporting on data sets to produce statistics, analyses, and graphics. There are more than 300 named procedures and each one performs a substantial body of statistical work. [4] PROC statements can also display results, sort data or perform other operations. [5]
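The idea of calling a named procedure on a data set can be sketched in Python as a lookup table of functions. This is a loose analogy rather than SAS syntax: the procedure names, the `run_proc` helper, and the sample data are all hypothetical.

```python
from statistics import mean

# Illustrative analogy only: Python functions standing in for named procedures.


def proc_means(data, var):
    """Summarize one variable across all observations."""
    values = [obs[var] for obs in data]
    return {"n": len(values), "mean": mean(values), "min": min(values), "max": max(values)}


def proc_sort(data, by):
    """Return the observations ordered by one variable."""
    return sorted(data, key=lambda obs: obs[by])


# Hypothetical registry mapping procedure names to implementations.
PROCEDURES = {"means": proc_means, "sort": proc_sort}


def run_proc(name, data, **options):
    """Look up a procedure by name and apply it to a data set."""
    return PROCEDURES[name](data, **options)


scores = [
    {"id": 3, "score": 70},
    {"id": 1, "score": 90},
    {"id": 2, "score": 80},
]

stats = run_proc("means", scores, var="score")
ordered = run_proc("sort", scores, by="id")
print(stats)
print([obs["id"] for obs in ordered])
```

One named call produces summary statistics and another reorders the data, reflecting the description of procedures as units that either analyze data or perform other operations such as sorting.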
SAS macros are pieces of code or variables that are coded once and referenced to perform repetitive tasks. [8]
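The "coded once, referenced many times" idea can be approximated in Python with text substitution. The sketch below uses the standard library's `string.Template`; the generated command text is made up for illustration and is not SAS syntax, and the names `summary_macro` and `expand` are hypothetical.

```python
from string import Template

# Hypothetical "macro": a code template written once, with two parameters.
summary_macro = Template("summarize table=$table by=$by")


def expand(macro, **params):
    """Substitute parameter values into the template text."""
    return macro.substitute(**params)


# Reference the same template repeatedly with different arguments.
generated = [
    expand(summary_macro, table=name, by="region")
    for name in ("sales_2022", "sales_2023")
]

for line in generated:
    print(line)
```

The template is defined in one place and expanded for each table name, so a change to the template text propagates to every use.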
SAS data can be published in HTML, PDF, Excel, RTF and other formats using the Output Delivery System, which was first introduced in version 7. [9] SAS Enterprise Guide is SAS's point-and-click interface. It generates code to manipulate data or perform analysis without use of the SAS programming language. [10]
The SAS software suite has more than 200 add-on packages, sometimes called components. [11] [12] [13] These components extend Base SAS. [3] [14]
The development of SAS began in 1966 after North Carolina State University re-hired Anthony Barr [15] to program his analysis of variance and regression software so that it would run on IBM System/360 computers. [16] The project was funded by the National Institutes of Health [17] and was originally intended to analyze agricultural data [12] [18] to improve crop yields. [19] Barr was joined by student James Goodnight, who developed the software's statistical routines, and the two became project leaders. [15] [16] [20] In 1968, Barr and Goodnight integrated new multiple regression and analysis of variance routines. [21] [22] In 1972, after issuing the first release of SAS, the project lost its funding. [17] According to Goodnight, this was because NIH only wanted to fund projects with medical applications. [23] Goodnight continued teaching at the university for a salary of $1 and access to mainframe computers for use with the project, [17] until it was funded by the University Statisticians of the Southern Experiment Stations the following year. [16] [23] John Sall joined the project in 1973 and contributed to the software's econometrics, time series, and matrix algebra. Another early participant, Caroll G. Perkins, contributed to SAS's early programming. Jolayne W. Service and Jane T. Helwig created SAS's first documentation. [21]
The first versions of SAS, from SAS 71 to SAS 82, were named after the year in which they were released. [24] In 1971, SAS 71 was published as a limited release. [3] [25] It was used only on IBM mainframes and had the main elements of SAS programming, such as the DATA step and the most common procedures, i.e. PROCs. [24] The following year a full version was released as SAS 72, which introduced the MERGE statement and added features for handling missing data or combining data sets. [26] The development of SAS has been described as an "inflection point" in the history of artificial intelligence. [27] In 1976, Barr, Goodnight, Sall, and Helwig removed the project from North Carolina State and incorporated it as the SAS Institute, Inc. [28]
SAS was redesigned in SAS 76. The INPUT and INFILE statements were improved so they could read most data formats used by IBM mainframes. Report generation was added through the PUT and FILE statements. The ability to analyze general linear models was also added, [29] as was the FORMAT procedure, which allowed developers to customize the appearance of data. [24] In 1979, SAS 79 added support for the IBM VM/CMS operating system and introduced the DATASETS procedure. Three years later, SAS 82 introduced an early macro language and the APPEND procedure. [24]
Beginning with SAS 4, released in 1984, SAS releases have followed a sequential naming convention not based on year of release. [30] SAS version 4 had limited features, but made SAS more accessible. Version 5 introduced a complete macro language, array subscripts, and a full-screen interactive user interface called Display Manager. [24] In 1985, SAS was rewritten in the C programming language. This enabled SAS's MultiVendor Architecture, which allows the software to run on UNIX, MS-DOS, and Windows. It was previously written in PL/I, Fortran, and assembly language. [20] [24]
In the 1980s and 1990s, SAS released a number of components to complement Base SAS. SAS/GRAPH, which produces graphics, was released in 1980, as well as the SAS/ETS component, which supports econometric and time series analysis. A component intended for pharmaceutical users, SAS/PH-Clinical, was released in the 1990s. The Food and Drug Administration standardized on using SAS/PH-Clinical for new drug applications in 2002. [20] Vertical products like SAS Financial Management and SAS Human Capital Management (then called CFO Vision and HR Vision respectively) were also introduced. [31] JMP was developed by SAS co-founder John Sall and a team of developers, in order to take advantage of the graphical user interface introduced in the 1984 Apple Macintosh. [32] JMP's name originally stood for "John's Macintosh Project". [33] JMP was shipped for the first time in 1989. Updated versions of JMP were released continuously after 2002 with the most recent release in 2016. [34] [35] [36] [37]
SAS 6 was used throughout the 1990s and was available on a wider range of operating systems, including Macintosh, OS/2, Silicon Graphics, and PRIMOS. SAS introduced new features through dot-releases. From 6.06 to 6.09, a user interface based on the Windows paradigm was introduced and support for SQL was added. [38] Version 7 introduced the Output Delivery System (ODS) and an improved text editor. Subsequent releases improved upon the ODS. For example, more output options were added in version 8. The number of operating systems that were supported was reduced to UNIX, Windows and z/OS, and Linux was added. [39] SAS 8 and SAS Enterprise Miner were released in 1999. [20]
In 2002, the Text Miner software was introduced. Text Miner analyzes text data like emails for patterns in business intelligence applications. [40] In 2004, SAS Version 9.0 was released, referred to as "Project Mercury" internally, and was designed to make SAS accessible to a broader range of business users. [41] [42] SAS 9.0 added custom user interfaces based on the user's role and established the point-and-click user interface of SAS Enterprise Guide as the software's primary graphical user interface (GUI). [41] The Customer Relationship Management (CRM) features were improved in 2004 with SAS Interaction Management. [43] In 2008, SAS announced Project Unity, designed to integrate data quality, data integration, and master data management. [44]
SAS Institute Inc v World Programming Ltd was a lawsuit with developers of a competing implementation, World Programming System, alleging that they had infringed SAS's copyright in part by implementing the same functionality. The case was referred by the United Kingdom's High Court of Justice to the European Court of Justice on 11 August 2010. [45] In May 2012, the European Court of Justice ruled in favor of World Programming, finding that "the functionality of a computer program and the programming language cannot be protected by copyright." [46]
A free version of SAS was introduced for students in 2010. [47] SAS Social Media Analytics, a tool for social media monitoring, engagement and sentiment analysis, was also released that year. [48] SAS Rapid Predictive Modeler (RPM), which creates basic analytical models using Microsoft Excel, was introduced the same year. [49] [50] In 2010, JMP 9 included a new interface for using the R programming language and an add-in for MS Excel. [51] [52] The following year, a High Performance Computing platform was made available in a partnership with Teradata and EMC Greenplum. [53] [54] In 2011, the company released Enterprise Miner 7.1. [55] From October 2013 to October 2014, the company introduced 27 data management products and updated 160 others. [56] At the SAS Global Forum 2015, SAS announced several new products that were specialized for different industries, as well as new training software. [57]
SAS has had many releases since 1972. [58] Since release 9.3, SAS/STAT has had its own release numbering.
Release | Date | Comment |
---|---|---|
72 | January 1972 | |
76 | July 1976 | |
79.5 | April 1981 | |
82.4 | January 1983 | |
4.06 | March 1984 | |
5.03 | January 1985 | |
6.01 | July 1986 | PC DOS |
6.03 | March 1988 | |
6.06 | March 1990 | |
6.07 | April 1991 | |
6.08 | March 1993 | |
6.09 | October 1993 | |
6.10 | October 1994 | |
6.11 | October 1995 | |
6.12 | November 1996 | |
7.0 | October 1998 | |
8.0 | November 1999 | |
8.1 | July 2000 | |
8.2 | March 2001 | |
9.0 | October 2002 | |
9.1 | December 2003 | |
9.1.3 | August 2004 | |
9.2 | March 2008 | STAT 9.2 |
9.2M2 | April 2010 | STAT 9.22 |
9.3 | July 2011 | STAT 9.3 |
9.3M2 | August 2012 | STAT 12.1 |
9.4 | July 2013 | STAT 12.3 |
9.4M1 | December 2013 | STAT 13.1 |
9.4M2 | August 2014 | STAT 13.2 |
9.4M3 | July 2015 | STAT 14.1 |
9.4M4 | November 2016 | STAT 14.2 |
9.4M5 | September 2017 | STAT 14.3 |
9.4M6 | November 2018 | STAT 15.1 |
9.4M7 | August 2020 | STAT 15.2 |
9.4M8 | January 2023 | STAT 15.3 |
As of 2011, SAS's largest set of products was its line for customer intelligence. Numerous SAS modules for web, social media and marketing analytics may be used to profile customers and prospects, predict their behaviors and manage and optimize communications. [59]
SAS also provides the SAS Fraud Framework. The framework's primary functionality is to monitor transactions across different applications, networks and partners and use analytics to identify anomalies that are indicative of fraud. [60] [61] [62]
SAS Enterprise GRC (Governance, Risk and Compliance) provides risk modeling, scenario analysis, and other functions [63] [64] in order to manage and visualize risk, compliance and corporate policies. There is also a SAS Enterprise Risk Management product-set designed primarily for banks and financial services organizations. [65]
SAS products for monitoring and managing the operations of IT systems are collectively referred to as SAS IT Management Solutions. [66] SAS collects data from various IT assets on performance and utilization, then creates reports and analyses. [67] SAS's Performance Management products consolidate and provide graphical displays for key performance indicators (KPIs) at the employee, department and organizational level. [68] [69]
The SAS Supply Chain Intelligence product suite is offered for supply chain needs, such as forecasting product demand, managing distribution and inventory and optimizing pricing. [70] There is also a "SAS for Sustainability Management" set of software to forecast environmental, social and economic effects and identify causal relationships between operations and their impact on the environment or ecosystem. [71]
SAS has products for specific industries, such as government, retail, telecommunications, aerospace, marketing optimization, and high-performance computing. [72]
SAS previously offered a Free University Edition which could be downloaded for non-commercial use. SAS put out a press release announcing the Free University Edition on 28 May 2014. [73] However, in 2022, the SAS Free University Edition was replaced by two entirely web-based versions: SAS OnDemand for Academics and SAS Viya for Learners. [74]
In a 2005 article for the Journal of Marriage and Family comparing statistical packages from SAS and its competitors Stata and SPSS, Alan C. Acock wrote that SAS programs provide an "extraordinary range of data analysis and data management tasks," but were difficult to learn and use. [75] SPSS and Stata, meanwhile, were both easier to learn but had more limited analytic capabilities, though these could be expanded with paid (in SPSS) or free (in Stata) add-ons. Acock concluded that SAS was best for power users, while occasional users would benefit most from SPSS and Stata. [75] A 2014 comparison by the University of California, Los Angeles, gave similar results. [76]
Competitors such as Revolution Analytics and Alpine Data Labs advertise their products as considerably cheaper than SAS's. In a 2011 comparison, Doug Henschen of InformationWeek found that start-up fees for the three are similar, though he admitted that the starting fees were not necessarily the best basis for comparison. [77] SAS's business model is not weighted as heavily on initial fees for its programs, instead focusing on revenue from annual subscription fees. [78] [79]
In 2016, SAS Viya was introduced, [80] with a new architecture optimized for running SAS software in public clouds. Viya also increased interoperability with open source software, allowing models to be built in tools such as R, Python and Jupyter, and then executed on SAS's Cloud Analytics Services (CAS) engine. [81] In 2020, a further architectural revamp in Viya 4 containerized the software. [82] SAS sells Viya alongside SAS 9.4, and has not positioned it as a replacement for SAS 9.4. [83]
According to IDC, SAS is the largest market-share holder in "advanced analytics" with 35.4 percent of the market as of 2013. [84] It is the fifth largest market-share holder for business intelligence (BI) software with a 6.9 percent share [85] and the largest independent vendor. It competes in the BI market against SAP BusinessObjects, IBM Cognos, SPSS Modeler, Oracle Hyperion, and Microsoft Power BI. [86] SAS has been named in the Gartner Leader's Quadrant for Data Integration Tools and for Business Intelligence and Analytical Platforms. [87] A study published in 2011 in BMC Health Services Research found that SAS was used in 42.6 percent of data analyses in health service research, based on a sample of 1,139 articles drawn from three journals. [88]
Note: SAS 72 and SAS 76 are attributed to Barr, Goodnight, Service, Perkins, and Helwig.