Data.gov

Type of site: Government Web site
Available in: English
Owner: Government of the United States
URL: data.gov
Commercial: No
Registration: Optional
Launched: May 30, 2009
Current status: Active

Data.gov is a U.S. Government website launched in late May 2009 by the Federal Chief Information Officer (CIO) of the United States, Vivek Kundra. Data.gov aims to improve public access to high-value, machine-readable datasets generated by the Executive Branch of the Federal Government. [1] The site is a repository for Federal, state, local, and tribal government information [2] made available to the public.

History and background

On March 5, 2009, shortly after his appointment as the first Federal Chief Information Officer, Vivek Kundra announced the creation of Data.gov. [3] The website is managed and hosted by the U.S. General Services Administration, Technology Transformation Services. [4]

The site introduced the philosophy of digital open data to the U.S. Federal government, an approach that, according to the book Democratizing Data, will have benefits for states, including "rebuilding confidence in government and business". [5]

Data.gov has grown from 47 datasets at launch to over 370,000 datasets. Jeanne Holm, Chief Knowledge Architect for the National Aeronautics and Space Administration (NASA), was the evangelist and knowledge architect for Data.gov. [6] James Hendler, an artificial intelligence researcher at Rensselaer Polytechnic Institute, was named the "Internet Web Expert" at the time and tasked with helping Data.gov exploit advanced Web technologies.

Data.gov was one of the first efforts to create an open data ecosystem—using data as the basis for connecting government agencies, researchers, businesses, and civil society. Communities of practice were created around key topics such as climate, providing a way for researchers to ask for data and to coordinate work across government agencies. By the end of 2010, most Federal agencies had published data on Data.gov. In November 2010, the Data.gov team hosted the first International Open Government Data Conference with 10 nations participating to expand the principles of open data. This conference grew to become the International Open Data Conference.

By 2012, open data from Data.gov was regularly used by civil society and business. Community-led efforts, such as hackathons from Code for America and events like the National Day of Civic Hacking, relied on government data provided by Data.gov. The GovLab created the Open Data 500 [7] to showcase businesses built on open data provided by Data.gov. To ensure open data's sustainability, President Obama issued an executive order, "Making Open and Machine Readable the New Default for Government Information", to formalize Data.gov as the permanent repository for open government data. [8]

McKinsey & Company published research [9] showing that open data contributed $3 trillion to the U.S. economy. Two of the biggest datasets for economic impact have been global positioning satellite data from the U.S. Space Force and weather data from the National Weather Service. By 2014, all 175 Federal agencies and 77 other organizations had published data on the site, in both human-readable and machine-readable formats and with open APIs. [10]

On January 14, 2019, the OPEN Government Data Act, as part of the Foundations for Evidence-Based Policymaking Act, became law. The OPEN Government Data Act makes Data.gov a requirement in statute rather than a matter of policy. It requires federal agencies to publish their information online as open data, using standardized, machine-readable data formats, with their metadata included in the Data.gov catalog. Data.gov is working with an expanded group of federal agencies to include their datasets in Data.gov as they implement the new law.

Open Government Directive

The U.S. Open Government Directive of December 8, 2009, required that all agencies post at least three high-value data sets online and register them on Data.gov within 45 days. [11]

OPEN Government Data Act

The Foundations for Evidence-Based Policymaking Act of 2018 (“Evidence Act”), signed into law on January 14, 2019, emphasizes collaboration and coordination to advance data and evidence-building functions in the Federal Government by statutorily mandating Federal evidence-building activities, open government data, and confidential information protection and statistical efficiency.

Title II of the Foundations for Evidence-Based Policymaking Act, the OPEN Government Data Act, requires additional agencies to comply with the statute by providing access to free, open, and machine-readable data.

Additionally, the Office of Management and Budget is required to collaborate with the Office of Government Information Services and the Administrator of General Services to develop and maintain an online repository of tools, best practices, and schema standards to facilitate the adoption of open data practices across the Federal Government.
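To make the idea of catalog metadata in a standardized, machine-readable format more concrete, the following is a minimal sketch of a dataset record and a completeness check. The field names (`title`, `description`, `publisher`, `modified`, `distribution`), the sample values, and the helper `missing_fields` are illustrative assumptions, not the schema that Data.gov or the Act prescribes.

```python
import json

# Hypothetical required fields; NOT the official Data.gov schema.
REQUIRED_FIELDS = ("title", "description", "publisher", "modified", "distribution")


def missing_fields(record):
    """Return the required fields that are absent or empty in a record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


# Made-up record serialized as JSON, one common machine-readable format.
SAMPLE = json.loads("""
{
  "title": "Example Rainfall Observations",
  "description": "Illustrative daily rainfall totals.",
  "publisher": "Example Agency",
  "modified": "2019-01-14",
  "distribution": [{"format": "CSV", "url": "https://example.com/rainfall.csv"}]
}
""")

# A record that would fail the check: most fields are missing or empty.
INCOMPLETE = {"title": "Untitled", "publisher": ""}

if __name__ == "__main__":
    print(missing_fields(SAMPLE))
    print(missing_fields(INCOMPLETE))
```

A check of this kind could run before a record is submitted to a catalog, so that incomplete metadata is caught by the publishing agency rather than by the catalog's users.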

Apps

A list of software applications using data from Data.gov can be seen at data.gov/applications.

Related Research Articles

Data set: Collection of data

A data set is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the data set in question. The data set lists values for each of the variables, such as for example height and weight of an object, for each member of the data set. Data sets can also consist of a collection of documents or files.
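As a small illustration of the tabular case described above, the sketch below parses a made-up CSV table with Python's standard library. The column names (`name`, `height_cm`, `weight_kg`), the values, and the helper functions are hypothetical and chosen only to mirror the height-and-weight example.

```python
import csv
import io

# Hypothetical tabular data set: each column is a variable, each row a record.
RAW = """name,height_cm,weight_kg
crate,40,12.5
barrel,90,30.0
"""


def load_records(text):
    """Parse CSV text into a list of records (one dict per row)."""
    return list(csv.DictReader(io.StringIO(text)))


def column(records, variable):
    """Return every value of one numeric variable (column) as floats."""
    return [float(record[variable]) for record in records]


records = load_records(RAW)
heights = column(records, "height_cm")
weights = column(records, "weight_kg")

if __name__ == "__main__":
    for record in records:
        print(record["name"], record["height_cm"], record["weight_kg"])
```

Here `records` holds one entry per member of the data set, while `heights` and `weights` each hold the values of a single variable across all members.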

National Technical Information Service: United States government agency

The National Technical Information Service (NTIS) is an agency within the U.S. Department of Commerce. The primary mission of NTIS is to collect and organize scientific, technical, engineering, and business information generated by U.S. government-sponsored research and development, for private industry, government, academia, and the public. The systems, equipment, financial structure, and specialized staff skills that NTIS maintains to undertake its primary mission allow it to provide assistance to other agencies requiring such specialized resources.

Bureau of Transportation Statistics

The Bureau of Transportation Statistics (BTS), part of the United States Department of Transportation, is a government office that compiles, analyzes, and publishes information on the nation's transportation systems across various modes; and strives to improve the DOT's statistical programs through research and the development of guidelines for data collection and analysis. BTS is a principal agency of the U.S. Federal Statistical System.

The United States federal civil service is the civilian workforce of the United States federal government's departments and agencies. The federal civil service was established in 1871. U.S. state and local government entities often have comparable civil service systems that are modeled on the national system to varying degrees.

Vivek Kundra: American government official

Vivek Kundra is a former American administrator who served as the first chief information officer of the United States from March 2009 to August 2011 under President Barack Obama. He is currently the chief operating officer at Sprinklr, a provider of enterprise customer experience management software based in New York City. He was previously a visiting Fellow at Harvard University.

Aneesh Chopra: American executive

Aneesh Paul Chopra is an American executive who served as the first Chief Technology Officer of the United States. He was appointed in 2009 by President Barack Obama and was at the White House through 2012. Chopra previously served as Virginia's Secretary of Technology under Governor Tim Kaine. Chopra was a candidate in 2013 for the Democratic nomination for Lieutenant Governor of Virginia. He is the author of Innovative State: How New Technologies Can Transform Government (2014) and co-founder and president of CareJourney. In 2015 he joined Albright Stonebridge Group as a senior advisor.

Federal Chief Information Officer of the United States: U.S. government position

The federal Chief Information Officer of the United States, also known as the United States Chief Information Officer, is the administrator of the Office of Electronic Government, or the Office of the Federal CIO (OFCIO), which is part of the Office of Management and Budget. The President appoints the Federal CIO. The appointee does not require Senate confirmation. It was created by the E-Government Act of 2002.

The Office of Social Innovation and Civic Participation was an office new to the Obama Administration, created within the White House, to catalyze new and innovative ways of encouraging government to do business differently. Its first director was the economist Sonal Shah. The final director was David Wilkinson.

data.gov.uk: United Kingdom government portal for sharing non-personal public information

data.gov.uk is a UK Government project to make available non-personal UK government data as open data. It was launched as a closed beta on 30 September 2009, and publicly launched in January 2010. As of February 2015, it contained over 19,343 datasets, rising to over 40,000 in 2017, and more than 47,000 by 2023. data.gov.uk is listed in the Registry of Research Data Repositories re3data.org.

National Transportation Library: U.S. national library

The National Transportation Library (NTL) maintains and facilitates access to information necessary for transportation decision-making in government and coordinates with public and private transportation libraries and information providers to improve information sharing among the transportation community. It is currently under the administration of the Bureau of Transportation Statistics (BTS).

Citizen sourcing is the government adoption of crowdsourcing techniques for the purposes of (1) enlisting citizens in the design and execution of government services and (2) tapping into the citizenry's collective intelligence for solutions and situational awareness.

International Open Government Data Conference: Conference on the subject of open datasets

The International Open Government Data Conference, held from November 15 to November 17, 2010, was a conference on the subject of open datasets globally. It was sponsored by the United States General Services Administration and hosted by the United States Department of Commerce, in conjunction with the United States' previously launched Data.gov.

Nick Sinai: Adjunct faculty and a former senior official in the Obama Administration

Nick Sinai is a venture capitalist, adjunct faculty at Harvard Kennedy School, author, and a former senior official in the Obama Administration.

Open data in the United States refers to the Federal government of the United States' perspectives, policies, and practices regarding open data.

Apps.gov was a cloud storefront run by the U.S. General Services Administration to assist federal agencies in purchasing cloud computing services from the marketplace. The website was initially launched in 2009 under the direction of former Federal Chief Information Officer Vivek Kundra, but was first closed down in 2012 in order to "streamline" procurement and amid reports of low usage. The service was relaunched at the 2016 SXSW festival by a team of Presidential Innovation Fellows following President Obama's keynote address on using technology to improve government. The site has not been available since early 2019, though no official shutdown was ever announced.

A machine-readable document is a document whose content can be readily processed by computers. Such documents are distinguished from more general machine-readable data by virtue of having further structure to provide the necessary context to support the business processes for which they are created.

Open by Default, as widely used in the contexts of Open Government and Open Data, is the principle in which government makes its data accessible to the public by default, unless there is a sufficient justification to explain that greater public interest may be at stake, as a result of disclosure. Since the principle empowers the public's right to know and capacity to oversee government activities, it is closely associated with government transparency, civic engagement, and e-governance in organizing public life. In many cases, the principle is accompanied with the technological commitment to create "metadata standardization for all datasets, publication of a machine-readable data catalogue or inventory of both released and to-be released datasets ... (and) use of open licenses."

The U.S. Commission on Evidence-Based Policymaking was a 15-member agency in the federal government charged by the US Congress and the President with examining how government could better use its existing data to provide evidence for future government decisions.

Foundations for Evidence-Based Policymaking Act: U.S. federal law

The Foundations for Evidence-Based Policymaking Act is a United States law that establishes processes for the federal government to modernize its data management practices, evidence-building functions, and statistical efficiency to inform policy decisions. The Evidence Act contains four parts ("titles"), which address evidence capacity, open data, and data confidentiality.

Richard Y. Wang is the Founder and Executive Director of the Chief Data Officer and Information Quality (CDOIQ) Program at the Massachusetts Institute of Technology. Wang is widely acknowledged as the "Founder of Information Quality"—the scholar who made Information Quality an established field. For the past three decades, he advocated that the importance of information quality must be embraced at the highest level of organizations. He championed and led a movement to establish the position of Chief Data Officers in all organizations. His pioneering work culminated in a wide-scale adoption of the Chief Data Officer role worldwide. Notably, in 2019, the U.S. Congress enacted the Foundations for Evidence-Based Policymaking Act of 2018 into law, which statutorily mandated all federal agencies to establish and appoint a CDO for their agency.

References

  1. "About data.gov". Retrieved 2011-08-21.
  2. "Non-Federal Data - How to Get Your Data on Data.gov". Retrieved 2018-02-03.
  3. Hansell, Saul (2009-03-05). "The Nation's New Chief Information Officer Speaks". The New York Times. Retrieved 2009-04-30.
  4. "Who developed Data.gov?". Data.gov. U.S. General Services Administration.
  5. Aliya Sternstein (1 April 2009). "Kundra's Ideas Shape Book". nextgov (Tech Insider). National Journal Group. Archived from the original (blog posting) on 20 February 2017. Retrieved 16 April 2011.
  6. Van Buskirk, Eliot (2010-05-19). "Sneak Peek: Obama Administration's Redesigned data.gov". Wired Epicenter. Retrieved 2010-05-20.
  7. "The Governance Lab". thegovlab.org. Retrieved 2022-07-07.
  8. "Executive Order -- Making Open and Machine Readable the New Default for Government Information". whitehouse.gov. 2013-05-09. Retrieved 2022-07-08.
  9. "How government can promote open data | McKinsey". www.mckinsey.com. Retrieved 2022-07-07.
  10. "Five Years of Open Data—Making a Difference". Data.gov. 2014-05-20. Retrieved 2022-07-08.
  11. Orszag, Peter R. (8 December 2009). "Open Government Directive". Executive Office of the President.