Data tools

Software tools and complementary resources to support the hands-on work of data practitioners

/Developer program

The /Developer program provides support to federal agencies engaged in the production or use of APIs.

Source

General Services Administration

Keywords

API

CKAN

Comprehensive Knowledge Archive Network (CKAN) is a powerful open source data management system that makes data accessible by providing tools to streamline publishing, sharing, finding, and using data. The Data.gov catalog is based on CKAN, a technology that powers many government open data sites. Inventory.data.gov is a separate instance of CKAN hosted at GSA on the same infrastructure as the Data.gov catalog.

Source

ckan.org

Keywords

data schema, data management, data inventory, open data

CSV to API

The CSV to API tool dynamically generates RESTful APIs from static CSVs. It provides JSON, XML, and HTML formats.

Source

Project Open Data

Keywords

API

DKAN

DKAN is community-driven, Drupal-based, free, and open source open data platform that gives organizations and individuals ultimate freedom to publish and consume structured information.

Source

drupal.org

Keywords

open data

Data Visualization Wizard

The Data Visualization Wizard provides a fast way to get data visualizations online using Drupal and uploaded spreadsheets.

Source

drupal.org

Keywords

data visualization

Database to API

The Database to API tool turns a Database into a Secure, RESTful API. It provides JSON, XML, and HTML formats.

Source

Project Open Data

Keywords

API

Digital Analytics Program (DAP)

The Digital Analytics Program (DAP) provides a web analytics tool for measuring digital services in the federal government. Executive branch federal agencies are required to implement DAP on all public-facing federal websites.

Source

digital.gov

Keywords

data analytics

Esri2Open

Esri2Open is an Esri toolbox and tool(s) that exports Esri Feature Classes to open data formats, CSV, JSON, and GeoJSON. Runs inside the Esri ArcGIS desktop suite.

Source

Project Open Data

Keywords

geospatial

GeoNode

GeoNode is a web-based application and platform for developing geospatial information systems (GIS) and for deploying spatial data infrastructures (SDI). It is designed to be extended and modified, and can be integrated into existing platforms.

Source

geonode.org

Keywords

geospatial

Geoportal Server

Geoportal Server allows you to catalog the locations and descriptions of your organization's geospatial resources in a central repository called a geoportal, which you can publish to the Internet or your intranet.

Source

Esri

Keywords

geospatial

Home Mortgage Disclosure Act (HMDA) Tools

The Home Mortgage Disclosure Act (HMDA) is a law that requires financial institutions to maintain and annually disclose data about home purchases, home purchase pre-approvals, home improvement, and refinance applications. These tools from the Consumer Financial Protection Bureau (CFPB) make analyzing mortgage application data much easier.

Source

Consumer Financial Protection Bureau

Keywords

data analytics, data analysis

JSON Validator

JSON Validator is used to check your agency’s data.json against the required federal metadata schema.

ReVal (Reusable Validation Library)

ReVal (Reusable Validation Library) is a customizable Django App for validating data via API.

Source

General Services Administration

Keywords

API, data validation

Standard Application Process Pilot- Lessons Learned Report

The Standard Application Process (SAP) Pilot portal serves as a single access point for data users to request access to restricted use data held by Federal statistical agencies covered under the Confidential Information Protection and Statistical Efficiency Act (CIPSEA). This fulfills a responsibility under Action Step 16 of the Federal Data Strategy 2020 Action Plan.

Source

U.S. Census Bureau- Federal Statistical Research Data Center (FSRDC)

Keywords

federal-data-strategy, icsp, data-access, process-redesign

Format

PDF (23 pages)

Tabula

Tabula is a tool for liberating data tables locked inside PDF files.

Source

Tabula

Keywords

data cleaning

inventory.data.gov

Inventory.data.gov is a tool for federal agencies to create and publish metadata catalogs. Login.gov is required to access this link.

Source

General Services Administration

Keywords

data schema, data management, data inventory, inventory.data.gov

pycsw

pycsw is an OGC CSW server implementation written in Python that allows for the publishing and discovery of geospatial metadata via numerous APIs.

Source

pycsw.org

Keywords

geospatial, API, metadata

qu

qu is an open source data platform created to serve the public data sets of the Consumer Financial Protection Bureau. The goals of this platform are to import data in a Google-Dataset-inspired format, Query data using a Socrata-Open-Data-API-inspired API, and export data in JSON or CSV format.

Source

Consumer Financial Protection Bureau

Keywords

open data, API, data cleaning, data analysis