This page contains links to products from 2000 Census that are available at Columbia or on public Internet sites. Many of the resources at Columbia have value-added features for ease-of-use or for handling complex requests.
Data Sources and Software |
Overview of Products |
General Information |
|---|---|---|
|
|
|
Data Sources and Software
- Online Data Extraction
-
American FactFinder
At this site you can build custom tables from the summary files, obtain demographic profiles, and use the mapping feature for locating and analyzing data. The extraction software is not as powerful as that on DVD/CD products - New York City
Each of these sources reports on sub-city areas including tracts and community districts.- InfoShare (interactive site for creating custom tables)
- New York City Planning Department
- New York City Planning Department Census Tables (Excel tables reporting socio-demographic characteristics)
- New York City Planning, Newest New Yorkers 2000 (Excel tables analyzing the NYC foreign born population).
-
National Historical Geographic Information (NHGIS)
Provides an online extraction tool for selected summary file variables at the national, state, county, tract, metropolitan area, and primary metropolitan area levels allowing for multiple locations at one level of geography to be downloaded at once. Downloads of a single area for other census geographies are available. -
New York State Data Center
Provides summarized data for the state, counties, communities, and school districts. Demographic profiles are available in csv format. -
Department of Education, School District Demographics
Select and extract data tables through the web interface at this site, view maps, or download state-wide data in comma delimited format. - Advanced Query System
This is a web interface for constructing custom tabulations from responses to the long form questionnaire. Tabulations, that meet the systems confidentiality standards, can be done using any standard geographic areas (block groups, tracts, counties, cities, etc.) that have minimum population of 200 persons. Access to this site is restricted to authorized users so those interested should contact eds@columbia.edu. -
Integrated Public Use Microdata Series (IPUMS)
At this site custom extractions of public use microdata can be done. (If working with NYC, see the DataGate options listed below
-
American FactFinder
- GIS and Mapping Products
- Resources in EDS
Boundary files based on Census products are available from many sources. The EDS spatial data collection has many options at all levels of U.S. geography.
-
U.S. Census Bureau's Cartographic Boundary Files
This site has boundary files for selected generalized extracts, from the Census Bureau's TIGER geographic database, that are designed for use in a Geographic Information System (GIS) mapping systems. - TIGER® Products
The TIGER® Line Files are the basis for all mapping products. Refer to the the Census Bureau's, TIGER page for an explanation of your options for obtaining these files and the other mapping products derived from them. - Thematic Mapping
Using Allocate or Geolytics CD-ROM/DVD products described above, you can extract and prepare data for use in GIS software. In addition to data extraction, Geolytics CensusCd+Maps products can display thematic maps. Below is a list of guides for preparing data for GIS software.
- Resources in EDS
- CD/DVD Products on EDS Network
- Summary File Products from Bureau of Census
Summary files are published on DVD/CD-ROM together with a powerful Windows-based extraction program for making both simple and complex data requests. Files currently available on DVD/CD-ROM include:
- Summary File 1
- Summary File 2
- Summary File 3
- 108th Congressional Districts Summary File
- Geolytics CensusCd 2000 Products
Summary file data published on CD-ROM with a Windows-based extraction program for making simple data requests. Has a feature for creating files for importing in to GIS applications. Files currently available are:
- 2000 Short Form Blocks (block level data)
- 2000 Short Form - SF1 (data to the block group level).
- Public Use Microdata (PUMS)
PUMS data on CD-ROM come with a Windows-based extraction program that simplifies data selection and weighting.- PUMS 1% sample
- PUMS 5% sample
- Summary File Products from Bureau of Census
- Files for FTP
Because of the size of these files, locally available sources either in DataGate or in EDS on DVD/CD, may be easier to use. Also the structure of the summary files is complex with retrieval of data for a single geography requiring accessing and merging data from multiple files. Extraction software on the DVD/CD products and at American FactFinder can meet most needs.
-
DataGate
Searching for EDS Study ID# 2000 will generate a list of titles for 2000 products. Most titles will reference non-ftp formats (urls or CDs) since these are the usually the most efficient way to access the data. One exception is the PUMS 5%, Study 2000-PA. This study contains files that are subset of census microdata, a file for each of the five borough and one for all of NYC. -
Census Bureau's Census 2000 Gateway
Links to each of the available data products are listed in the " "Data Releases" list. Links to the American FactFinder interface for creating custom tables are also given. -
ICPSR Census Home Page
Committed to archiving all the Census 2000 files, the full files, program code (SAS and SPSS), meta-data, and tips for working with the files are found here. There is a time lag between when files appear here and when they are first published on the Bureau's site.
-
DataGate
Overview of Products
This briefly introduces some key data products. Each product may come in a variety of formats so refer to the Data Sources section of this guide for a list of the options offered here at Columbia. The Census 2000 Gateway page has information on all the 2000 products and details about the options for public access.
- Selected Products From the Short Form
Counts reported from the short form represent exact counts as these questions were asked of 100% of the population. Refer to the Census Bureau's overview page for details on topics covered are few. They are reported on the following products:- Re-Districting File (PL-94)
Selected topics tabulated for the purpose of congressional reapportionment. - Summary File 1 (SF1)
Data for all topics tabulated for the nation and by basic race categories and hispanic/non-hispanic populations. - Summary File 2 (SF2)
Data for all topics tabulated for the nation and by detailed race categories and hispanic/non-hispanic populations. - Demographic Profiles (DP1)
Preformatted reports summarizing, for named geographic locations, data reported on the short form. Data are derived from SF1.
- Re-Districting File (PL-94)
- Selected Products From the Long Form
Counts reported from the long form represent estimated counts based on the large sample of the population asked to complete the long form. The topics cover those asked on the short form plus a wide range socio-demographic topics. For details see the Census Bureau's overview page. They are reported on the following products:- Summary File 3 (SF3)
Data for all topics tabulated for the nation and by basic race categories and hispanic/non-hispanic populations. - Summary File 4 (SF4)
Data for all topics tabulated for the nation for 336 population groups (groups included detailed race and hispanic origin groups, and selected ancestry). - Demographic Profiles
Preformatted reports summarizing, for named geographic locations, data reported on the short form. Data are derived from SF3. - Microdata
The responses from individual questionnaires are released in two formats: Public Use Microdata (PUMS) files and through a web search interface. For details about microdata, refer the Census 2000 Microdata overview page.- PUMS 1% - a sample of 1% of the housing units that filled out the long form questionnaire.
- PUMS 5% - a sample of 5% of the housing units that filled out the long form questionnaire.
- Advanced Query System - web interface that will allow for custom tabulations based of all long form responses.
- County-To-County Worker Flow Files
These files are tabulations done on the answer to the question where people worked. At the county level they analyze both where workers live and where residents work.
- Summary File 3 (SF3)
- 108th Congressional District Summary Files
The tabulations are based on the newly defined boundaries for the 108th Congressional Districts (tabulations for 106th boundaries are in the other SF products).
All the geographies supported by the Census Bureau have published in their TIGER® Line Files product. These are the basis for all Census, and many commercial, map products. The types of products include.
- US Census Bureau TIGER® Line Files
- Cartographic Products (selected files in GIS formats.) developed from the TIGER® Line Files
- Maps for Viewing and Printing
A list of all 2000 Data products with their release date, medium in which they are published, and their geographic coverage.
General Information
-
Documentation for the Summary File Tables
(Geography Definitions are Appendix A and Subject Definitions are Appendix B) - Complete Technical Documentation (PDF format)
- Census 2000 Microdata Overview
- Geographic Terms and Concepts
- NY County FIPS Codes - Census 2000
- Metropolitan Areas Definitions (current and past lists of MSAs)
- Geographic Correspondence Engine with Census 2000 Geography
- New York State School District Codes
- Place, County Subdivision, and ZCTA Codes
- Obtaining Data by Congressional District
- Race and Hispanic Origin
- Sources for Data on Ethnicity
- Coding of Race (from the Population Studies Center, U. of Michigan)
- Tabulating Data on Race and Hispanic Origin from the 2000 Census
- Understanding Census 2000: Race Category Changes & Comparisons
- Comparing 1990 and 2000
-
Columbia Library Holdings
Includes print, CD/DVD, and Internet data resources, plus reference guides and links to useful sites and sources that summarize the results. - Training

