The Geographic Names Information System (GNIS) is a database of name and location information about more than two million physical and cultural features throughout the United States and its territories, Antarctica, and the associated states of the Marshall Islands, Federated States of Micronesia, and Palau. It is a type of gazetteer. It was developed by the United States Geological Survey (USGS) in cooperation with the United States Board on Geographic Names (BGN) to promote the standardization of feature names.
The database is part of a system that includes topographic map names and bibliographic references. The names of books and historic maps that confirm the feature or place name are cited. Variant names, alternatives to official federal names for a feature, are also recorded. Each feature receives a permanent, unique feature record identifier, sometimes called the GNIS identifier. The database never removes an entry, "except in cases of obvious duplication."
The GNIS was originally designed for four major purposes: to eliminate duplication of effort at various other levels of government that were already compiling geographic data, to provide standardized datasets of geographic data for the government and others, to index all of the names found on official U.S. government federal and state maps, and to ensure uniform geographic names for the federal government.
Phase 1 lasted from 1978 to 1981, with a precursor pilot project run over the states of Kansas and Colorado in 1976, and produced 5 databases. It excluded several classes of feature because they were better documented in non-USGS maps, including airports, the broadcasting masts for radio and television stations, civil divisions, regional and historic names, individual buildings, roads, and triangulation station names.
The databases were initially available on paper (2 to 3 spiral-bound volumes per state), on microfiche, and on magnetic tape encoded (unless otherwise requested) in EBCDIC with 248-byte fixed-length records in 4960-byte blocks.
The feature classes for association with each name included (for examples) "locale" (a "place at which there is or was human activity" not covered by a more specific feature class), "populated place" (a "place or area with clustered or scattered buildings"), "spring" (a spring), "lava" (a lava flow, kepula, or other such feature), and "well" (a well). Mountain features would fall into "ridge", "range", or "summit" classes.
A feature class "tank" was sometimes used for lakes, which was problematic in several ways. This feature class was undocumented, and it was (in the words of a 1986 report from the Engineer Topographic Laboratories of the United States Army Corps of Engineers) "an unreasonable determination", with the likes of Cayuga Lake being labelled a "tank". The USACE report assumed that "tank" meant "reservoir", and observed that often the coordinates of "tanks" were outside of their boundaries and were "possibly at the point where a dam is thought to be".
National Geographic Names database
The National Geographic Names database (NGNDB hereafter) was originally 57 computer files, one for each state and territory of the United States (except Alaska which got two) plus one for the District of Columbia. The second Alaska file was an earlier database, the Dictionary of Alaska Place Names that had been compiled by the USGS in 1967. A further two files were later added, covering the entire United States and that were abridged versions of the data in the other 57: one for the 50,000 most well known populated places and features, and one for most of the populated places. The files were compiled from all of the names to be found on USGS topographic maps, plus data from various state map sources.
In phase 1, elevations were recorded in feet only, with no conversion to metric, and only if there was an actual elevation recorded for the map feature. They were of either the lowest or highest point of the feature, as appropriate. Interpolated elevations, calculated by interpolation between contour lines, were added in phase 2.
Names were the official name, except where the name contained diacritic characters that the computer file encodings of the time could not handle (which were in phase 1 marked with an asterisk for update in a later phase). Generic designations were given after specific names, so (for examples) Mount Saint Helens was recorded as "Saint Helens, Mount", although cities named Mount Olive, not actually being mountains, would not take "Mount" to be a generic part and would retain their order "Mount Olive".
The primary geographic coordinates of features which occupy an area, rather than being a single point feature, were the location of the feature's mouth, or of the approximate center of the area of the feature. Such approximate centers were "eye-balled" estimates by the people performing the digitization, subject to the constraint that centers of areal features were not placed within other features that are inside them. alluvial fans and river deltas counted as mouths for this purpose. For cities and other large populated places, the coordinates were taken to be those of a primary civic feature such as the city hall or town hall, main public library, main highway intersection, main post office, or central business district regardless of changes over time;[a] these coordinates are called the "primary point".[b]
Secondary coordinates were only an aid to locating which topographic map(s) the feature extended across, and were "simply anywhere on the feature and on the topographic map with which it is associated". River sources were determined by the shortest drain, subject to the proxmities of other features that were clearly related to the river by their names.
USGS Topographic Map Names database
The USGS Topographic Map Names database (TMNDB hereafter) was also 57 computer files containing the names of maps: 56 for 1:24000 scale USGS maps as with the NGNDB, the 57th being (rather than a second Alaska file) data from the 1:100000 and 1:250000 scale USGS maps. Map names were recorded exactly as on the maps themselves, with the exceptions for diacritics as with the NGNDB.
The Generic database was in essence a machine-readable glossary of terms and abbreviations taken from the map sources, with their definitions, grouped into collections of related terms.
National Atlas database
The National Atlas database was an abridged version of the NGNDB that contained only those entries that were in the index to the USGS National Atlas of the United States, with the coördinates published in the latter substituted for the coördinates from the former.
Board on Geographic Names database
The Board on Geographic Names database was a record of investigative work of the USGS Board on Geographic Names' Domestic Names Committee, and decisions that it had made from 1890 onwards, as well as names that were enshrined by Acts of Congress. Elevation and location data followed the same rules as for the NGNDB. So too did names with diacritic characters.
Phase 2 was broader in scope than phase 1, extending the scope to a much larger set of data sources. It ran from the end of phase 1 and had managed to completely process data from 42 states by 2003, with 4 still underway and the remaining 4 (Alaska, Kentucky, Michigan, and New York) awaiting the initial systematic compilation of the sources to use.
Many more feature classes were included, including abandoned Native American settlements, ghost towns, railway stations on railway lines that no longer existed, housing developments, shopping centers, and highway rest areas.
The actual compilation was outsourced by the U.S. government, state by state, to private entities such as university researchers.
Antarctica Geographic Names database
The Antarctica Geographic Names database (AGNDB hereafter) was added in the 1990s and comprised records for BGN-approved names in Antarctica and various off-lying islands such as the South Orkney Islands, the South Shetland Islands, the Balleny Islands, Heard Island, South Georgia, and the South Sandwich Islands. It only contained records for natural features, not for scientific outposts.
The media on which one could obtain the databases were extended in the 1990s (still including tape and paper) to floppy disc, over FTP, and on CD-ROM. The CD-ROM edition only included the NGNDB, the AGNDB, the GCNDB, and a bibliographic reference database (RDB); but came with database search software that ran on PC DOS (or compatible) version 3.0 or later. The FTP site included extra topical databases: a subset of the NGNDB that only included the records with feature classes for populated places, a "Concise" subset of the NGNDB that listed "major features", and a "Historical" subset that included the features that no longer exist.
There is no differentiation amongst different types of populated places. In the words of the aforementioned 1986 USACE report, "[a] subdivision having one inhabitant is as significant as a major metropolitan center such as New York City".
In comparing GNIS populated place records with data from the Thematic Mapper of the Landsat program, researchers from the University of Connecticut in 2001 discovered that "a significant number" of populated places in Connecticut had no identifiable human settlement in the land use data and were at road intersections. They found that such populated places with no actual settlement often had "Corner" in their names, and hypothesized that either these were historical records or were "cartographic locators". In surveying in the United States, a "Corner" is a corner of the surveyed polygon enclosing an area of land, whose location is, or was (since corners can become "lost" or "obliterated"), marked in various ways including with trees known as "bearing trees" ("witness trees" in older terminology) or "corner monuments".
From analysing Native American names in the database in order to compile a dictionary, professor William Bright of UCLA observed in 2004 that some GNIS entries are "erroneous; or refer to long-vanished railroad sidings where no one ever lived". Such false classifications have propagated to other geographical information sources, such as incorrectly classified train stations appearing as towns or neighborhoods on Google Maps.
The usual sources of name change requests are an individual state's board on geographic names, or a county board of governors. This does not always succeed, the State Library of Montana having submitted three large sets of name changes that have not been incorporated into the GNIS database.
Conversely, a group of middle school students in Alaska succeeded, with the help of their teachers, a professor of linguistics, and a man who had been conducting a years-long project to collect Native American placenames in the area, in changing the names of several places that they had spotted in class one day and challenged for being racist, including renaming "Negrohead Creek" to an Athabascan name Lochenyatth Creek and "Negrohead Mountain" to Tl'oo Khanishyah Mountain, both of which translate to "grassy tussocks" in Lower Tanana and Gwichʼin respectively. Likewise, in researching a 2008 book on ethnic slurs in U.S. placenames Mark Monmonier of Syracuse University discovered "Niger Hill" in Potter County, Pennsylvania, an erroneous transcription of "Nigger Hill" from a 1938 map, and persuaded the USBGN to change it to "Negro Hill".
Removal of racial and ethnic slurs
In November 2021, the United States Secretary of the Interior issued an order instructing that "Squaw" be removed from usage by the U.S. federal government. Prior efforts had included a 1962 replacement of the "Nigger" racial pejorative for African Americans with "Negro" and a 1974 replacement of the "Jap" racial pejorative for Japanese Americans with "Japanese".
In 2015, a cross-reference of the GNIS database against the Racial Slur Database had found 1441 racial slur placenames, every state of the United States having them, with California having 159 and the state with the most such names being Arizona. One of the two standard reference works for placenames in Arizona is Byrd Howell Granger's 1983 book Arizona's Names: X Marks the Place, which contains many additional names with racial slurs not in the GNIS database. Despite "Nigger" having been removed from federal government use by Stewart Udall, its replacement "Negro" still remained in GNIS names in 2015, as did "Pickaninny", "Uncle Tom", and "Jim Crow" and 33 places named "Niggerhead". There were 828 names containing "squaw", including 11 variations on "Squaw Tit" and "Squaw Teat", contrasting with the use of "Nipple" in names with non-Native American allusions such as "Susies Nipple".
- The United States Census Bureau (USCB) defines Census Designated Places as a subset of locations in the National Geographic Names Database.
- United States Postal Service (USPS) Publication 28 gives standards for addressing mail. In this publication, the postal service defines two-letter state abbreviations, street identifiers such as boulevard (BLVD) and street (ST), and secondary identifiers such as suite (STE).
- Canadian Geographical Names Database (GNBC), a similar, but non-public-domain, database for locations within Canada only
- GEOnet Names Server (GNS), a similar database for locations outside the United States
- United Nations Conference on the Standardization of Geographical Names
- "Additional guidelines were suggested for determining the center of large populated places, such as the location of the city hall or town hall, main post office, main library, central business district, or main intersection." 
- "Primary Point: The official feature location is a single point to which the official feature name is associated in order to ensure positive and unique identification and association, also referred to as the primary point. The location is determined by the authoritative source and is approved or recognized as official by the U.S. Board on Geographic Names by decision or by policy delegation to the authoritative source. The placing of the location point is governed by policies of the Board as defined in the Names Data Users Guide and GNIS metadata, generally at or near the geographic center, but there are exceptions for certain classes of features." "Question 17: In the GNIS database, the Primary coordinate values for communities are taken at the center of the "original" community meaning the city hall, main post office, main intersection, etc. "Primary Point: The official feature location is a single point to which the official feature name is associated in order to ensure positive and unique identification and association, also referred to as the primary point. The location is determined by the authoritative source and is approved or recognized as official by the U.S. Board on Geographic Names by decision or by policy delegation to the authoritative source. The placing of the location point is governed by policies of the Board as defined in the Names Data Users Guide and GNIS metadata, generally at or near the geographic center, but there are exceptions for certain classes of features. For example, the official feature location of flowing bodies of water (streams, rivers, creeks, etc.) or trending linear features (valleys, gulchs, gullys, hollows, etc.) is at the mouth. A geographic feature may have only one official location regardless of size, extent, composition, structure, or boundaries. The location point is coincident with, but in addition to and independent of, any other geospatial representation or boundary definition that may be attached to the feature in other datasets. Linear and aerial features may have secondary points as defined in the Names Data Users Guide and GNIS metadata. Locations are stored in the Geographic Names Information System as latitude and longitude in decimal degrees to seven places."
- Monmonier 2008, p. 30.
- Monmonier 2008, p. 31.
- "United States Census County Based TIGER/Line 2009 Data Dictionary: Entity, Joins, Attributes and Domains". Archived from the original on 27 June 2014. Retrieved 27 June 2014.
- Cartographic Users Advisory Council (CUAC) (26–27 April 2007). 2007 Agency Presentation Minutes. Reston, VA: United States Geological Survey (USGS). Archived from the original on 11 January 2014.
- Payne 1983, p. 1.
- Payne 1983, pp. 1, 3.
- Payne 1985, p. 2.
- Payne 1983, p. 18.
- Payne 1985, pp. 19–20.
- Payne 1983, p. 20–22.
- Monmonier 2008, p. 32.
- Heard 1986, p. 25.
- Payne 1983, p. 3.
- Payne 1985, p. 4.
- Payne 1983, p. 4.
- Payne 1983, p. 6.
- Payne 1983, p. 5.
- Heard 1986, pp. 4–5.
- "Geographic Names Information System (GNIS): Domestic Names - Metadata". U.S. Geological Survey. 2009-08-18. Archived from the original on 2012-02-06. Retrieved 2023-07-30.
- "Domestic Names - Frequently Asked Questions (FAQs) #17". U.S. Geological Survey. 2014-10-17. Archived from the original on 2015-03-18. Retrieved 2023-07-30.
- "GNIS States, Territories, Associated Areas of the United States Text Format for National tabular digital data". U.S. Geological Survey. 2017. Archived from the original on 2023-07-30. Retrieved 2023-07-30.
- Payne 1985, p. 7.
- Heard 1986, p. 4.
- USGS 1998, p. 1.
- Payne 1983, p. 8.
- Payne 1983, p. 9.
- Payne 1983, p. 11.
- Payne 1983, p. 13.
- Payne 1983, p. 14.
- USGS 1998, p. 2.
- Heard 1986, p. 12.
- McEathron et al. 2001, p. 5.
- BLM 1980, p. 31, Lost corner.
- BLM 1980, p. 37, Obliterated corner.
- BLM 1980, p. 7, Bearing tree.
- BLM 1980, pp. 62–63, Witness tree.
- BLM 1980, p. 13, Corner.
- Bright 2004, p. 3.
- Schultz, Isaac (2019-10-15). "The Brief, Baffling Life of an Accidental New York Neighborhood". Atlas Obscura. Retrieved 2022-05-06.
- Brown, Reznik & Gilat 2015.
- Smetzer 2012.
- Monmonier 2008, pp. 33–34.
- Haaland 2021.
- Nuessel 2019, p. 188.
- STARL 2017.
- Payne, Roger L. (1983). McEwen, Robert B.; Winter, Richard E.; Ramey, Benjamin S. (eds.). Geographic Names Information System (PDF). Geological Survey Circular. United States Geological Survey. 895-F.
- Payne, Roger L. (1985). Geographic Names Information System: Data Users Guide (6 ed.). Reston, Virginia: United States Geological Survey.
- Monmonier, Mark (2008). From Squaw Tit to Whorehouse Meadow: How Maps Name, Claim, and Inflame. University of Chicago Press. ISBN 9780226534640.
- Geographic Names Information System (PDF). Fact Sheet. United States Geological Survey. August 1998. 127-95.
- Heard, Andrew M. (August 1986). Automatic correlation of USGS digital line graph geographic features to GNIS names data (PDF). United States Army Corps of Engineers. AD-A 192 787.
- McEathron, Scott R.; McGlamery, Patrick; Shin, Dong-Guk; Smith, Ben; Su, Yuan (August 2001). Naming the Landscape: Building the Connecticut Digital Gazetteer (PDF). 67th IFLA Council and General Conference August 16–25, 2001. ED 459 759.
- Glossaries of BLM Surveying and Mapping Terms (2nd ed.). United States Department of the Interior, Bureau of Land Management. 1980.
- Haaland, Deb (2021-11-19). "Order number 3404" (PDF). Washington.
- Brown, Jennings; Reznik, Tal; Gilat, Matan (2015-10-29). "Racial Slurs Are Woven Deep Into The American Landscape". vocativ. Archived from the original on 2019-10-30.
- The State Library of Arizona (2017-09-20). "Researching Arizona's place names". Blog of the State of Arizona Research Library.
- "Data Construction". Montana State Library. Retrieved 2022-04-23.
- Nuessel, Frank (2019). "Ethnophaulic toponyms in the United States". In Felecan, Oliviu (ed.). Onomastics between Sacred and Profane. Series in Language and Linguistics. Vernon Press. ISBN 9781622734016.
- Bright, William (2004). Native American Placenames of the United States. University of Oklahoma Press. ISBN 9780806135984.
- Smetzer, Mary (2012-06-04). "Students take action to remove racist place names from map". Ahcorage Daily News.
- Orth, Donald J.; Payne, Roger L. (1987). "The National Geographic Names Data Base: Phase II instructions". Circular. Geological Survey Circular. Vol. 1011. United States Geological Survey. doi:10.3133/cir1011. ISSN 1067-084X.
- United States Department of the Interior (DOI), Digital Gazeteer: Users Manual, (Reston, Virginia: United States Geological Survey (USGS), 1994).
- Least Heat Moon, William, Blue Highways: A Journey Into America, (Boston: Little Brown and Company, 1982). ISBN 0-316-35329-9
- Jouris, David, All Over The Map, (Berkeley, California: Ten Speed Press, 1994.) ISBN 0-89815-649-1
- Report: "Countries, Dependencies, Areas of Special Sovereignty and Their Principal Administrative Divisions", Federal Information Processing Standards (FIPS), FIPS 10-4. Standard was withdrawn in September 2008, See Federal Register (FR) Notice: Vol. 73, No. 170, page 51276 (September 2, 2008)
- Report: "Principles, Policies and Procedures: Domestic Geographic Names", United States Board on Geographic Names (BGN), 1997.
- United States Postal Service Publication 28.
- Soranno, Patricia A.; Webster, Katherine E.; Smith, Nicole J.; Díaz Vázquez, Jessica; Spence Cheruvelil, Kendra (2020-02-03). "What Is in a "Lake" Name? That Which We Call a Lake by Any Other Name". Limnology and Oceanography Bulletin. Association for the Sciences of Limnology and Oceanography. 29 (1): 1–7. doi:10.1002/lob.10355. S2CID 214102434.
- Shelley, Fred M. (2019-10-23). "The Board of Geographic Names and the Removal of Derogatory and Offensive Toponyms in the United States". In Brunn, S.; Kehrein, R. (eds.). Handbook of the Changing World Language Map. Springer. pp. 2097–2106. doi:10.1007/978-3-030-02438-3_177. ISBN 9783030024383.
- Vaughan, Champ Clark (2008). "The Oregon Geographic Names Board: One Hundred Years of Toponymic Nomenclature". Oregon Historical Quarterly. 109 (3): 412–433. doi:10.1353/ohq.2008.0017. JSTOR 20615877. S2CID 165705955.