A Working Definition
of Geocoding:
Making data "mappable".
Before geocoding:
Name
Address
Zip
Bill Smith
123 Orca St
98000
After geocoding:
The Technical Process
of Geocoding:
Matching records in
two databases: your database (without map coordinates but with
some street, Zip Code or other geographic reference)
and a reference map (*with* map coordinates).
Geocoding software (e.g., MapInfo Pro, MapMarker,
or Envinsa) links
records in the two databases by matching text fields: e.g., zip
codes, street names and address
numbers.
When matched to a
reference map, your records are tagged with the map positions,
typically lat-lon coordinates.
Typically, after geocoding, your data contains its own position
information and can be mapped without the reference street map or address dictionary.
MapInfo Pro hides map coordinates of a geocoded
table:
i.e., it does not display the Lat-Lon values in the table by default. But
the coordinates are there behind the scenes.
Before geocoding:
Name
Address
Zip
XCoord
YCoord
Bill Smith
123 Orca St
98000
Blank
Blank
After geocoding:
Name
Address
Zip
XCoord
YCoord
Bill Smith
123 Orca St
98000
-122.345678
47.234567
Why geocoding precision matters
Precision options
You can map data at many levels of spatial precision
depending on your data and your available maps or geocoding software:
Zip Code (+/- 1 mile);
Zip + 4 Code (+/- 1 block);
Street Address (approx. position on block);
Land Parcel (centered on real estate parcel);
Building (in building footprint)
Exact (hydrant, electric meter, mail box, building
doorway, gate, etc.)
Geocoding precision affects analytic
precision
Maps showing values by Zip Code paint a very coarse picture compared to a
address-level point map.
After geocoding by Zip Code
After geocoding by address
We may make assumptions about households based
on the characteristics of the neighborhoods where they live. If
geocoded by street address, we can associate household records with
smaller, more homogenous areas such as block groups. This
results in more accuracy.
As shown below, the
demographics of block group areas can be dramatically different than the demographics of
Zip Codes.
Higher income Zip Codes
Higher income block groups
Spatial precision: What's "correct" depends on your
application
What is the appropriate geocoding precision for
projects such as the following?
Health: Mapping and analyzing national patterns of
chicken pox?
Public safety: Planning volcano evacuation routes?
Planning electric meter reader routes?
Transit: Determining number of residents within
200 meters of a proposed bus route.
Door-to-door delivery: Creating efficient route
maps for newspaper or UPS deliveries?
Retail: Choosing best location for new clinic,
bank branch or retail store?
Under-appreciated reason for geocoding
precision -
Summarizing data by custom regions.
Which district office should handle the new client? Geocode
client addresses by street address and see which territory region they fall within.
Are capital improvement projects benefiting all legislative
districts equally? Find out by geocoding the street addresses of the capital projects then
overlaying the legislative districts.
Banks and insurance companies often report statistics by census
tract. To determine the census tract for a loan or depositor, geocode
the street addresses, overlay the tract map, then use Query > SQL
Select to summarize by tract.
Geocoding: Part of modern corporate customer information systems
More and more Fortune 1000 corporations geocode all
their customer addresses as a standard practice. By making map position
part of their overall corporate information system design, all departments take advantage of spatial analysis techniques.
Before
buying new geocoding software, understand the three technical "legs" of the
geocoding "stool": data, algorithms, and interface.
Data
To what street map or other map content does the software match your data to?
I.e., how good is the underlying "address dictionary"? Is it updated
quarterly as streets are built? Does it place the streets in the
right place on the map? Does it have current street names?
Does it have alternative street names? Can you edit/add your own
data to the prepackaged out-of-the box data? Is data available
nationally off-the-shelf? Internationally?
Algorithms & features
How effectively does it standardize and match your addresses to its
internal address dictionary? Can it optionally fall-back to Zip + 4 or
other less precise matches, if exact match is impossible? How fast
is it? Is there a scoring system for ranking possible matches?
A user-configurable emphasis on exact match of Zip Code, City name or House
Number?
User interface
How easy is it to use? Must you translate your data to/from a
proprietary format or can you geocode addresses "in place" in SQL Server
or other DBMS? What is the interface for deciding among "close
match" alternatives, when there is no exact match? Can you call
geocoding functions programmatically from SQL Server, MapBasic, or
even batch files?
Non-technical comparison factors
Price
# of Transactions/year?
Server or desktop use?
# of users who geocode?
License rights
Do you need to distribute geocoded data outside your organization?
Maintenance required
How much work is it to update the address dictionaries or other data?
MapMarker is a standalone geocoder software program, introduced by
MapInfo Corp. in 1996. Version 14.2 was released in February 2009.
It runs independent of MapInfo Pro and is optimized to do one thing:
geocode quickly.
It is a combined software & data product: i.e., it comes with its
own street address reference data files (aka "address dictionary"),
and regular data updates.
It is available in three versions, each with different pricing and
different levels of precision: MapMarker Zip + 4, MapMarker, and
MapMarker Plus.
Advantages of MapMarker
Cheaper, except for single-county applications.
One-step geocoding, even if addresses are in many different
counties across the US.
Cleans and freshens addresses, Zip Codes, & Zip + 4 Codes on
the fly. MapMarker Plus produces USPS "CASS-certified" mailing lists.
[CASS-certified means lower postage rates for bulk mailers.]
Includes Zip + 4 reference files and can automatically compute
the Zip + 4 code and geocode to the Zip + 4 location as a fallback, if it cannot match to exact street
address.
Very fast: Can process 1,000,000s of records/hour.
Appends census tract, block group, and block codes to the
address record as part of its single-pass processing. No separate SQL processing
required.
Friendly, straightforward user-interface. Easy to use.
Is engineered as a true client-server application and
can be used as the geocoding engine for a web server, for example.
More sophisticated "rule based" matching
algorithms are better than those in MapInfo Professional. Offers a better set of
"close-match" candidates than MapInfo Pro.
Can geocode address lists "in place" when they are stored in a remote server database
(Oracle, Informix, etc.)
Address and street data is not directly user-editable.
However, power users can build & compile their own custom
MapMarker-format databases and use them to supplement MapMarker's
standard databases.
Can process sets of address files, running in
"batch" mode.
MapInfo Pro + MapMarker or Envinsa Server
This option became available in 2006, with the
release of MapInfo Pro v8.0. It is essentially MapMarker
Standalone, but now with the advantage of full integration within the
MapInfo Pro menu system.
Geocoding is accessible via the MapInfo menu
option "Table > Geocode using Server...
"
Requires MapMarker installed locally as a Server, or
an online Envinsa account for "per Transaction" usage.
MapInfo Pro + Detailed Street map
The original way to geocode used MapInfo's built-in
geocoding engine along with a street map. MapInfo Corp pioneered
this technique for PCs, releasing MapInfo (for DOS) in 1987.
MapInfo for Windows, released in 1992, also offered street level
geocoding using displayable and user-editable street maps.
Geocoding is accessible via the MapInfo menu
option Table > Geocode ... Just add
street map data.
Requires a street map with street names & address
ranges: e.g., "StreetPro", "StreetInfo" or comparable.
Uses an "abbreviation file" (e.g., mapinfow.abb)
to standardize some address components: e.g., "Street", "Str", and
"St".
Can use a City or Zip Code boundary map to
distinguish addresses that occur in more than one place in a county.
Advantages of MapInfo Pro +
Detailed Street map
Can be cheaper than MapMarker, if all addresses
are in a single or few counties, especially if local government street
data is available.
Street address information in street tables is
completely editable. Missing a street? Just draw it and fill in
the street name and address fields. You can match against this
new street immediately..
Can easily use new street map information
generated by local governments.
Suitable for use in Mexico & other countries for
which MapMarker versions may not yet be available.
Can be used for reverse-geocoding (EAL version
only): ie., look up nearest address based on lat-lon. This
requires customization.
Geocoding pricing is subject to many variables related to users, usage,
update frequency, volume user discounts, 'local government pricing' and
more. The following examples are subject to change without notice
and just that: examples. Costs are for a single "named user"
license and reflect 3-year subscription discounts when available.
Zip Code
MapInfo Pro + Free data
included with MI Pro
$Free
Geocode to Zip Code centroids:
A MapInfo Pro user can geocode to the Zip Code centroid level using the
US Zip Code maps bundled with MapInfo Pro at no charge.
Limitations:
The free Zip Code table is not
updated regularly so new Zip Codes won't be in it.
Street
MapInfo Pro + Free street data
from Local Government
$Free
Geocoding to the interpolated street address:
City of Seattle, King County and other local governments have digital
street maps that are easily converted to MapInfo street map format.
Limitations:
If address records span counties,
geocoding requires multiple street files and/or more elbow grease.
Some addresses occur in several
places even in a single street map for one county. To resolve
these, MapInfo Pro users need to find a Zip Code or city boundary map
layer.
MapInfo Pro geocoding does not offer
automatic fallback to Zip + 4 centroid or other algorithmic features of
MapMarker.
Street
MapInfo Pro + StreetPro/Enhanced
Address content
$650/county
Geocoding to the interpolated street address using
enhanced street map layer from StreetPro. Unlike "StreetPro
Display", "StreetPro/Enhanced Address" maps include address number
ranges and are pre-formatted for MapInfo Pro geocoding.
Limitations:
Same as with Local Government
street maps.
Street
MapInfo Pro + 50,000 online
Geocode transactions/yr
$1800/yr
Geocode to the interpolated street address anywhere
in the USA, with fallback to Zip + 4, Zip Code and all the features of
MapMarker Plus, incl quarterly data updates. Connect
to the remote geocoder online using the "MapInfo Pro > Table > Geocode
using server" menu command.
Limitations:
Requires internet connection and
online account setup (call SGSI at 206-224-0800).
Extra costs apply if > 50,000
records to geocode.
Prepaid; No refund or carryover if
< 50,000 records geocoded.
Street
MapMarker Plus Standalone (Desktop
Use)
$1,537/state
$11,547/US
(Annual 1-user cost)
Install MapMarker Plus locally then geocode to the
interpolated street address anywhere in the licensed geography, with
fallback to Zip + 4, Zip Code. Includes quarterly data updates.
No limit to transaction volume.
Connect to the
MapMarker Client (standalone) or -- with special configuration by SGSI
-- directly from MapInfo Pro via "Table > Geocode using server".
Limitations:
Internal use only.
Not licensed for use with
internet/intranet server.
"Per user" license cost is more
expensive than "per server/CPU" for ~25+ users.
Street/Parcel
MapMarker Plus Standalone +
Parcel Precision (Desk)
$2,306/state
$17,320/US
(Annual 1-user cost)
Same as MapMarker Plus option (above) but with added
land parcel content. No limit to transaction volume. No
parcel outlines are viewable; parcel content is for geocoding only.
Limitations:
Same as MapMarker Plus option
above.
Parcel map content varies by state.
Street
Server Use: MapMarker
Plus / Envinsa / GeoStan
$Call
Same features as MapMarker Plus option (above) but
with right to use Server APIs include XML web services, Java, and C
api's. Parcel Precision options available. License terms
available for:
- Direct DBMS integration with Oracle or SQL Server;
- Intranet/internet web server use; and more. License fees vary
depending on # servers, # server CPUs, transaction count, and more.