Tags
I was reading David Loshin’s (@davidloshin) blog Refinements to Location Standards: Geocoding yesterday afternoon. David says geocode is an answer to finding locations that do not map directly to an address.
Geocoding, as most of us know, is the process of finding geographic coordinates which are expressed as latitude and longitude. For example: [13.746076, 100.540073] represents an address – Thanon Ratchadamri, Pathum Wan, Bangkok 10110 in Thailand.
It’s a truly amazing process which serves as a great guide for people who want to get to their destinations faster. We are seeing geocoding being commonly used on web today for services like finding driving directions, locating geographically nearest gas stations, restaurants or service locations when on road. (Heck, there are 1000’s of location service based apps today which wouldn’t have existed if this process wasn’t there).
Extending David’s discussion further – In future, we would like to see employment of geocoding in the way we store the location information in our Master Data Management systems. Today, MDM products provide provision to store address attributes like – street name, street number, city, state, postal code, country etc. This will have to change eventually as we will be using only 2 attributes to represent the location of our customer with accuracy at its finest.
The biggest advantage which we can gain with geocoding is identifying duplicate addresses.
The missing zip codes and non-standard representation of address fields are some of the issues we have with address formats today. Added to this, each country has its own way of representing addresses. These aspects of address lead to a wrong matching no matter how powerful our algorithms are, causing significant number of false positive (different member – match) and false negative (same member – don’t match) scenarios.
Geocoding resolves these issues as it provides a consistent location format across geographies and is compact and accurate.
So, few decades down the line, if someone asks you your address, act like miss smarty pants and tell them the coordinates.
I can’t wait to get a rebirth & be in 22nd century! If only nature permits!! How about you!?

Geocoding sounds like an interesting idea for dealing with the wide variety or locations (distance of bridge along a road, postal address etc.) although I don’t think two coordinates are going to be enough in many cases. For example, which level of a flyover are you trying to locate? Or what floor is the customer on? Also, David points out, “Geocodes specified with enough precision provide a very good way to pinpoint a location.” however I think that would need to be very high degree of accuracy to reliably identify even reasonably large locations; a tower block could have many flats on each floor, hotel rooms are smaller still, or parking spaces, seats in a stadium, books in a library… Maybe I went too far!!
Thanks for your comments James.
You bought up very interesting point. Indeed, two coordinates may not be sufficient in the 3D world we live. Longitude and latitude helps us locate a specific location on earth’s surface and as this technology improves we might see a better precision than what we have today. But it won’t solve the location data like multiple flats built vertically in a residential tower.
However, we may not be very far from 3D geocoding too. I stumbled upon this article today which talks about 3D geocoding tool to visualize earth. Obviously, it’s too early to think this technology in itself will replace the system we are using for decades. But, it has potential to locate every place on earth and can benefit to a large degree. I really hope we get there faster
Good one Prash. However, as we say around my geocode: “The trees never grow into heaven”.
I have had my share of issues when matching geocodes, because there are of course some complications:
• A geocode can be expressed in many different systems. Examples are latitude and longitude picturing a round world, UTM X,Y coordinates picturing peels of the world and WGS84 X,Y coordinates picturing a world as flat as a computer screen.
• The coordinates of a single family house may be where the ground meets a public street, suitable for route planning, or the center of the house or a lot of other points.
• Sometimes geocodes for addresses on a street are interpolated, so they don’t represent the exact positions (but I guess that is getting better and better around).
Oh, I hate myself for being the Whinger in the Data Quality Zoo . And I actually guess that you are right that street names eventually will go away.
Henrik, Thank you for sharing your experience.
As James too pointed out in his comment, geocodes may be a fairly advanced technology, but it too has its share of challenges. Longitude and latitudes we currently use in many services may just not be enough to locate a particular customer. We need more tuning to this technology. In response to James I provided a link to an article about 3D geocoding. It will be interesting to see how this technology evolves and help us in creating accurate location master data.
Also, thanks for more thoughts on this topic in your new blog – Down the Street. Good read!
Pingback: Down the Street « Liliendahl on Data Quality
I observed the hyperlinks are not working in the comments. Here is the link to the 3D Geocoding I mentioned.
http://blogs.msdn.com/b/virtualearth3d/archive/2009/02/11/microsoft-photo-calibration-tool.aspx
PC – geo coding may not replace the address after all due to many factors. for example if you want to do suspect duplicate processing, address do play an important role in that, if I am not wrong. Now the address of same person with different phonetic names and a slight difference of longitude and latitude could not be able to identify a suspect. What I feel is a combination of latitude, longitude and the address information can help us to identify a suspect. Just my thoughts, may not be true…After all, you are the expert in MDM..:))
Regards
Bhavik
Thank you for your comment Bhavik,
Yes you are right. Geocoding in its current form will be a hard nut to crack. We relay hugely on address information today to decide whether or not two parties belong to an A1 match category. (Off course along with all other critical data elements like Name, DOB, Gender and Identification numbers).
Geocodes need to be more accurate in locating a specific address. We should be able to locate every location (house/flat/condo/office etc) using it. Also, making standard geocodes so those “slight” differences you mentioned go away is going to be key.
Your suggestion to use longitude and latitude along with address is a good starting point. In that way at least we are making an effort to employ them