diff options
author | Karsten Loesing <karsten.loesing@gmx.net> | 2014-02-25 13:20:04 +0100 |
---|---|---|
committer | Karsten Loesing <karsten.loesing@gmx.net> | 2014-02-25 13:20:04 +0100 |
commit | 1d2179bc900f1646a5491b65294e78b175e70056 (patch) | |
tree | 441cda665e67e91ef2ce6fc6f87fbaafd38930ab /src/config/mmdb-convert.py | |
parent | 0efa2821c7e1865d8f515df735fd8ad0db5ff467 (diff) | |
download | tor-1d2179bc900f1646a5491b65294e78b175e70056.tar.gz tor-1d2179bc900f1646a5491b65294e78b175e70056.zip |
Fall back to registered country if necessary.
When extracting geoip and geoip6 files from MaxMind's GeoLite2 Country
database, we only look at country->iso_code which is the two-character ISO
3166-1 country code of the country where MaxMind believes the end user is
located.
But if MaxMind thinks a range belongs to anonymous proxies, they don't put
anything there. Hence, we omit those ranges and resolve them all to '??'.
That's not what we want.
What we should do is first try country->iso_code, and if there's no such
key, try registered_country->iso_code which is the country in which the
ISP has registered the IP address.
In short: let's fill all A1 entries with what ARIN et. al think.
Diffstat (limited to 'src/config/mmdb-convert.py')
-rw-r--r-- | src/config/mmdb-convert.py | 13 |
1 files changed, 13 insertions, 0 deletions
diff --git a/src/config/mmdb-convert.py b/src/config/mmdb-convert.py index 21d170adf6..4245738542 100644 --- a/src/config/mmdb-convert.py +++ b/src/config/mmdb-convert.py @@ -339,11 +339,24 @@ def parse_mm_file(s): def format_datum(datum): """Given a Datum at a leaf of the tree, return the string that we should write as its value. + + We first try country->iso_code which is the two-character ISO 3166-1 + country code of the country where MaxMind believes the end user is + located. If there's no such key, we try registered_country->iso_code + which is the country in which the ISP has registered the IP address. + Without falling back to registered_country, we'd leave out all ranges + that MaxMind thinks belong to anonymous proxies, because those ranges + don't contain country but only registered_country. In short: let's + fill all A1 entries with what ARIN et. al think. """ try: return bytesToStr(datum.map['country'].map['iso_code'].data) except KeyError: pass + try: + return bytesToStr(datum.map['registered_country'].map['iso_code'].data) + except KeyError: + pass return None IPV4_PREFIX = "0"*96 |