Importing OSM Overpass API Data To MYSQL Importing OSM Overpass API Data To MYSQL

January 4, 2022

laravel londinium openstreetmap

I wanted to improve the area maps on londinium.com by adding a list of the websites nearby to each location. I started with the query I blogged about a few posts back, so I started with just the nodes, but large objects like hospitals didnt appear in the first list. This is because they are not nodes, rather ways and relations. The problem with these non-nodes is that they didnt have a lat/long in the same way the nodes did. But I found you could use the center output mode and replace out with out center so i my query for extracting websites is now put in a query file called query.osm like so:

data=[out:json];nwr[~"^(website|twitter|facebook|contact:website|contact:twitter|contact:facebook)$"~"."]  (51.11386850819646,-1.197509765625,51.92394344554469,0.85418701171875);out center;

Note that the export area is encoded in the query, and the output format is json.

To download the osm data with this command using the 'query.osm' file above run:

wget -O osm.json --post-file=query.osm "https://overpass-api.de/api/interpreter"

To parse the osm.json file to extract data and create an insert sql file, I amended my parseXml.php file like so

I am now donwloading json, so i needed to change the parse file a lot. It is much cleaner and easier with json to extract the data

<?php

error_reporting(E_ALL ^ E_WARNING); // disable warnings as this is an sql import

$json = file_get_contents('osm.json');
$phpDataArray = json_decode($json, false ); 

foreach($phpDataArray as $value){

    foreach($value as $row){

        if ($row->tags->website){    $website = $row->tags->website;}
        else    $website = $row->tags->{'contact:website'};

        // if the website is not valid or blank - continue, ignoring this row
        if (filter_var($website, FILTER_VALIDATE_URL) === false) {
            continue;
        }

        if ($row->id){
            $nwr = $row->type; // Node, Way, Relation

            if ($row->lat){
                $lat = $row->lat;
            }
            else
                $lat = $row->center->lat;

            if ($row->lon){
                $lon = $row->lon;
            }
            else
                $lon = $row->center->lon;


            $address = $row->tags->{'addr:housenumber'} . " " . $row->tags->{'addr:housename'} . " " .
            $row->tags->{'addr:unit'} . " " . $row->tags->{'addr:street'} . " " .
            $row->tags->{'addr:place'} . " " . $row->tags->{'addr:city'} . " " .
            $row->tags->{'addr:postcode'};            

            $feature = "";

            if($row->tags->railway){ $feature = $row->tags->railway;}
            if($row->tags->public_transport){ $feature = $row->tags->public_transport;}
            if($row->tags->highway){ $feature = $row->tags->historic;}
            if($row->tags->building && ($row->tags->building != "yes") ){ $feature = $row->tags->building;}
            if($row->tags->natural){ $feature = $row->tags->natural;}
            if($row->tags->sport){ $feature = $row->tags->sport;}
            if($row->tags->leisure){ $feature = $row->tags->leisure;}
            if($row->tags->landuse){ $feature = $row->tags->landuse;}
            if($row->tags->craft){ $feature = $row->tags->craft;}
            if($row->tags->office){ $feature = $row->tags->office;}
            if($row->tags->tourism){ $feature = $row->tags->tourism;}
            if($row->tags->shop){ $feature = $row->tags->shop;}
            if($row->tags->amenity){ $feature = $row->tags->amenity;}

            $amenity = "";
            if($row->tags->amenity){ $amenity = $row->tags->amenity;}

            $building = "";            
            if($row->tags->building && ($row->tags->building != "yes") ){ $building = $row->tags->building;}

            $tourism = "";            
            if($row->tags->tourism){ $tourism = $row->tags->tourism;}

            $craft = "";
            if($row->tags->craft){ $craft = $row->tags->craft;}

            //social

            if ($row->tags->twitter){    $twitter = $row->tags->twitter;}
            else    $twitter = $row->tags->{'contact:twitter'};

            if ($row->tags->facebook){    $facebook = $row->tags->facebook;}
            else    $facebook = $row->tags->{'contact:facebook'};

            if ($row->tags->instagram){    $instagram = $row->tags->instagram;}
            else    $instagram = $row->tags->{'contact:instagram'};

            if ($row->tags->linkedin){    $linkedin = $row->tags->linkedin;}
            else    $linkedin = $row->tags->{'contact:linkedin'};

            // make the sql using INSERT IGNORE to prevent errors
            echo "INSERT IGNORE INTO `points` (`id`, `website`, `address`, `feature` ,`amenity` ,`building` ,
            `tourism` ,`craft` , `twitter`,  `facebook`,  `linkedin`, `instagram`,  `name`, `nwr`, `point`) ";
            echo "VALUES(" . $row->id .", '" . addslashes($website) . "', '". addslashes(trim($address)) . "', '".
             addslashes($feature) . "', '" .addslashes($amenity) . "', '" .addslashes($building) . "', '" .
             addslashes($tourism) . "', '" .addslashes($craft) . "', '" . $twitter . "', '". $facebook 
             . "', '". $linkedin . "', '". $instagram . "', '" . addslashes($row->tags->name). "', '".
              $nwr . "', POINT(" . $lat . " , " . $lon ." ));"; 

            echo PHP_EOL;
        }
    }
}


To create the sql file, I ran it like this

parseJson.php > points.sql

I had to extract the nwr (Node/Way/Relation) to be able to link to the live db database on openstreetmap. I also renamed the field type as feature so the new database table is like so:

CREATE TABLE `points` (
  `id` varchar(11) NOT NULL,
  `nwr` varchar(255) NOT NULL,
  `feature` varchar(255) NOT NULL,
  `amenity` varchar(255) NOT NULL,
  `building` varchar(255) NOT NULL,
  `tourism` varchar(255) NOT NULL,
  `craft` varchar(255) NOT NULL,
  `name` varchar(255) NOT NULL,
  `address` varchar(255) NOT NULL,
  `website` varchar(255) NOT NULL,
  `twitter` varchar(255) NOT NULL,
  `facebook` varchar(255) NOT NULL,
  `linkedin` varchar(255) NOT NULL,
  `instagram` varchar(255) NOT NULL,
  `point` point NOT NULL,

  PRIMARY KEY (`id`),
  SPATIAL KEY `point` (`point`)
) ENGINE=InnoDB;

import into db using pipe viewer to see progress

pv points.sql | mysql -u root -p  londinium

And hey presto, a table full of data with each entity having geographic data.

There is still a number of 'dodgy' entries, which i plan to clean up at the next stage of this this import.


If you would like to contact me with this form on londinium.com, ilminster.net or via Twitter @andylondon