Implement importing of extracts to database #5
base: master
Conversation
def _create_postgres_db(self):
    # create new postgresql cluster
    print_stage('Creating PostgreSQL cluster')
    db_dir = os.path.join(self.working_dir, 'pg_data')
This will do unexpected things if one starts two containers processing in parallel with the same dirs; maybe also use the tile coordinates?
good idea
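A minimal sketch of that suggestion, assuming the tile object exposes zoom, x and y attributes (the attribute names are my guess, not confirmed by this PR):

# Scope the cluster directory to the tile being processed, so parallel
# containers sharing a working dir cannot clobber each other's clusters.
db_dir = os.path.join(self.working_dir,
                      f'pg_data_{self.tile.zoom}_{self.tile.x}_{self.tile.y}')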
db_dir = os.path.join(self.working_dir, 'pg_data')
subprocess.run(['rm', '-rf', db_dir], check=True)
os.makedirs(db_dir)
subprocess.run(['pg_createcluster', PG_VERSION, 'main', '--start', '--datadir', db_dir], check=True,
We probably should edit the configuration of the database and allow more RAM use
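One way this could look (a sketch only; the concrete settings and values are placeholder assumptions, and the config path assumes the Debian pg_createcluster layout):

# Hypothetical tuning: let the import-only cluster use more RAM.
conf_path = f'/etc/postgresql/{PG_VERSION}/main/postgresql.conf'
with open(conf_path, 'a') as conf:
    conf.write('shared_buffers = 2GB\n')        # default is only 128MB
    conf.write('maintenance_work_mem = 1GB\n')  # speeds up index builds
subprocess.run(['pg_ctlcluster', PG_VERSION, 'main', 'restart'], check=True)

Since the cluster only lives for one import, generous values should be safe here.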
print_stage("Importing data with osm2pgsql") | ||
subprocess.run([ | ||
'su', 'postgres', '-c', | ||
f'osm2pgsql --slim --hstore-all -C 3000 ' |
is 3000 a good choice for RAM usage?
that's what the original script used
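For reference, -C sets osm2pgsql's node cache size in MB. If the hard-coded 3000 turns out to be a poor fit, it could be derived from the machine's physical memory instead (a sketch; the 50% share is an arbitrary assumption):

# Size the osm2pgsql node cache (in MB) from physical RAM instead of
# hard-coding 3000; uses roughly half of total memory (Linux-only sysconf keys).
total_mb = os.sysconf('SC_PHYS_PAGES') * os.sysconf('SC_PAGE_SIZE') // (1024 * 1024)
cache_mb = max(1000, total_mb // 2)
# ...then pass f'-C {cache_mb}' in the osm2pgsql command line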
    f'-P {self.db_port} '
    f'-U postgres '
    f'--number-processes {max(1, os.cpu_count() - 2)} '
    f'{os.path.join(self.working_dir, self.pbf_file_name)}'
use unchecked tables as we will drop them later on?
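If "unchecked" refers to PostgreSQL UNLOGGED tables (my reading, not confirmed in the thread), they skip write-ahead logging, which is fine for data that is dumped and dropped anyway. A sketch using the default osm2pgsql table names:

# Hypothetical: skip WAL for the throwaway import tables.
for table in ('planet_osm_point', 'planet_osm_line',
              'planet_osm_polygon', 'planet_osm_roads'):
    subprocess.run(['su', 'postgres', '-c',
                    f'psql -p {self.db_port} -d {self.db_dbname} '
                    f'-c "ALTER TABLE {table} SET UNLOGGED"'],
                   check=True)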
with open(file_path, 'wb') as f:
    subprocess.run(['su', 'postgres', '-c',
                    f'pg_dump -p {self.db_port} -d {self.db_dbname} --format custom'],
                   check=True, cwd=self.out_dir, stdout=f)
add compression?
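pg_dump's custom format already compresses at a moderate default level when built with zlib; raising it is a one-flag change that trades CPU time for a smaller dump:

with open(file_path, 'wb') as f:
    subprocess.run(['su', 'postgres', '-c',
                    f'pg_dump -p {self.db_port} -d {self.db_dbname} '
                    f'--format custom --compress 9'],  # maximum zlib level
                   check=True, cwd=self.out_dir, stdout=f)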
print_stage('Uploading PostgreSQL dump to tileserver-mapping')
file_path = os.path.join(self.working_dir, 'db.pg_dump')
self.api.upload_sql_dump(self.tile, file_path)
subprocess.run(['rsync', file_path, self.out_dir], check=True)
If it is uploaded, I would not put it into local storage; it would only waste space.
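A sketch of acting on this, gating the local copy behind a flag (keep_local_copy is a hypothetical option, not something this PR defines):

self.api.upload_sql_dump(self.tile, file_path)
# Hypothetical flag: only mirror the dump locally when explicitly asked;
# otherwise the uploaded copy is the single source of truth.
if self.keep_local_copy:
    subprocess.run(['rsync', file_path, self.out_dir], check=True)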
def _upload_dump(self):
    print_stage('Uploading PostgreSQL dump to tileserver-mapping')
    file_path = os.path.join(self.working_dir, 'db.pg_dump')
    self.api.upload_sql_dump(self.tile, file_path)
does it retry the upload?
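If the API client has no retry logic of its own, a bounded retry loop could wrap the call (a sketch; the attempt count and backoff are arbitrary choices):

import time

# Retry the upload a few times with exponential backoff before giving up.
for attempt in range(3):
    try:
        self.api.upload_sql_dump(self.tile, file_path)
        break
    except Exception:
        if attempt == 2:
            raise                  # out of attempts, propagate the error
        time.sleep(2 ** attempt)   # wait 1s, then 2s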
This depends on Map-Data/tileserver-mapping#10
I tried to reproduce the steps from https://github.com/Map-Data/regiontileserver/blob/master/import_data.sh as closely as possible, with the addition of coordinating files through tileserver-mapping.