Format Change of Files in Daylight

Oct 11th, 2023

Starting in v1.32, we are changing how we compress OSC files: instead of bzip2, we now use gzip. As a result, OSC file names change from .osc.bz2 to .osc.gz. We made this change because gzip is faster and more energy-efficient than bzip2.
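For anyone whose tooling reads OSC files directly, here is a minimal Python sketch of handling both the old and new extensions during the transition. The helper name open_osc is hypothetical and not part of any Daylight tooling; it only assumes that the file extension reflects the compression actually used.

```python
import bz2
import gzip
from pathlib import Path


def open_osc(path):
    """Open an OSC file as text, choosing the decompressor from the extension.

    Hypothetical helper for illustration; not part of the Daylight release.
    """
    name = Path(path).name
    if name.endswith(".osc.gz"):
        # New naming from v1.32 onward: gzip-compressed change file.
        return gzip.open(path, "rt", encoding="utf-8")
    if name.endswith(".osc.bz2"):
        # Older naming: bzip2-compressed change file.
        return bz2.open(path, "rt", encoding="utf-8")
    if name.endswith(".osc"):
        # Uncompressed change file.
        return open(path, "rt", encoding="utf-8")
    raise ValueError(f"not an OSC file: {path}")
```

A script that loops over downloaded files can call open_osc on each one without caring which release it came from.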

We have also decided to release the machine learning buildings as a PBF file instead of an OSC file. The old buildings OSC contained only “create” operations, with no “modify” or “delete” operations, so there was no real need for the OSC format. This change also reduces the size of the released file, making it faster to download and process. It does require people to use “osmium merge” instead of “osmium apply-changes”. We found that “osmium apply-changes” runs into memory limitations with datasets of this size, whereas “osmium merge” uses less memory and is faster. The buildings file is also renamed from ms-buildings-v1.30.osc.gz to ml-buildings-v1.32.osm.pbf.
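As a rough illustration of the new workflow, the sketch below builds and runs an “osmium merge” call from Python. The function names and the planet and output file names in the usage note are assumptions made for the example; only ml-buildings-v1.32.osm.pbf comes from this release. Note that osmium merge expects its input files to be sorted, so this assumes both inputs already are.

```python
import shutil
import subprocess


def merge_command(inputs, output):
    """Build the argument list for `osmium merge` (hypothetical helper)."""
    if not inputs:
        raise ValueError("at least one input file is required")
    # Equivalent shell form: osmium merge INPUT... -o OUTPUT
    return ["osmium", "merge", *inputs, "-o", output]


def run_merge(inputs, output):
    """Run the merge, failing early if osmium is not installed."""
    if shutil.which("osmium") is None:
        raise RuntimeError("osmium not found on PATH")
    subprocess.run(merge_command(inputs, output), check=True)
```

For example, run_merge(["planet-v1.32.osm.pbf", "ml-buildings-v1.32.osm.pbf"], "planet-with-buildings.osm.pbf") would combine a planet file with the buildings file, where the first and last names are placeholders for your own paths.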

We hope these changes do not cause any inconvenience. We’d love to hear any feedback or comments.

How To Reach The Team

If you have any questions about this data distribution, you can ask them in the #daylightdistro_feedback channel we have created in the OSM US Slack. Members of the team will be there periodically to answer questions. You can also email the team at osm@fb.com.