Re: [geotrellis-user] Area of intersection for vector data in Spark

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]

Re: [geotrellis-user] Area of intersection for vector data in Spark

From: Lukasz Tracewski <lukasz.tracewski@xxxxxxxxxxx>
Date: Tue, 4 Apr 2017 17:18:09 +0000
Accept-language: en-US
Delivered-to: geotrellis-user@xxxxxxxxxxxxxxxx
List-archive: <https://dev.locationtech.org/mhonarc/lists/geotrellis-user>
List-help: <mailto:geotrellis-user-request@locationtech.org?subject=help>
List-subscribe: <https://dev.locationtech.org/mailman/listinfo/geotrellis-user>, <mailto:geotrellis-user-request@locationtech.org?subject=subscribe>
List-unsubscribe: <https://dev.locationtech.org/mailman/options/geotrellis-user>, <mailto:geotrellis-user-request@locationtech.org?subject=unsubscribe>
Thread-index: AQHSqSSmW5odLxPiDEOjvQNR4rFM+aG0JQCAgAFXh4A=
Thread-topic: [geotrellis-user] Area of intersection for vector data in Spark

Thanks Rob! My data is rather large and I have cluster at hand, which is exactly why I was looking into options what could work for me. Would you have any suggestions where to start? I am happy to contribute to the project and don't mind at all working with not-out-of-the-box solutions! I am only starting with Scala, but have good software engineering experience and know functional programming.

Cheers,
Lukasz

On 03.04.2017 22:48, Rob Emanuele wrote:

Hi Lukasz,

If you can all the features in GeoJSON, is the amount of data actually too big to fit into memory and to do on one machine? GeoTrellis would be able to help with that if it had enough memory, without spark - read in the feature collection, spatially partition the one collection, and do a bounds query using the features of the other collection to pull out intersecting geometries.

We have some functionality contained in an object called VectorJoin, which will efficiently join two vector datasets - with the caveat that those RDDs of vector data are spatially partitioned before the join. We actually don't have a good way to do this type of spatial partitioning - I've written up an issue to track this, so that next release we'd have a good solution to that problem (https://github.com/locationtech/geotrellis/issues/2116)

There's a possibility to lean on GeoTrellis for components of not-out-of-the-box solution to this problem, so if you are interested in diving deeper into this.

Thanks,

Rob

On Thu, Mar 30, 2017 at 3:10 AM, Lukasz Tracewski <lukasz.tracewski@xxxxxxxxxxx> wrote:

Hi,

I am trying to figure out how - or if at all - I can use your library to
calculate an area of intersection for two relatively large GeoJSONs.
Both are FeatureCollections that contain a number of features. Due to
the size of the data I'd like to use Spark. Can you point me in the
right direction?

Thanks,
Lucas

_______________________________________________
geotrellis-user mailing list
geotrellis-user@locationtech.org
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.locationtech.org/mailman/listinfo/geotrellis-user

--

Robert Emanuele, VP of Research
Azavea | 990 Spring Garden Street, 5th Floor, Philadelphia, PA
remanuele@xxxxxxxxxx | @lossyrob
_______________________________________________
geotrellis-user mailing list
geotrellis-user@xxxxxxxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.locationtech.org/mailman/listinfo/geotrellis-user

Follow-Ups:
- Re: [geotrellis-user] Area of intersection for vector data in Spark
  - From: Ross Bernet

References:
- [geotrellis-user] Area of intersection for vector data in Spark
  - From: Lukasz Tracewski
- Re: [geotrellis-user] Area of intersection for vector data in Spark
  - From: Rob Emanuele

Prev by Date: Re: [geotrellis-user] Area of intersection for vector data in Spark
Next by Date: Re: [geotrellis-user] Area of intersection for vector data in Spark
Previous by thread: Re: [geotrellis-user] Area of intersection for vector data in Spark
Next by thread: Re: [geotrellis-user] Area of intersection for vector data in Spark
Index(es):
- Date
- Thread

Breadcrumbs