Showing results for

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page

Highlighted
##

mzp

Beginner

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

09-12-2018
02:03 PM

17 Views

Data thinning

Hello,

Can someone recommend routine(s) that would help with data thinning based on Euclidean distance between points? The locations may be e.g. latitudes and longitudes.

Thanks,

2 Replies

Highlighted
##

mecej4

Black Belt

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

09-12-2018
07:02 PM

17 Views

I don't know of any such routines, but it appears to me that no routine is needed. Please expand on what you mean by "data thinning", and why you wish to use euclidean distance instead of geodesic distance.

If you really wish to use euclidean distance, use the elementary transformation from spherical polar coordinates to rectangular coordinates. If you only wish to rank items by distance, the distance^{2} is equally good and more convenient for sorting.

If the distances are large (relative to the radius of the sphere), you really should use the great circle distance since the flat-earth approximation is quite inaccurate. See the Wikipedia article on the Haversine Formula: https://en.wikipedia.org/wiki/Haversine_formula . Again, if ranking is the goal, it may be sufficient to rank by {sin(distance/diameter)}^{2} instead of the distances, You may also consider pre-computing and storing a table of haversines.

The distance calculation is more complicated if you wish to account for deviations from sphericity.

Highlighted
##

mzp

Beginner

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

09-13-2018
08:57 AM

17 Views

Thanks mecej4 for your response.

Yes, I will use the geodesic distance but wanted to make it simple since I thought more people would understand what I mean.

The thinning is for satellite observations which might be available at very high spatial resolutions (i.e. very close distance from each other) but such density is required for certain applications. So I am looking for a code that would retain sample of these observations. E.g. data come in pixels that are 1 km apart but keeping just pixels that are 50 km apart is sufficient for this purpose. There are some thinning algorithms but writing own code that would include parallel processing can be quite tedious. So, I am looking for a shortcut to use a code that is already available possibly for a similar purpose. Thanks for any suggestions.

For more complete information about compiler optimizations, see our Optimization Notice.