35

I have basic 2-D numpy arrays and I'd like to "downsample" them to a more coarse resolution. Is there a simple numpy or scipy module that can easily do this? I should also note that this array is being displayed geographically via Basemap modules.

SAMPLE: (image omitted)

8 Answers

14

scikit-image has a working implementation of this, although they shy away from calling it downsampling because it is not downsampling in the DSP sense, if I understand correctly:

http://scikit-image.org/docs/dev/api/skimage.measure.html#skimage.measure.block_reduce

It works very well, and it is the only downsampler I found in Python that can deal with np.nan in the image. I have downsampled gigantic images with it very quickly.
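For environments without scikit-image, the same block-reduce idea can be sketched in pure NumPy (a sketch, assuming the array dimensions are divisible by the block size — the function name is mine); np.nanmean gives the NaN-tolerant behaviour described above:

```python
import numpy as np

def block_reduce_nanmean(arr, block):
    # Reshape so each (bh, bw) block gets its own pair of axes,
    # then average over those axes, ignoring NaNs.
    h, w = arr.shape
    bh, bw = block
    return np.nanmean(arr.reshape(h // bh, bh, w // bw, bw), axis=(1, 3))

a = np.arange(16, dtype=float).reshape(4, 4)
a[0, 0] = np.nan
small = block_reduce_nanmean(a, (2, 2))  # shape (2, 2)
```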


Comments

13

When downsampling, interpolation is the wrong thing to do. Always use an aggregated approach.

I use block means to do this, using a "factor" to reduce the resolution.

import numpy as np
from scipy import ndimage

def block_mean(ar, fact):
    # `fact` must be an integer that evenly divides both dimensions of `ar`
    assert isinstance(fact, int), type(fact)
    sx, sy = ar.shape
    X, Y = np.ogrid[0:sx, 0:sy]
    # label each pixel with the index of the fact-by-fact block it belongs to
    regions = sy//fact * (X//fact) + Y//fact
    # mean of each labelled block, reshaped to the coarse grid
    res = ndimage.mean(ar, labels=regions, index=np.arange(regions.max() + 1))
    res.shape = (sx//fact, sy//fact)
    return res

E.g., a (100, 200) array with a factor of 5 (5x5 blocks) gives a (20, 40) result:

ar = np.random.rand(20000).reshape((100, 200))
block_mean(ar, 5).shape  # (20, 40)

5 Comments

Thanks, Mike. I think your solution is more of what I am looking for. When applying your code, I am getting an error due to mismatch of array size: File "diffplot.py", line 38, in block_mean res.shape = (sx/fact, sy/fact) ValueError: total size of new array must be unchanged
The problem above was due to the need for the factor to be equally divisible into the original array shape. However, this function still provides the improper results. Interesting. Does not seem to be 're-sampling' like what I am looking for. Instead, it took the diff array and plotted it multiple times in the basemap window. I think I need some sort of an aggregation or dissolve technique. Thanks for your input thus far.
Hi Mike, would you mind explaining why interpolation is a bad way to downsample? If interpolating is bad, is there a nice way of dealing with cases where the image dimensions aren't divisible by the desired block size?
This is an alternative implementation of the same thing, I believe: github.com/keflavich/image_registration/blob/master/…. I'm not sure how the speed compares, but I'd bet scipy.ndimage is a bit faster.
does not work: ValueError: total size of new array must be unchanged
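The ValueError in the comments comes from fact not dividing the array shape evenly. A minimal workaround (a sketch; the trailing rows/columns are simply discarded, and the helper name is mine) is to trim the array before calling block_mean:

```python
import numpy as np

def trim_to_multiple(ar, fact):
    # Crop trailing rows/columns so both dimensions are divisible by `fact`.
    sx, sy = ar.shape
    return ar[:sx - sx % fact, :sy - sy % fact]

ar = np.random.rand(101, 203)
trimmed = trim_to_multiple(ar, 5)  # shape (100, 200)
```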
6

scipy.ndimage.zoom looks like it does what you want (scipy.misc.imresize did too, but it was removed in SciPy 1.3).

I haven't tried imresize, but here is how I have used ndimage.zoom:

import numpy as np
from scipy import ndimage

a = np.arange(64).reshape(8, 8)
a = ndimage.zoom(a, 0.5)  # decimate resolution

a is then a 4x4 matrix with interpolated values in it

8 Comments

Here is a code snippet: findiff = scipy.misc.imresize(diff, 30., interp='bilinear', mode=None) frefcobj = m.pcolormesh(x,y,findiff,shading='flat',vmin=-15,vmax=15,cmap=cmap,zorder=1) colbar = m.colorbar(frefcobj,"bottom",size="4%",pad="5%",extend='both',ticks=intervals) diff is a 699x699 array. Does not seem to be achieving the task.
I haven't tried imresize before, but I added a snippet using zoom. Is that not what you are looking for? I can't test imresize at the moment because I have an older version of scipy which doesn't seem to include it
Interesting. Does not seem to be 're-sampling' like what I am looking for. Instead, it took the diff array and plotted it multiple times in the basemap window. I think I need some sort of an aggregation or dissolve technique. Thanks for your input thus far.
By downsample you mean you want fewer samples than when you started right? Or do you mean you want to blur your matrix?
I'd like to make the new array more "coarse," so fewer samples.
4

Easiest way: You can use the array[0::2] notation, which only considers every second index. E.g.

array = np.array([[i + j for i in range(10)] for j in range(10)])
down_sampled = array[0::2, 0::2]

print("array \n", array)
print("array2 \n", down_sampled)

has the output:

array 
[[ 0  1  2  3  4  5  6  7  8  9]
 [ 1  2  3  4  5  6  7  8  9 10]
 [ 2  3  4  5  6  7  8  9 10 11]
 [ 3  4  5  6  7  8  9 10 11 12]
 [ 4  5  6  7  8  9 10 11 12 13]
 [ 5  6  7  8  9 10 11 12 13 14]
 [ 6  7  8  9 10 11 12 13 14 15]
 [ 7  8  9 10 11 12 13 14 15 16]
 [ 8  9 10 11 12 13 14 15 16 17]
 [ 9 10 11 12 13 14 15 16 17 18]]
array2 
[[ 0  2  4  6  8]
 [ 2  4  6  8 10]
 [ 4  6  8 10 12]
 [ 6  8 10 12 14]
 [ 8 10 12 14 16]]

1 Comment

you can also use [::2]
4

xarray's "coarsen" method can downsample an xarray.Dataset or xarray.DataArray

For example:

import xarray as xr
import numpy as np
import matplotlib.pyplot as plt

fig, (ax1, ax2, ax3) = plt.subplots(1, 3, figsize=(15,5))

# Create a 10x10 array of random numbers
a = xr.DataArray(np.random.rand(10,10)*100, dims=['x', 'y'])

# "Downscale" the array, mean of blocks of size (2x2)
b = a.coarsen(x=2, y=2).mean()

# "Downscale" the array, mean of blocks of size (5x5)
c = a.coarsen(x=5, y=5).mean()


# Plot and cosmetics
a.plot(ax=ax1)
ax1.set_title("Full Data")

b.plot(ax=ax2)
ax2.set_title("mean of (2x2) boxes")

c.plot(ax=ax3)
ax3.set_title("mean of (5x5) boxes")


Comments

3

Because the OP just wants a coarser resolution, I thought I would share my way of reducing the number of pixels by half in each dimension. It takes the mean of 2x2 blocks. This can be applied multiple times to reduce by factors of 2.

from scipy.ndimage import convolve

# 2x2 block mean, keeping every second row and column
array_downsampled = convolve(array,
    np.array([[0.25, 0.25], [0.25, 0.25]]))[:array.shape[0]:2, :array.shape[1]:2]
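As a usage sketch of the "apply it multiple times" point (the helper name is mine): halving twice turns a (100, 200) array into (25, 50):

```python
import numpy as np
from scipy.ndimage import convolve

kernel = np.full((2, 2), 0.25)  # 2x2 block mean

def halve(a):
    # Convolve with the averaging kernel, then keep every second row/column.
    return convolve(a, kernel)[:a.shape[0]:2, :a.shape[1]:2]

a = np.random.rand(100, 200)
quarter = halve(halve(a))  # shape (25, 50)
```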

Comments

1

This might not be what you're looking for, but I thought I'd mention it for completeness.

You could try installing scikits.samplerate (docs), which is a Python wrapper for libsamplerate. It provides nice, high-quality resampling algorithms -- BUT as far as I can tell, it only works in 1D. You might be able to resample your 2D signal first along one axis and then along another, but I'd think that might counteract the benefits of high-quality resampling to begin with.
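As a sketch of that axis-by-axis idea using SciPy's polyphase resampler in place of libsamplerate (scipy.signal.resample_poly is 1-D but takes an axis argument):

```python
import numpy as np
from scipy.signal import resample_poly

# Downsample by a factor of 2 along each axis in turn.
a = np.random.rand(100, 200)
half = resample_poly(resample_poly(a, 1, 2, axis=0), 1, 2, axis=1)
```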

1 Comment

Yes, that won't work for this situation, but thanks for the input. I need something that can aggregate spatially.
0

This will take an image of any resolution and return a version at one quarter of the resolution in each dimension (one sixteenth of the pixels) by keeping every 4th row and column of the image array.

import cv2
import numpy as np

def quarter_res_drop(im):
    resized_image = im[0::4, 0::4]
    cv2.imwrite('resize_result_image.png', resized_image)
    return resized_image

im = cv2.imread('Your_test_image.png', 1)

quarter_res_drop(im)

Comments
