调整图像及其边界框的大小

Question

新手上路，请多包涵

我有一个带有边界框的图像，我想调整图像的大小。

 img = cv2.imread("img.jpg",3)
x_ = img.shape[0]
y_ = img.shape[1]
img = cv2.resize(img,(416,416));

现在我想计算比例因子：

 x_scale = ( 416 / x_)
y_scale = ( 416 / y_ )

并绘制图像，这是原始边界框的代码：

 ( 128, 25, 447, 375 ) = ( xmin,ymin,xmax,ymax)
x = int(np.round(128*x_scale))
y = int(np.round(25*y_scale))
xmax= int(np.round  (447*(x_scale)))
ymax= int(np.round(375*y_scale))

但是使用这个我得到：

在此处输入图像描述

而原来的是：

在此处输入图像描述

我在这个逻辑中没有看到任何标志，怎么了？

整个代码：

 imageToPredict = cv2.imread("img.jpg",3)
print(imageToPredict.shape)

x_ = imageToPredict.shape[0]
y_ = imageToPredict.shape[1]

x_scale = 416/x_
y_scale = 416/y_
print(x_scale,y_scale)
img = cv2.resize(imageToPredict,(416,416));
img = np.array(img);

x = int(np.round(128*x_scale))
y = int(np.round(25*y_scale))
xmax= int(np.round  (447*(x_scale)))
ymax= int(np.round(375*y_scale))
Box.drawBox([[1,0, x,y,xmax,ymax]],img)

和抽屉

def drawBox(boxes, image):
    for i in range (0, len(boxes)):
        cv2.rectangle(image,(boxes[i][2],boxes[i][3]),(boxes[i][4],boxes[i][5]),(0,0,120),3)
    cv2.imshow("img",image)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

边界框的图像和数据是分开加载的。我正在图像内部绘制边界框。图像不包含框本身。

原文由 jejjejd 发布，翻译遵循 CC BY-SA 4.0 许可协议

python-3.x numpy opencv math image-processing

阅读 643

1 个回答

得票最新

社区维基

1

发布于
2022-11-16

我认为有两个问题：

You should swap x_ and y_ because shape[0] is actually y-dimension and shape[1] is the x-dimension
您应该在原始图像和缩放后的图像上使用相同的坐标。 On your original image the rectangle is (160, 35) - (555, 470) rather than (128,25) - (447,375) that you use in the code.

如果我使用以下代码：

 import cv2
import numpy as np

def drawBox(boxes, image):
    for i in range(0, len(boxes)):
        # changed color and width to make it visible
        cv2.rectangle(image, (boxes[i][2], boxes[i][3]), (boxes[i][4], boxes[i][5]), (255, 0, 0), 1)
    cv2.imshow("img", image)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

def cvTest():
    # imageToPredict = cv2.imread("img.jpg", 3)
    imageToPredict = cv2.imread("49466033\\img.png ", 3)
    print(imageToPredict.shape)

    # Note: flipped comparing to your original code!
    # x_ = imageToPredict.shape[0]
    # y_ = imageToPredict.shape[1]
    y_ = imageToPredict.shape[0]
    x_ = imageToPredict.shape[1]

    targetSize = 416
    x_scale = targetSize / x_
    y_scale = targetSize / y_
    print(x_scale, y_scale)
    img = cv2.resize(imageToPredict, (targetSize, targetSize));
    print(img.shape)
    img = np.array(img);

    # original frame as named values
    (origLeft, origTop, origRight, origBottom) = (160, 35, 555, 470)

    x = int(np.round(origLeft * x_scale))
    y = int(np.round(origTop * y_scale))
    xmax = int(np.round(origRight * x_scale))
    ymax = int(np.round(origBottom * y_scale))
    # Box.drawBox([[1, 0, x, y, xmax, ymax]], img)
    drawBox([[1, 0, x, y, xmax, ymax]], img)

cvTest()

并将您的“原始”图像用作“49466033\img.png”，

我得到以下图像

正如您所看到的，我的细蓝线正好位于您原来的红线内，并且无论您选择什么 targetSize 它都留在那里（因此缩放实际上可以正常工作）。

原文由 SergGr 发布，翻译遵循 CC BY-SA 3.0 许可协议

撰写回答

你尚未登录，登录后可以

和开发者交流问题的细节
关注并接收问题和回答的更新提醒
参与内容的编辑和改进，让解决方法与时俱进

推荐问题

Stack Overflow 翻译

子站问答

访问

本篇内容翻译自 Stack Overflow，如果你觉得翻译结果值得改进，欢迎直接编辑修改，感谢你为社区贡献。

相似问题

找不到问题？创建新问题

调整图像及其边界框的大小

你尚未登录，登录后可以

DataCap 中验证码无法显示，后台出现 NullPointerException 错误?

springboot项目使用ffmpeg和opencv运行时如何加载so依赖？

分部积分法怎么用？？？

Stack Overflow 翻译