使用 cv2.rectangle() 的多个边界框

Question

使用 cv2.rectangle() 的多个边界框

Sil*_*t-J 1 python opencv bounding-box yolo

我目前有边界框的坐标数据，包含在嵌套数据结构中，如下所示：

 defaultdict(list,
            {'giraffe': [{'conf': 0.9869,
               'coords': {'center_x': 0.360333,
                'center_y': 0.532274,
                'height': 0.596343,
                'width': 0.144651}},
              {'conf': 0.253321,
               'coords': {'center_x': 0.016296,
                'center_y': 0.565007,
                'height': 0.580526,
                'width': 0.03498}}],
             'zebra': [{'conf': 0.998863,
               'coords': {'center_x': 0.545974,
                'center_y': 0.693267,
                'height': 0.301859,
                'width': 0.257102}}]})

Run Code Online (Sandbox Code Playgroud)

我想迭代数据结构 ( img_obj_data) 并为每个 object_class 的每个对象绘制矩形。

然后我想保存图像（绘制了方框），以便稍后打开它。

我的第一次尝试如下：

import cv2

img = cv2.imread(img_path)
img_h, img_w = img.shape[:2]

for obj_class in img_obj_data.keys():
    for sub_dict in img_obj_data[obj_class]:
        x, y, w, h = sub_dict['coords'].values()
        
        # coords cannot be floats
        x = int(x*img_w)
        y = int(y*img_h)
        x_max = int(w*img_w)
        y_max = int(y*img_h)

        cv2.rectangle(img, (x, y), (x_max, y_max), color=(0, 255, 0), thickness=2)
cv2.imwrite('/content/foobar.jpg', img)

Run Code Online (Sandbox Code Playgroud)

我现在遇到了两个问题：

问题 1) 边界框未与对象正确对齐，并且从图像中被裁剪掉。（坐标最初是浮点数，我将它们乘以图像宽度和高度，但我有预感这是错误的做法？）

问题2)

我目前的代码将所有框的颜色设置为相同。我该如何处理才能使每个对象类别的颜色不同？

这是有问题的图像，以及我的代码生成的图像：

Answer 1

pon*_*d27 6

其中一个例子如下。*如有必要，请根据您的环境切换h和w。

import cv2
from random import randint

img = cv2.imread(img_path)
img_h, img_w = img.shape[:2]
colors = {}

for obj_class in img_obj_data.keys():
    if obj_class not in colors:
        colors[obj_class] = [randint(0, 255), randint(0, 255), randint(0, 255)]

    for sub_dict in img_obj_data[obj_class]:
        x, y, h, w = sub_dict['coords'].values()
        # coords cannot be floats
        x_min = int((x-w/2)*img_w)
        y_min = int((y-h/2)*img_h)
        x_max = int((x+w/2)*img_w)
        y_max = int((y+h/2)*img_h)
        cv2.rectangle(img, (x_min, y_min), (x_max, y_max), color=colors[obj_class], thickness=2)
cv2.imwrite('/content/foobar.jpg', img)

Run Code Online (Sandbox Code Playgroud)

此示例根据类名称随机更改颜色，但如果您知道类名称，则可以colors提前定义。

归档时间：	4 年，10 月前
查看次数：	5587 次
最近记录：	4 年，10 月前