��YOLOv8�Ĳ�ݮ��⣬��EMAע��GPFN��

ԭ��

AIС��

�� 2023-12-19 21:58:17

7711

�� 2023-12-19 21:58:17

�ٱ�

��±��¼��ר��YOLO��սYOLO��ս

��ժҪ��YOLOv8�Ĳ�ݮ��⣬��EMAע��GPFN��ֱܷ��mAP0.5��ԭʼ��0.815��0.818��0.831

1.YOLOv8��

Ultralytics YOLOv8��Ultralytics��˾��YOLOĿ��ͼ��ָ�ģ�͵��°汾��YOLOv8��һ�ּ�˵ġ��Ƚ��ģ�SOTA��ģ�ͣ��ǰYOLO�ɹ��ϣ��¹��ܺ͸Ľ��Խ�һ��ܺ��ԡ��ڴ��ݼ��Ͻ��ѵ��ܹ��ڸ��Ӳ��ƽ̨��У��CPU��GPU��

��Ľ��£�

Backbone��ʹ�õ��CSP��˼�룬��YOLOv5�е�C3ģ�鱻�滻��C2fģ�飬ʵ��˽�һ��ͬʱYOLOv8��ʹ��YOLOv5�ȼܹ��ʹ�õ�SPPFģ�飻
PAN-FPN��YOLOv8��ʹ��PAN��˼�룬��ͨ��Ա�YOLOv5��YOLOv8�Ľṹͼ��Կ��YOLOv8��YOLOv5��PAN-FPN�ϲ��׶��еľ��ṹɾ��ˣ�ͬʱҲ��C3ģ��滻Ϊ��C2fģ�飻
Decoupled-Head��ǲ��ᵽ�˲�һ��ζ��ǵģ�YOLOv8��Decoupled-Head��
Anchor-Free��YOLOv8��Anchor-Base��ʹ��Anchor-Free��˼�룻
��ʧ����YOLOv8ʹ��VFL Loss��Ϊ��ʧ��ʹ��DFL Loss+CIOU Loss��Ϊ��ʧ��
��ƥ����YOLOv8��IOUƥ��ߵ��߱��ķ��䷽ʽ��ʹ��Task-Aligned Assignerƥ�䷽ʽ

2.��ݮ��ݼ��

��ݼ��Сһ��1450�ţ��

names: ['Angular Leafspot', 'Anthracnose Fruit Rot', 'Blossom Blight', 'Gray Mold', 'Leaf Spot', 'Powdery Mildew Fruit']

2.1��ݼ��

ͨ��split_train_val.py�õ�trainval.txt��val.txt��test.txt

# coding:utf-8
 
import os
import random
import argparse
 
parser = argparse.ArgumentParser()
#xml�ļ��ĵ�ַ�������Լ������ݽ����޸� xmlһ������Annotations��
parser.add_argument('--xml_path', default='Annotations', type=str, help='input xml label path')
#���ݼ��Ļ��֣���ַѡ���Լ������µ�ImageSets/Main
parser.add_argument('--txt_path', default='ImageSets/Main', type=str, help='output txt label path')
opt = parser.parse_args()
 
trainval_percent = 0.9
train_percent = 0.7
xmlfilepath = opt.xml_path
txtsavepath = opt.txt_path
total_xml = os.listdir(xmlfilepath)
if not os.path.exists(txtsavepath):
    os.makedirs(txtsavepath)
 
num = len(total_xml)
list_index = range(num)
tv = int(num * trainval_percent)
tr = int(tv * train_percent)
trainval = random.sample(list_index, tv)
train = random.sample(trainval, tr)
 
file_trainval = open(txtsavepath + '/trainval.txt', 'w')
file_test = open(txtsavepath + '/test.txt', 'w')
file_train = open(txtsavepath + '/train.txt', 'w')
file_val = open(txtsavepath + '/val.txt', 'w')
 
for i in list_index:
    name = total_xml[i][:-4] + '\n'
    if i in trainval:
        file_trainval.write(name)
        if i in train:
            file_train.write(name)
        else:
            file_val.write(name)
    else:
        file_test.write(name)
 
file_trainval.close()
file_train.close()
file_val.close()
file_test.close()

?

2.2 ͨ��voc_label.py�õ��ʺ�yolov8ѵ��Ҫ��

# -*- coding: utf-8 -*-
import xml.etree.ElementTree as ET
import pickle
import os
from os import listdir, getcwd
from os.path import join
sets = ['train','val','test']
classes = ['Angular Leafspot', 'Anthracnose Fruit Rot', 'Blossom Blight', 'Gray Mold', 'Leaf Spot', 'Powdery Mildew Fruit']

def convert(size, box):
    dw = 1. / size[0]
    dh = 1. / size[1]
    x = (box[0] + box[1]) / 2.0
    y = (box[2] + box[3]) / 2.0
    w = box[1] - box[0]
    h = box[3] - box[2]
    x = x * dw
    w = w * dw
    y = y * dh
    h = h * dh
    return (x, y, w, h)
def convert_annotation(image_id):
    in_file = open('Annotations/%s.xml' % (image_id))
    out_file = open('labels/%s.txt' % (image_id), 'w')
    tree = ET.parse(in_file)
    root = tree.getroot()
    size = root.find('size')
    w = int(size.find('width').text)
    h = int(size.find('height').text)
    for obj in root.iter('object'):
        difficult = obj.find('difficult').text
        cls = obj.find('name').text
        if cls not in classes or int(difficult) == 1:
            continue
        cls_id = classes.index(cls)
        xmlbox = obj.find('bndbox')
        b = (float(xmlbox.find('xmin').text), float(xmlbox.find('xmax').text), float(xmlbox.find('ymin').text),
             float(xmlbox.find('ymax').text))
        bb = convert((w, h), b)
        out_file.write(str(cls_id) + " " + " ".join([str(a) for a in bb]) + '\n')
wd = getcwd()
print(wd)
for image_set in sets:
    if not os.path.exists('labels/'):
        os.makedirs('labels/')
    image_ids = open('ImageSets/Main/%s.txt' % (image_set)).read().strip().split()
    list_file = open('%s.txt' % (image_set), 'w')
    for image_id in image_ids:
        list_file.write('images/%s.jpg\n' % (image_id))
        convert_annotation(image_id)
    list_file.close()
?

3.ѵ��

F1_curve.png��F1��Ŷȣ�x�ᣩ֮��Ĺ�ϵ��F1��Ƿ��һ��׼��Ǿ�ȷ�ʺ��ٻ��ʵĵ��ƽ��0��1֮�䡣Խ��Խ�á�

TP��ʵΪ�棬Ԥ��Ϊ�棻

FN��ʵΪ�棬Ԥ��Ϊ�٣�

FP��ʵΪ�٣�Ԥ��Ϊ�棻

TN��ʵΪ�٣�Ԥ��Ϊ�٣�

��ȷ�ʣ�precision��=TP/(TP+FP)

�ٻ��(Recall)=TP/(TP+FN)

F1=2*��ȷ��*�ٻ��ʣ�/��ȷ��+�ٻ��ʣ�

PR_curve.png ��PR��е�P��precision��׼�ʣ���R��recall��ٻ��ʣ���Ǿ�׼��ٻ��ʵĹ�ϵ��

4.�Ż��

4.1��EMAע��

��ӽṹ��˳��ʹ��ȡ��д��ԣ��EMAģ��в��EMA��ṹ��ͼ3 (b)��ʾ��ڱ��У��ǽ��EMA��ھ��в��ͨ��ά��ѧϰ��Ч��ͨ��Ϊ�߼��ͼ��õ��ؼ�ע��˵��ֻ��CAģ��ѡ��1x1��Ĺ��ǵ�EMA�н��Ϊ1x1��֧��Ϊ�˾ۺ϶�߶ȿռ�ṹ��Ϣ��3x3�ں��1x1��֧��з��ʵ�ֿ��Ӧ��ǽ��Ϊ3x3��֧��ǵ��Ͷ�߶Ƚṹ��Ч�ؽ��ںͳ��ڻ�ø��õ��ܡ�

��

mAP0.5��ԭʼ��0.815��0.818

4.2 ��GFPN

?FPNּ�ڶ�CNN�Ǹ��ȡ�Ĳ�ͬ�ֱ��ʵĶ�߶��ںϡ��ͼ��FPN�Ľ��FPN��PANet�ٵ�BiFPN��ע�⵽��ЩFPN�ܹ��۽��ںϣ�ȱ��˿��ӡ��ˣ��һ��µ�·��ں�GFPN��߶��ӣ��ͼd��

ʵ��

mAP0.5��ԭʼ��0.815��0.831

by CSDN AIС��

https://blog.csdn.net/m0_63774211/article/details/135094636

��ڲ��2023��Ѷ��ѵӪ��н��ģ��ҹϷִ󽱣�

ԭ��ϵ��Ȩ��Ѷ�ƿ��δ��ɣ��ת�ء�

��Ȩ��ϵ cloudcommunity@tencent.com ɾ��

2023��Ѷ��ѵӪ ��

ԭ��ϵ��Ȩ��Ѷ�ƿ��δ��ɣ��ת�ء�

��Ȩ��ϵ cloudcommunity@tencent.com ɾ��

2023��Ѷ��ѵӪ ��

��

��¼��

0 ��

�ȶ�

��