目标检测第5步-使用keras版YOLOv3训练

潇洒坤

发布于 2019-03-15 16:22:45

2.4K10

代码可运行

文章被收录于专栏：简书专栏简书专栏

运行总次数：0

代码可运行

致谢声明

1.本文学习Patrick_Lxc的博客《Keras/Tensorflow+python+yolo3训练自己的数据集》并优化其中代码。

原博客链接：https://blog.csdn.net/Patrick_Lxc/article/details/80615433

2.本文学习qqwweee的github工程《keras-yolo3》并优化其中代码。

github链接：https://github.com/qqwweee/keras-yolo3

0.配置代码运行环境

0.1 硬件配置要求

YOLOv3对于电脑的显卡要求高，根据本文作者的经验，至少需要8GB的显存才能继续本文下面的实验。

只有Nvidia品牌的显卡可以运行深度学习，AMD品牌的显卡不可以运行深度学习。

那么Nvidia品牌具有8GB显存的最低价格显卡的型号为GTX1070。

2019年2月28日查询，京东上原装GTX1070的价格如下图所示：

Screenshot from 2019-02-21 17-15-49.png

2019年2月28日查询，淘宝上网吧二手拆机显卡GTX1070的价格为1800元左右，如下图所示：

Screenshot from 2019-02-21 17-21-07.png

如果购买资金充足，建议购买GTX1080Ti，11G显存可以运行准确率更高的YOLOv3-spp算法模型。

如果购买资金不足，最少也得购买GTX1070，否则无法继续本文下面的实验。

如果没有nvidia显卡，但有visa信用卡，请阅读我的另一篇文章《在谷歌云服务器上搭建深度学习平台》，链接：https://www.jianshu.com/p/893d622d1b5a

0.2 软件配置要求

各种软件版本：

Ubuntu ：16.04

Anaconda ：5.2

python ： 3.6

CUDA ：9.0

cudnn ：7.3

tensorflow_gpu ：1.10

Keras ：2.2.4

如果有显存为8G的显卡，在Windows操作系统下也无法运行本文下面的代码。会报错"显存不足"。

所以读者需要安装Ubuntu操作系统，建议选择Ubuntu16.04，制作系统U盘后非常容易安装。

如果有显存为11G的显卡，在Windows操作系统下可以继续本文下面的实验。

有显卡之后需要配置深度学习环境，请阅读我的另一篇文章《深度学习环境搭建-CUDA9.0、cudnn7.3、tensorflow_gpu1.10的安装》，链接：https://cloud.tencent.com/developer/article/1371243

1.数据准备

1.1 数据下载

如果读者有自己已经使用labelImg软件标注好的数据，可以直接跳到1.4节图片压缩。

本文作者给读者演示的图片数据是来自ImageNet中的鲤鱼分类。

数据集压缩文件下载链接: https://pan.baidu.com/s/1NksESNqBX--YqMJ4zptGdw 提取码: 6p3u

本文作者在桌面中创建文件夹keras_YOLOv3，并将下载好的数据集压缩文件n01440764.tar放到其中，如下图所示：

image.png

在文件夹keras_YOLOv3中鼠标右击，在显示的菜单中选择Open in Terminal，即在文件夹keras_YOLOv3中打开Terminal。

作为合格的Ubuntu系统使用者，要求会使用终端Terminal中的命令完成操作。

运行命令mkdir n01440764创建文件夹n01440764。

运行命令tar -xvf n01440764.tar -C n01440764完成压缩文件的解压，命令其中的-C参数后面必须为已经存在的文件夹，否则运行命令会报错。

解压完成后，文件夹和终端Terminal中的情况如下图所示：

image.png

1.2 在Ubuntu中安装软件labelImg

需要使用软件labelImg做图片的数据标注。

软件labelImg的下载地址：https://github.com/tzutalin/labelImg，页面如下图所示：

image.png

在文件夹keras_YOLOv3中打开Terminal，运行下列命令：

1.加快apt-get命令的下载速度，需要做Ubuntu系统的换源。方法如下：

在Ubuntu的设置Settings中选择Software & Updates，将Download from的值设置为http://mirrors.aliyun.com/ubuntu，如下图所示：

image.png

2.运行命令wget https://codeload.github.com/tzutalin/labelImg/zip/master -O labelImg-master.zip从github上下载labelImg的源码压缩文件。

3.运行命令unzip labelImg-master.zip完成压缩文件的解压。

4.运行命令sudo apt-get install pyqt5-dev-tools安装软件pyqt5-dev-tools。

5.运行命令cd labelImg-master进入文件夹labelImg-master。

6.运行命令pip install -r requirements/requirements-linux-python3.txt安装软件labelImg运行时需要的库，如果已经安装Anaconda此步可能不用进行。如果pip下载库的速度慢，请查看我的另外一篇文章《pip换源》，

链接：https://cloud.tencent.com/developer/article/1331861

7.运行命令make qt5py3编译产生软件labelImg运行时需要的组件。

8.运行命令python labelImg.py运行代码文件labelImg.py，运行结果如下图所示：

image.png

1.3 获取像素足够的图片

新建一个代码文件generate_qualified_images.py或generate_qualified_images.ipynb，将下面一段代码复制到其中。

运行代码可以获取文件夹n01440764中的200张像素足够的图片，存放在文件夹selected_images中。

因为文件夹n01440764中有一部分图片像素不足416 * 416，不利于模型训练，所以本节内容有必要进行。

import os
import random
from PIL import Image
import shutil

#获取文件夹中的文件路径
def getFilePathList(dirPath, partOfFileName=''):
    allFileName_list = list(os.walk(dirPath))[0][2]
    fileName_list = [k for k in allFileName_list if partOfFileName in k]
    filePath_list = [os.path.join(dirPath, k) for k in fileName_list]
    return filePath_list

#获取一部分像素足够，即长，宽都大于416的图片
def generate_qualified_images(dirPath, sample_number, new_dirPath):
    jpgFilePath_list = getFilePathList(dirPath, '.JPEG')
    random.shuffle(jpgFilePath_list)
    if not os.path.isdir(new_dirPath):
        os.makedirs(new_dirPath)
    i = 0
    for jpgFilePath in jpgFilePath_list:
        image = Image.open(jpgFilePath)
        width, height = image.size
        if width >= 416 and height >= 416:
            i += 1
            new_jpgFilePath = os.path.join(new_dirPath, '%03d.jpg' %i)
            shutil.copy(jpgFilePath, new_jpgFilePath)
        if i == sample_number:
            break

#获取数量为100的合格样本存放到selected_images文件夹中
generate_qualified_images('n01440764', 200, 'selected_images')

1.4 数据标注

数据标注是一件苦力活，本文作者标记200张图片花费90分钟左右。

本节演示单张图片的标注。

如下图红色箭头标记处所示，打开数据集文件夹。

image.png

如下图红色箭头标记处所示，选择文件夹keras_YOLOv3中的文件夹selected_images，选中后点击如下图下方红色箭头标记处所示的Choose按钮。

image.png

如下图红色箭头标记处所示，给这张图标记了2个物体：人脸human_face、鱼fish

在软件labelImg界面中，按w键即可开始标记物体。

image.png

本文作者标注好200张图片，上传到百度云盘，便于读者直接复现。

下载链接: https://pan.baidu.com/s/1-bZ5B5JKFB7R6aWUPBdP_w 提取码: 9rjg

1.5 检查标注数据

新建一个代码文件check_annotations.py或check_annotations.ipynb，将下面一段代码复制到其中。

运行代码可以完成2个检查功能：

1.检查代码检查标记好的文件夹是否有图片漏标

2.此段代码检查标记的xml文件中是否有物体标记类别拼写错误

#获取文件夹中的文件路径
import os
def getFilePathList(dirPath, partOfFileName=''):
    allFileName_list = list(os.walk(dirPath))[0][2]
    fileName_list = [k for k in allFileName_list if partOfFileName in k]
    filePath_list = [os.path.join(dirPath, k) for k in fileName_list]
    return filePath_list

#此段代码检查标记好的文件夹是否有图片漏标
def check_1(dirPath):
    jpgFilePath_list = getFilePathList(dirPath, '.jpg')
    allFileMarked = True
    for jpgFilePath in jpgFilePath_list:
        xmlFilePath = jpgFilePath[:-4] + '.xml'
        if not os.path.exists(xmlFilePath):
            print('%s this picture is not marked.' %jpgFilePath)
            allFileMarked = False
    if allFileMarked:
        print('congratulation! it is been verified that all jpg file are marked.')

#此段代码检查标记的xml文件中是否有物体标记类别拼写错误        
import xml.etree.ElementTree as ET
def check_2(dirPath, className_list):
    className_set = set(className_list)
    xmlFilePath_list = getFilePathList(dirPath, '.xml')
    allFileCorrect = True
    for xmlFilePath in xmlFilePath_list:
        with open(xmlFilePath) as file:
            fileContent = file.read()
        root = ET.XML(fileContent)
        object_list = root.findall('object')
        for object_item in object_list:
            name = object_item.find('name')
            className = name.text
            if className not in className_set:
                print('%s this xml file has wrong class name "%s" ' %(xmlFilePath, className))
                allFileCorrect = False
    if allFileCorrect:
        print('congratulation! it is been verified that all xml file are correct.')

if __name__ == '__main__':
    dirPath = 'selected_images'
    className_list = ['fish', 'human_face']
    check_1(dirPath)
    check_2(dirPath, className_list)

1.6 图像压缩

预先压缩好图像，模型训练时不用再临时改变图片大小，或许可以加快模型训练速度。

新建一个代码文件compress_images.py或compress_images.ipynb，将下面一段代码复制到其中。

运行代码可以完成2个功能：

1.将旧文件夹中的jpg文件压缩后放到新文件夹中。

2.将旧文件夹中的jpg文件对应的xml文件修改后放到新文件夹中。

#获取文件夹中的文件路径
import os
def getFilePathList(dirPath, partOfFileName=''):
    allFileName_list = list(os.walk(dirPath))[0][2]
    fileName_list = [k for k in allFileName_list if partOfFileName in k]
    filePath_list = [os.path.join(dirPath, k) for k in fileName_list]
    return filePath_list

#生成新的xml文件
import xml.etree.ElementTree as ET
def generateNewXmlFile(old_xmlFilePath, new_xmlFilePath, new_size):
    new_width, new_height = new_size
    with open(old_xmlFilePath) as file:
        fileContent = file.read()
    root = ET.XML(fileContent)
    #获得图片宽度变化倍数，并改变xml文件中width节点的值
    width = root.find('size').find('width')
    old_width = int(width.text)
    width_times = new_width / old_width
    width.text = str(new_width)
    #获得图片高度变化倍数，并改变xml文件中height节点的值
    height = root.find('size').find('height')
    old_height = int(height.text)
    height_times = new_height / old_height
    height.text = str(new_height)
    #获取标记物体的列表，修改其中xmin,ymin,xmax,ymax这4个节点的值
    object_list = root.findall('object')
    for object_item in object_list:
        bndbox = object_item.find('bndbox')
        xmin = bndbox.find('xmin')
        xminValue = int(xmin.text)
        xmin.text = str(int(xminValue * width_times))
        ymin = bndbox.find('ymin')
        yminValue = int(ymin.text)
        ymin.text = str(int(yminValue * height_times))
        xmax = bndbox.find('xmax')
        xmaxValue = int(xmax.text)
        xmax.text = str(int(xmaxValue * width_times))
        ymax = bndbox.find('ymax')
        ymaxValue = int(ymax.text)
        ymax.text = str(int(ymaxValue * height_times))
    tree = ET.ElementTree(root)
    tree.write(new_xmlFilePath)
    
#修改文件夹中的若干xml文件
def batch_modify_xml(old_dirPath, new_dirPath, new_size):
    xmlFilePath_list = getFilePathList(old_dirPath, '.xml')
    for xmlFilePath in xmlFilePath_list:
        xmlFileName = os.path.split(xmlFilePath)[1]
        new_xmlFilePath = os.path.join(new_dirPath, xmlFileName)
        generateNewXmlFile(xmlFilePath, new_xmlFilePath, new_size)
        
#生成新的jpg文件
from PIL import Image
def generateNewJpgFile(old_jpgFilePath, new_jpgFilePath, new_size):
    old_image = Image.open(old_jpgFilePath)
    new_image = old_image.resize(new_size, Image.ANTIALIAS)
    new_image.save(new_jpgFilePath)
    
#修改文件夹中的若干jpg文件
def batch_modify_jpg(old_dirPath, new_dirPath, new_size):
    if not os.path.isdir(new_dirPath):
        os.makedirs(new_dirPath)
    xmlFilePath_list = getFilePathList(old_dirPath, '.xml')
    for xmlFilePath in xmlFilePath_list:
        old_jpgFilePath = xmlFilePath[:-4] + '.jpg'
        jpgFileName = os.path.split(old_jpgFilePath)[1]
        new_jpgFilePath = os.path.join(new_dirPath, jpgFileName)
        generateNewJpgFile(old_jpgFilePath, new_jpgFilePath, new_size)
        
if __name__ == '__main__':
    old_dirPath = 'selected_images'
    new_width = 416
    new_height = 416
    new_size = (new_width, new_height)
    new_dirPath = 'images_%sx%s' %(str(new_width), str(new_height))
    batch_modify_jpg(old_dirPath, new_dirPath, new_size)
    batch_modify_xml(old_dirPath, new_dirPath, new_size)

标注好的200张压缩后的图片，上传到百度云盘，便于读者直接复现。

链接: https://pan.baidu.com/s/121Vh8gzDElpzz190kbdUOw 提取码: a2tb