DL之Mask R-CNN:2018.6.26世界杯阿根廷队VS尼日利亚比赛2:1实现Mask R-CNN目标检测
vs 实现 检测 目标 26 CNN 比赛 DL
2023-09-14 09:04:47 时间
DL之Mask R-CNN:2018.6.26世界杯阿根廷队VS尼日利亚比赛2:1实现Mask R-CNN目标检测
目录
输出结果
先上目标检测结果
人身检测
观众席:众里寻他千百度,蓦然回首,那人却在,灯火阑珊处。藏得再深也能找到你……
核心代码
import os
import sys
import random
import math
import numpy as np
import skimage.io
import matplotlib
import matplotlib.pyplot as plt
# Root directory of the project
ROOT_DIR = os.path.abspath("../")
# Import Mask RCNN
sys.path.append(ROOT_DIR) # To find local version of the library
from mrcnn import utils
import mrcnn.model as modellib
from mrcnn import visualize
# Import COCO config
sys.path.append(os.path.join(ROOT_DIR, "samples/coco/")) # To find local version
import coco
# Directory to save logs and trained model
MODEL_DIR = os.path.join(ROOT_DIR, "logs")
# Local path to trained weights file
COCO_MODEL_PATH = os.path.join(ROOT_DIR, "mask_rcnn_coco.h5")
# Download COCO trained weights from Releases if needed
if not os.path.exists(COCO_MODEL_PATH):
utils.download_trained_weights(COCO_MODEL_PATH)
# Directory of images to run detection on
IMAGE_DIR = os.path.join(ROOT_DIR, "images01")
class InferenceConfig(coco.CocoConfig):
# Set batch size to 1 since we'll be running inference on
# one image at a time. Batch size = GPU_COUNT * IMAGES_PER_GPU
GPU_COUNT = 1
IMAGES_PER_GPU = 1
config = InferenceConfig()
config.display()
#Create Model and Load Trained Weights
# Create model object in inference mode.
model = modellib.MaskRCNN(mode="inference", model_dir=MODEL_DIR, config=config)
# Load weights trained on MS-COCO
model.load_weights(COCO_MODEL_PATH, by_name=True)
Configurations:
BACKBONE resnet101
BACKBONE_STRIDES [4, 8, 16, 32, 64]
BATCH_SIZE 1
BBOX_STD_DEV [0.1 0.1 0.2 0.2]
COMPUTE_BACKBONE_SHAPE None
DETECTION_MAX_INSTANCES 100
DETECTION_MIN_CONFIDENCE 0.7
DETECTION_NMS_THRESHOLD 0.3
FPN_CLASSIF_FC_LAYERS_SIZE 1024
GPU_COUNT 1
GRADIENT_CLIP_NORM 5.0
IMAGES_PER_GPU 1
IMAGE_MAX_DIM 1024
IMAGE_META_SIZE 93
IMAGE_MIN_DIM 800
IMAGE_MIN_SCALE 0
IMAGE_RESIZE_MODE square
IMAGE_SHAPE [1024 1024 3]
LEARNING_MOMENTUM 0.9
LEARNING_RATE 0.001
LOSS_WEIGHTS {'rpn_class_loss': 1.0, 'rpn_bbox_loss': 1.0, 'mrcnn_class_loss': 1.0, 'mrcnn_bbox_loss': 1.0, 'mrcnn_mask_loss': 1.0}
MASK_POOL_SIZE 14
MASK_SHAPE [28, 28]
MAX_GT_INSTANCES 100
MEAN_PIXEL [123.7 116.8 103.9]
MINI_MASK_SHAPE (56, 56)
NAME coco
NUM_CLASSES 81
POOL_SIZE 7
POST_NMS_ROIS_INFERENCE 1000
POST_NMS_ROIS_TRAINING 2000
ROI_POSITIVE_RATIO 0.33
RPN_ANCHOR_RATIOS [0.5, 1, 2]
RPN_ANCHOR_SCALES (32, 64, 128, 256, 512)
RPN_ANCHOR_STRIDE 1
RPN_BBOX_STD_DEV [0.1 0.1 0.2 0.2]
RPN_NMS_THRESHOLD 0.7
RPN_TRAIN_ANCHORS_PER_IMAGE 256
STEPS_PER_EPOCH 1000
TOP_DOWN_PYRAMID_SIZE 256
TRAIN_BN False
TRAIN_ROIS_PER_IMAGE 200
USE_MINI_MASK True
USE_RPN_ROIS True
VALIDATION_STEPS 50
WEIGHT_DECAY 0.0001
Processing 1 images
image shape: (506, 900, 3) min: 0.00000 max: 255.00000 uint8
molded_images shape: (1, 1024, 1024, 3) min: -123.70000 max: 151.10000 float64
image_metas shape: (1, 93) min: 0.00000 max: 1024.00000 float64
anchors shape: (1, 261888, 4) min: -0.35390 max: 1.29134 float32
相关文章
- 一文吃透 VS Code+Git 操作(vs code中git的相关配置与使用)
- VS code安装和使用技巧
- vs2017注释快捷键设置_vs注释多行代码快捷键
- Java vs Kotlin
- X86 vs ARM 架构同台竞技: 生物大数据大规模并行计算(如何将WGS全基因组计算成本降到1美元)
- Winform Vs Installer之添加自定义安装流程
- DeepMind VS Meta:实现纳什均衡理性最优解,还是多人非零和博弈算法更强大?
- 比较MySQL vs. Other Databases: A Comparison(mysql两者之间)
- 系统苹果系统VS Linux: 比较和选择(苹果系统与linux)
- 深度比较:Oracle vs. SAP HANA(oraclehana)
- 如何编写 VS Code 扩展
- 硬核观察 | 微软的 VS Code 有 1400 万用户,而全球开发者才 2400 万
- 谁能驾驭马赛克?微软AI打码手艺 VS 谷歌AI解码绝活儿
- VS链接MSSQL搭建本地数据库环境(vs链接本地mssql)
- VS连接SQL Server:让你从此走向数据库的新世界(vs连sqlserver)
- VS 开发 SQL Server 数据库应用实践(vs的sqlserver)
- 服务【VS帮助快速开启MSSQL服务】(vs开启mssql)
- C语言实现MySQL数据库连接的秘诀(c vs连接mysql)
- 比较C和MySQL 略显不均之下的选择(c++ mysql vs)
- VS下使用Redis数据库体验更佳数据交互性能(vs访问redis数据库)
- 比较VS Redis教程,学习新技能(vs redis教程)
- Oracle与VS插件的竞技之战(oracle vs插件)
- 桌面锐龙7 5700G VS.移动锐龙9 5900X 性能不相上下
- vs.net控件updatePanel实现无刷新的方法