Datatyk

: Datatyk Co.Ltd; Datatyk; 27 August 2018; Hits: 10472

Microsoft Custom Vision AI - Object Detection

Tool and Service

Using Microsoft Custom Vision AI for training and testing

Type of Custom Vision AI: Object Detection with general category

What does Custom Vision Service do well?

The Custom Vision Service works best when the item you're trying to classify is prominent in your image.

To start the training model, the custom vision requires at least 15 images per class, 50 images per class are enough to start the prototype. In this project, we created 4 classes with around 100 images per class.

Custom Vision Service accepts training images in .jpg, .png, and .bmp format, up to 6 MB per image. (Prediction images can be up to 4 MB per image.) We recommend that images be 256 pixels on the shortest edge. Any images shorter than 256 pixels on the shortest edge are scaled up by Custom Vision Service.

With an image, we can have multiclass objects in.

Train datasets

To demo, we have 4 products for training and testing

Class	Train Image quantity	Test image quantity	Split Ratio	Type
Bia 333	111	22	80/20	JPEG
Bia Heineken	108	22	80/20	JPEG
Dau an Neptune	104	21	80/20	JPEG
Dau an Simply	106	21	80/20	JPEG

The input parameters

Threshold Type	Threshold Value	The meaning of threshold
Probability Threshold	50%	This 50% is the average accuracy score that gives the balance result of Precision and Recall values
Overlap Threshold	30%	This is calculated based on the regions of custom vision suggest and user’s drawing, if we increase this value, it may not exclude more image, then the Precision and Recall values will be decreased

Duration of training: 6 minutes

To validate the result, Microsoft Custom Vision AI is using a process called k-fold cross validation.

Result of training

The result after training with the above datasets

Test and retrain a model

Test an image

From the test result, we can change the classified result to use for the next train, it will make the model more accuracy.

How to improve the train dataset

First-round training
Add more images and balance data
Retrain
Add images with varying background, lighting, object size, camera angle, and style
Retrain & feed in image for prediction
Examine prediction results
Modify existing training data

Train Dataset Suggestion

Base on the type of object, we should choose the object that is easy to recognize what is that object. With above datasets, we suggest focusing on the logo of product, that would be used to distinguish the difference among equivalent objects

End-to-End solution

This is a general proposed solution with custom vision service.

Demo

Reference

Tags: Microsoft Custom Vision AI, Computer Vision

: Datatyk Co.Ltd; Datatyk; 27 August 2018; Hits: 7724

Nhận dạng sản phẩm dùng Google AutoML Vision

Mục đích

Phân lớp hình ảnh dựa trên tập hình ảnh đã được gán nhãn.

AutoML Vision hoạt động như thế nào?

Bước chuẩn bị tập dữ liệu huấn luyện:

AutoML Vision cho phép chúng ta có thể huấn luyện mô hình máy học để phân lớp hình ảnh theo tập nhãn đã được định nghĩa trước.

Yêu cầu tối thiểu của AutoML Vision là 10 hình/nhãn. Khả năng phân lớp thành công phụ thuộc vào số lượng hình ảnh có độ phân giải cao. Google khuyên cáo nên dùng ít nhất 100 hình/ nhãn để đạt được kết quả tốt. Các định dạng ảnh hỗ trợ bao gồm JPEG, PNG, WEBP GIF, BMP, TIFF, ICO với dung lượng tối đa 30MB cho tập huấn luyện và định dạng JPEG, PNG, GIF với dung lượng tối đa 1.5MB cho hình ảnh dùng để dự đoán.

Hình ảnh có thể được gán 1 hoặc nhiều nhãn tại thời điểm tạo bộ dữ liệu (dataset) huấn luyện

Huấn luyện mô hình máy học:

Mặc định, AutoML Vision sẽ chia bộ dữ liệu thành 3 tập:

TRAIN - 80% được sử dụng để huấn luyện.
VALIDATION - 10% được sử dụng để lựa chọn parameter phù hợp để tối ưu hóa thuật toán hoặc/và quyết định khi nào sẽ kết thúc quá trình huấn luyện.
TEST - 10% được sử dụng để kiểm tra model.

Google sử dụng parameter “computer hours” để cải thiện độ chính xác của model với giá như sau:

1 computer hour: miễn phí 10 model trong 1 tháng
N computer hour: 1 < n<=24, 20$/hour, với điều kiện bộ dữ liệu lớn hơn 1000 hình

Trong giai đoạn training, Google dùng ngưỡng mặc định (score threshold) là 0.5 để kiểm tra độ chính xác của mô hình. Với ngưỡng 0.5, kết quả Precision và Recall như sau

Đánh giá mô hình huấn luyện:

Sau khi training, chúng ta có thể hiệu chỉnh score threshold để xem sự thay đổi của Precision và Recall và quyết định giá trị score threshold phù hợp với bài toán.

AutoML hiển thị confusion matrix cho chúng ta biết lớp nào được phân loại đúng nhiều nhất và dữ liệu thuộc lớp nào thường bị phân loại nhầm vào lớp khác

Transfer learning: với bộ dữ liệu trên 1000 hình ảnh, chúng ta có thể sử dụng kết quả vừa train để train model mới với độ chính xác cao hơn.

Kiểm tra mô hình huấn luyện:

Trên Google Cloud Portal

Với Python API

Giải pháp đề xuất

Demo

Tài liệu tham khảo:

https://cloud.google.com/vision/automl/docs/

Tags: Google AutoML Vision , nhận dạng, Computer Vision

: Datatyk Co.Ltd; Datatyk; 10 July 2018; Hits: 6155

Phân tích dữ liệu trong bảo hiểm

Tags: insurance, bảo hiểm

Datatyk

Microsoft Custom Vision AI - Object Detection

Tool and Service

What does Custom Vision Service do well?

Train datasets

Result of training

Test and retrain a model

How to improve the train dataset

Train Dataset Suggestion

End-to-End solution

Demo

Reference

Nhận dạng sản phẩm dùng Google AutoML Vision

Mục đích

AutoML Vision hoạt động như thế nào?

Bước chuẩn bị tập dữ liệu huấn luyện:

Huấn luyện mô hình máy học:

Đánh giá mô hình huấn luyện:

Kiểm tra mô hình huấn luyện:

Giải pháp đề xuất

Demo

Tài liệu tham khảo:

Phân tích dữ liệu trong bảo hiểm

Subcategories

Blog

Video

Solutions

Address

Usefull Links

Map

Social

DataTyk