Learn A Compression for Objection Detection - VAE with a Bridge

Recent advances in sensor technology and wide deployment of visual sensors lead to a new application whereas compression of images are not mainly for pixel recovery for human consumption, instead it is for communication to cloud side machine vision tasks like classification, identification, detectio...

Full description

Saved in:
Bibliographic Details
Published in:Visual communications and image processing (Online) pp. 1 - 5
Main Authors: Mei, Yixin, Li, Fan, Li, Li, Li, Zhu
Format: Conference Proceeding
Language:English
Published: IEEE 05.12.2021
Subjects:
ISSN:2642-9357
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Recent advances in sensor technology and wide deployment of visual sensors lead to a new application whereas compression of images are not mainly for pixel recovery for human consumption, instead it is for communication to cloud side machine vision tasks like classification, identification, detection and tracking. This opens up new research dimensions for a learning based compression that directly optimizes loss function in vision tasks, and therefore achieves better compression performance vis-a-vis the pixel recovery and then performing vision tasks computing. In this work, we developed a learning based compression scheme that learns a compact feature representation and appropriate bitstreams for the task of visual object detection. Variational Auto-Encoder (VAE) framework is adopted for learning a compact representation, while a bridge network is trained to drive the detection loss function. Simulation results demonstrate that this approach is achieving a new state-of-the-art in task driven compression efficiency, compared with pixel recovery approaches, including both learning based and handcrafted solutions.
ISSN:2642-9357
DOI:10.1109/VCIP53242.2021.9675387