Multi-label Video Classification for Underwater Ship Inspection

Abstract

Today ship hull inspection including the examination of the external coating, detection of defects, and other types of external degradation such as corrosion and marine growth is conducted underwater by means of Remotely Operated Vehicles (ROVs). The inspection process consists of a manual video analysis which is a time-consuming and labor-intensive process. To address this, we propose an automatic video analysis system using deep learning and computer vision to improve upon existing methods that only consider spatial information on individual frames in underwater ship hull video inspection. By exploring the benefits of adding temporal information and analyzing frame-based classifiers, we propose a multi-label video classification model that exploits the self-attention mechanism of transformers to capture spatiotemporal attention in consecutive video frames. Our proposed method has demonstrated promising results and can serve as a benchmark for future research and development in underwater video inspection applications.

Client

Research Council of Norway (RCN) / 317854

Language

English

Author(s)

Md Abulkalam Azad
Ahmed Kedir Mohammed
Maryna Waszak
Brian Elvesæter
Martin Ludvigsen

Affiliation

Norwegian University of Science and Technology
SINTEF Digital / Smart Sensors and Microsystems
University of Toulon
SINTEF Digital / Sustainable Communication Technologies

Year

2023

Publisher

IEEE conference proceedings

Book

IEEE OCEANS 2023 LIMERICK

ISBN

979-8-3503-3226-1

DOI

https://doi.org/10.1109/oceanslimerick52467.2023.10244578

View this publication at Cristin

Contact us

Our services

Career

Sustainability

Management and board

Institutes

Other units

About us

Follow us