How to speed Connected Component Labeling up with SIMD RLE algorithms (WPMVP 2020 - Workshop on Programming Models for SIMD/Vector Processing)

Sat 22 - Wed 26 February 2020 San Diego, California, United States

Who

Florian Lemaitre, Arthur Hennequin, Lionel Lacassagne

Track

WPMVP 2020

Abstract

The research in Connected Component Labeling, although old, is still very active and several efficient algorithms for CPUs and GPUs have emerged during the last years and are always improving the performance. This article introduces a new SIMD run-based algorithm for CCL. We show how RLE compression can be SIMDized and used to accelerate scalar run-based CCL algorithms. A benchmark done on Intel, AMD and ARM processors shows that this new algorithm outperforms the State-of-the-Art by an average factor of x1.7 on AVX2 machines and x1.9 on Intel Xeon Skylake with AVX512.

How to speed Connected Component Labeling up with SIMD RLE algorithms

Florian Lemaitre

Sorbonne University — LIP6

France

Arthur Hennequin

LIP6 Sorbonne University + CERN

Lionel Lacassagne

Sorbonne University — LIP6

France

Tracks

Workshops