Research on handwriting analysis, object tracking and segmentation based on machine learning (b194dc-DocTraSeg)

Internally funded project

Acronym: b194dc-DocTraSeg

Start date : 16.11.2023

End date : 01.05.2026

Overview

Publications

(5)

Project details

Scientific Abstract

This project investigates two computer vision tasks: 1. Handwriting analysis; 2. Object tracking and segmentation.

Handwriting document analysis aims to evaluate and recognize the handwritten manuscripts according to different intentions, such as text recognition, spotting, layout analysis, text alignment, and writer recognition. As an important issue in the first step of digitizing scanned documents, this project will focus on layout analysis and line segmentation.

Object tracking and segmentation aims at continuously estimating the state of an object based on a given bounding box extracted by a simple rectangle/mask from the initial frame of a video sequence. It is widely applied in various applications such as surveillance, autonomous driving, human-computer interaction, etc. Despite the progress made so far, its main challenge lies in the limited discriminative power of the classifiers. Also, it is prone to the introduced endless distractors in real-world surveillance applications. For example, Siamese trackers dominate single-object-tracking field. Their balanced tracking paradigm coupled with fast inference speed and relatively high performance has caught the researchers’ attention. However, Siamese trackers mostly rely on large dataset offline training to learn the general representative capability for an arbitrary given target object. This ignores the target context relationship from adjacent frames. In addition, both CNNs and ViTs are used as feature extractor while the interaction between the local fine-grained and global coarse representation is still unexplored. This project will investigate state-of-the-art algorithms for achieving accurately and stably object tracking and segmentation.

Involved:

Fei Wu Project Leader

Contributing FAU Organisations:

Lehrstuhl für Informatik 5 (Mustererkennung)