UPDF AI

Skeleton-based Generative Adversarial Networks and Novel Evaluation Metrics for Font Structural Style Transfer

Thanaphon Thanusan,K. Patanukhom

2025 · DOI: 10.37936/ecti-cit.2025193.260297
ECTI Transactions on Computer and Information Technology · 0 Citations

TLDR

This work presents a novel approach for font style transfer using Generative Adversarial Networks (GANs) to enhance the scene text editing process, enabling text editing from any characters to any characters, including cross-language editing.

Abstract

We present a novel approach for font style transfer using Generative Adversarial Networks (GANs) to enhance the scene text editing process, enabling text editing from any characters to any characters, including cross-language editing. Our GAN model utilizes pairs of sample images of the target font style and the corresponding skeleton-based features to learn their key structural details without relying on pre-trained models. Once the generator is trained, it can transform any character from the base font style to the target font style. Our approach offers the flexibility to select a base font similar to the target font for enhancing results and the ability to manipulate the stroke width of the output text. Additionally, in few-shot scenarios, we introduce a double generator scheme that integrates other existing methods with our approach. In this work, we also introduce two new evaluation metrics: Difference in Histogram of Oriented Gradients and Stroke Width Similarity. Our experimental results demonstrate that the proposed evaluation metrics can better measure font style similarity with greater robustness compared to conventional metrics. We evaluate the performance of our GAN model on style transfer for six target fonts and real scene text editing tasks, comparing it with existing methods. Our approach provides better structural similarity, readability, and visual appeal than other methods, especially for generating unseen characters.

Cited Papers
Citing Papers