AI & ML interests

Computer Vision Technology and Data Collection for Anime Waifu

Who We Are

We are a team focused on building infrastructure for anime data information, including images, text, audiovisuals, and more.

Our goal is to automate all common processes for handling anime information, including data acquisition, data filtering, training, step selection, and platform deployment, in order to save manpower and optimally balance quality and performance requirements.

Our team comprises a Ph.D. in Software Engineering, a Ph.D. candidate in Computer Vision, professionals in art and design, and several AI waifu enthusiasts.

We are a purely non-profit team, and all our work is completely open, without any form of charge.

Our Technical Outputs

dghs-imgutils

Project Link: https://github.com/deepghs/imgutils

Project Documentation: https://deepghs.github.io/imgutils/main/index.html

This is a library for various common operations on anime images, including but not limited to:

  • Tachie (Difference) Detection and Clustering
  • Contrastive Character Image Pretraining
  • Object Detection
  • Edge Detection / Lineart Generation
  • Monochrome Image Detection
  • Truncated Image Check
  • Image Tagging
  • Character Extraction

Check out the documentation for more features.

Waifuc

Project Link: https://github.com/deepghs/waifuc

A data pipeline framework based on dghs-imgutils, supporting:

  • Fast data retrieval (local disk, danbooru, pixiv, zerochan, etc.)
  • Swift data filtering (comic exclusion, monochrome image exclusion, multi-character image exclusion, irrelevant character exclusion, etc.)
  • Rapid data saving (local, cloud; with metadata, saved in stable diffusion dataset format, etc.)
  • Quick building of processing pipelines (connecting multiple aforementioned stages)

Note: This tool is currently a work in progress, although it's in use. It hasn't been released on PyPI and lacks comprehensive documentation. These aspects will be addressed soon.

Model Zoo

We manage our models and datasets on Huggingface: https://huggingface.co/deepghs

Anything More?

In fact, our plans go beyond what's mentioned here. Other tools are continuously improving and will soon be released. Stay tuned!