Video fingerprinting tool for finding duplicate movies in a large dataset.
Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition
The world's simplest facial recognition API for Python and the command line
The FFmpeg build script provides an easy way to build a static FFmpeg binary on OS X and Linux with non-free codecs included.
This repository collects recent state-of-the-art deep learning algorithms for video and image enhancement, including super-resolution, compression artifact reduction, deblocking, denoising, image/color enhancement, and HDR.
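pytorch handbook is an open-source book that aims to help anyone who wants to use PyTorch for deep learning development and research get started quickly. All of the PyTorch tutorials it contains have been tested to ensure they run successfully.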
CNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.