Mathpix 的开源免费替代工具,支持将图片转换成可编辑的文本表示,支持80多种语言。可识别 PDF 或图像中的复杂版面、表格、数学公式和文本,并将它们合并转换为 Markdown 格式。
Coin-CLIP: fine-tuned with a vast collection of coin images from CLIP using contrastive learning. It enhances feature extraction for coins, boosting image search accuracy. This model merges Visual Transformer (ViT) with CLIP's multimodal learning, optimized for numismatic applications.