monicaarnaud's Projects
I put mining related articles here as references for my writings.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Config files for my GitHub profile.
Curated list of project-based tutorials
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
The simplest, fastest repository for training/finetuning medium-sized GPTs.
HashLips Art Engine is a tool used to create multiple different instances of artworks based on provided layers.
Implementation of Nougat Neural Optical Understanding for Academic Documents
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
Get up and running with Llama 2 and other large language models locally
Examples and guides for using the OpenAI API
如何快速开发一个OpenAI/GPT应用:国内开发者笔记
Robust Speech Recognition via Large-Scale Weak Supervision
Open Multilingual Chatbot for Everyone
An effective and flexible tool for data annotation
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。提供各种语言API。由 PaddleOCR C++ 编译。
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
🤖️ 桌面端AI语言练习应用
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
A repository of scripts that helps to automate daily tasks
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.
A cross platform OCR Library based on PaddleOCR & OnnxRuntime & OpenVINO.
Sync untimed transcripts with Youtube auto-generated captions