microsoft / OmniParser
A simple screen-parsing tool toward a pure-vision-based GUI agent
See what the GitHub community is most excited about this month.
🪄 Create rich visualizations with AI
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Truly independent web browser
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🚀🚀 "Large model": train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Vision agent
Master programming by recreating your favorite technologies from scratch.
Automate the process of making money online.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Build your own AI friend
An AI Hedge Fund Team
Docmost is an open-source collaborative wiki and documentation software. It is an open-source alternative to Confluence and Notion.
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Generate high-definition short videos with one click using large AI models.