Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
-
Updated
Dec 21, 2024 - Rust
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers
Cartographic rendering and mesh analytics powered by PyVista
Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications
A small, embeddable BASIC interpreter in C.
An unstrctured 3-D Euler equation solver ( tetrahedral cell type)
Customized bot with langchain and gpt4
Dynamic Multi-Agent RAG solution tailor made for long documents in finance and legal domains
Extract your docs (CSV, PDF, JSON, HTML, DOCS, Sheets and more) for your own GPT and LLM projects using Unstructured.io via streamlit
Benchmarking unstructured data extraction libraries
Generating Structured Local Area Model (SLAM) grids from unstructured meshes
UnsServ implementation in Python
The PDF Chatbot project uses advanced NLP models and Unstructured.io for parsing complex PDFs, enabling streamlined extraction and querying of information, including tables, graphs, and images, through a user-friendly interface.
PDF 문서에서 GPU 가속 처리로 고품질 질의응답(QA) 데이터를 자동 생성하고 LLM을 효율적으로 파인튜닝하는 솔루션입니다. Unstructured 라이브러리와 AWS Bedrock Claude로 도메인 특화 QA 쌍을 생성하고, LoRA 기법으로 경량 모델을 훈련합니다.
Butter is a networking stack and framework for building peer-to-peer applications (dapps).
An introductory tutorial to unstructured mesh support within Iris and its ecosystem.
Weaviate, Unstructured | Ingest PDFs into Weaviate
GitHub repository for Unstructured MCP Hackathon.
Add a description, image, and links to the unstructured topic page so that developers can more easily learn about it.
To associate your repository with the unstructured topic, visit your repo's landing page and select "manage topics."