This is the official website for Docling, a powerful document processing tool that prepares your files for GenAI, RAG, and beyond.
Docling transforms messy PDFs, DOCX, and slides into clean, structured data—ready for RAG, GenAI apps, or anything downstream. It handles complex layouts, tables, and formulas so you don't have to.
- 🗂️ Multi-format Support: Parse PDF, DOCX, PPTX, XLSX, HTML, audio, and images
- 📑 Deep PDF Understanding: Layout, tables, reading order, code, and formulas
- 🧬 Unified Format: Consistent DoclingDocument structure for all outputs
- ↪ Multiple Export Formats: Markdown, HTML, DocTags, or lossless JSON
- 🔒 Local Processing: Run locally for sensitive or air-gapped environments
- 🤖 AI Framework Integration: Works with LangChain, LlamaIndex, Haystack, and more
- 🔍 OCR Support: Handle scanned PDFs and images
- 👓 Visual Language Models: Compatible with SmolDocling
- 🎙️ Audio Support: Automatic speech recognition (ASR)
- 💻 Developer Friendly: Fast CLI and easy integration
This website is built with:
- Next.js 14 - React framework
- TypeScript - Type safety
- Sass - CSS preprocessing
- Bootstrap 5 - UI components
First, install dependencies:
npm install
# or
yarn install
# or
pnpm install
Then, run the development server:
npm run dev
# or
yarn dev
# or
pnpm dev
Open http://localhost:3000 with your browser to see the result.
src/
├── app/ # Next.js app router
├── components/ # React components
│ ├── pages/ # Page-specific components
│ ├── ui/ # Reusable UI components
│ ├── icons/ # SVG icons
│ ├── header/ # Header components
│ └── footer/ # Footer components
├── styles/ # Global styles and fonts
├── lib/ # Utilities and types
└── utils/ # Constants and helpers
- Concepts: Learn Docling fundamentals
- Examples: Try out recipes for various use cases
- Integrations: Popular frameworks and tools
- Reference: See more API details
This website is part of the Docling project. Your feedback and contributions are welcome!
This project is part of the Docling ecosystem. See the main Docling repository for license information.