Wanting to use your personal or organizational data in AI workflows, but it’s stuck in PDFs and other document formats? Docling is here to help
Docling is an open source tool from IBM Research that converts files like PDFs and DocX into easy-to-use Markdown and JSON while keeping everything structured. In this video, Cedric Clyburn shows you how it works. We’ll walk through a demo using LlamaIndex for a question-answering app, and share some interesting details and benchmarks.
Let’s dig in and see how Docling can make working with your data so much easier for RAG, Fine-Tuning models, and more.
Subscribe for more demos like this one: @redhat
Resources:
https://github.com/DS4SD/docling
https://www.redhat.com/en/blog/docling-missing-document-processing-companion-generative-ai