procedure
The DocumentParser component utilizes the open-source Docling library to convert input documents in formats such as PDF, HTML, XLSX, and CSV into a unified intermediate representation in JSON or Markdown format, while retaining layout, tables, and structural metadata.

Authors

Sources

Referenced by nodes (1)