Skip to content

v25.3.0

Compare
Choose a tag to compare
@muhammadumargroupdocs muhammadumargroupdocs released this 20 Mar 11:38
· 9 commits to main since this release

Initial Release of Openize.MarkItDown for Python

The Openize.MarkItDown for Python library is a utility tool for converting various files to Markdown, making it useful for indexing, text analysis, and further processing. Additionally, it supports integrating the converted Markdown data into LLM models for enhanced AI-driven applications.

It presently supports:

  • PDF (.pdf)
  • PowerPoint (.pptx)
  • Word (.docx)
  • Excel (.xlsx)

Simple API Usage

from openize.markitdown.core import MarkItDown

# Define input file and output directory
input_file = "report.pdf"
output_dir = "output_markdown"

# Create MarkItDown instance
converter = MarkItDown(output_dir)

# Convert document and send output to LLM
converter.convert_document(input_file, insert_into_llm=True)

print("Conversion completed and data sent to LLM.")

We welcome feedback and contributions to enhance Openize.MarkItDown. Feel free to submit issues, suggestions, or pull requests to our repository.

Happy converting!