Commit graph

11 commits

Author SHA1 Message Date
lumin
62313c15f5
Merge 3047a299e6 into 73ba69d8cd
2025-02-10 17:18:01 +05:30
KennyZhang1
bfde857420
Add support for conversion via Document Intelligence (#303)
* added cli params for doc intel
* added DocumentIntelligenceConverter class implementation
* initialized doc intel client instance field
* added isolated doc_intel main conversion function
* temp fix for ContentFormat import bug
* ran tests for docintel and offline for many filetypes
* push doc intel converter to the top of the stack
* formatting changes
* modified project toml file
2025-01-24 14:09:32 -08:00
yeungadrian
08ed32869e
Feature/ Add xls support (#169)
* add xlrd
* add xls converter with tests
2025-01-03 13:58:17 -08:00
Murat Can Kurtuluş
d248621ba4
feat: outlook ".msg" file converter (#196)
* feat: outlook .msg converter
* add test, adjust docstring
2025-01-03 13:34:39 -08:00
lumin
fde036f08f
chore: update GitHub Actions and Python versions
Update the GitHub Actions checkout and setup-python versions to v4 and v5 respectively. Add Python 3.13 to the testing matrix in both the workflow and pyproject.toml to ensure compatibility with the latest Python release.
2024-12-21 12:09:27 +09:00
lumin
c86287b7e3
feat: add project description in pyproject.toml
2024-12-19 13:02:47 +09:00
Adam Fourney
248d64edd0
Added llm tests to the local test set.
2024-12-17 12:13:19 -08:00
Divyansh Singh
52b723724c
Fix character decoding issues with text-like files
2024-12-15 10:37:59 +05:30
Simon Willison
3b88696777
Remove invalid classifiers
requires-python says 3.10 and higher only
2024-12-13 10:53:35 -08:00
Adam Fourney
997c7af53c
Added a simple CLI.
2024-11-14 07:50:21 -08:00
Adam Fourney
f20c964f99
Initial commit.
2024-11-13 13:00:01 -08:00