Commit graph

12 commits

Author SHA1 Message Date
yeungadrian
87e6fe4e86
Merge 1c0362f375 into 73ba69d8cd 2025-02-10 19:10:21 +07:00
KennyZhang1
bfde857420
Add support for conversion via Document Intelligence (#303)
* added cli params for doc intel

* added DocumentIntelligenceConverter class implementation

* initialized doc intel client instance field

* added isolated doc_intel main conversion function

* temp fix for ContentFormat import bug

* ran tests for docintel and offline for many filetypes

* push doc intel converter to the top of the stack

* formatting changes

* modified project toml file
2025-01-24 14:09:32 -08:00
yeungadrian
bbacf89b53 remove pandas, use calamine + tabulate 2025-01-04 00:22:59 +00:00
yeungadrian
b95312172f combine xlsx and xls to excel, replace openpxyl/xlrd with calamine 2025-01-03 23:49:37 +00:00
yeungadrian
08ed32869e
Feature/ Add xls support (#169)
* add xlrd
* add xls converter with tests
2025-01-03 13:58:17 -08:00
Murat Can Kurtuluş
d248621ba4
feat: outlook ".msg" file converter (#196)
* feat: outlook .msg converter
* add test, adjust docstring
2025-01-03 13:34:39 -08:00
lumin
c86287b7e3 feat: add project description in pyproject.toml 2024-12-19 13:02:47 +09:00
Adam Fourney
248d64edd0 Added llm tests to the local test set. 2024-12-17 12:13:19 -08:00
Divyansh Singh
52b723724c Fix character decoding issues with text-like files 2024-12-15 10:37:59 +05:30
Simon Willison
3b88696777
Remove invalid classifiers
requires-python says 3.10 and higher only
2024-12-13 10:53:35 -08:00
Adam Fourney
997c7af53c Added a simple CLI. 2024-11-14 07:50:21 -08:00
Adam Fourney
f20c964f99 Initial commit. 2024-11-13 13:00:01 -08:00