yeungadrian
87e6fe4e86
Merge 1c0362f375 into 73ba69d8cd
2025-02-10 19:10:21 +07:00
KennyZhang1
bfde857420
Add support for conversion via Document Intelligence (#303)
...
* added cli params for doc intel
* added DocumentIntelligenceConverter class implementation
* initialized doc intel client instance field
* added isolated doc_intel main conversion function
* temp fix for ContentFormat import bug
* ran tests for docintel and offline for many filetypes
* push doc intel converter to the top of the stack
* formatting changes
* modified project toml file
2025-01-24 14:09:32 -08:00
yeungadrian
bbacf89b53
remove pandas, use calamine + tabulate
2025-01-04 00:22:59 +00:00
yeungadrian
b95312172f
combine xlsx and xls to excel, replace openpyxl/xlrd with calamine
2025-01-03 23:49:37 +00:00
yeungadrian
08ed32869e
Feature/ Add xls support (#169)
...
* add xlrd
* add xls converter with tests
2025-01-03 13:58:17 -08:00
Murat Can Kurtuluş
d248621ba4
feat: outlook ".msg" file converter (#196)
...
* feat: outlook .msg converter
* add test, adjust docstring
2025-01-03 13:34:39 -08:00
lumin
c86287b7e3
feat: add project description in pyproject.toml
2024-12-19 13:02:47 +09:00
Adam Fourney
248d64edd0
Added llm tests to the local test set.
2024-12-17 12:13:19 -08:00
Divyansh Singh
52b723724c
Fix character decoding issues with text-like files
2024-12-15 10:37:59 +05:30
Simon Willison
3b88696777
Remove invalid classifiers
...
requires-python says 3.10 and higher only
2024-12-13 10:53:35 -08:00
Adam Fourney
997c7af53c
Added a simple CLI.
2024-11-14 07:50:21 -08:00
Adam Fourney
f20c964f99
Initial commit.
2024-11-13 13:00:01 -08:00