23 commits

Author SHA1 Message Date
lumin
6da4b08f8c feat(tests): add comprehensive tests for MarkItDown functionality
Add new test cases for MarkItDown to cover LLM, remote, and 
local file conversions. Implement tests for handling 
deprecation warnings, external URL queries, and EXIF data 
processing. Ensure tests are skipped when necessary 
environment conditions are not met, improving test reliability 
and maintainability.
2024-12-26 19:15:07 +09:00
lumin
52d73080c7 refactor(tests): add helper function for tests (#87)
* refactor(tests): simplify string validation in tests

Introduce a helper function `validate_strings` to streamline the 
validation of expected and excluded strings in test cases. Replace 
repetitive string assertions in the `test_markitdown_local` function 
with calls to this new helper, improving code readability and 
maintainability.

* run pre-commit

---------

Co-authored-by: lumin <71011125+l-melon@users.noreply.github.com>
Co-authored-by: gagb <gagb@users.noreply.github.com>
2024-12-20 11:42:32 -08:00
afourney
9e546a8588 Merge branch 'main' into main 2024-12-17 15:37:28 -08:00
Adam Fourney
8d5f16ecd2 Fixed formatting. 2024-12-17 15:27:06 -08:00
afourney
a571021199 Merge branch 'main' into main 2024-12-17 15:12:59 -08:00
Adam Fourney
95188a4a27 Merge main. 2024-12-17 13:46:26 -08:00
Adam Fourney
03a7843a0a Added deprecation warnings for mlm_* arguments. 2024-12-17 13:22:48 -08:00
Adam Fourney
248d64edd0 Added llm tests to the local test set. 2024-12-17 12:13:19 -08:00
Soulter
752fbd333c feat: add tests of rss convertor 2024-12-17 22:45:27 +08:00
afourney
afaff11ef0 Merge branch 'main' into main 2024-12-16 14:40:58 -08:00
afourney
e7636656d8 Merge branch 'main' into support-comments-in-docx 2024-12-16 14:23:14 -08:00
afourney
12ce5e95b2 Merge branch 'main' into feature/add-pptx-chart-support 2024-12-16 14:06:14 -08:00
gagb
9e6a19987b Merge branch 'main' into main 2024-12-16 13:51:39 -08:00
Om Gupta
a3208f2bd0 feat: Add IpynbConverter
- Implemented IpynbConverter class for converting Jupyter Notebook (.ipynb) files into Markdown format.
- Supports markdown cells, code cells and raw cells.
- First markdown heading is used as the title if no title is found in notebook metadata.
- Created a test notebook (`test_notebook.ipynb`) to verify the functionality of the converter.
2024-12-17 01:00:41 +05:30
Ville Puuska
0a7203b876 add style_map prop to MarkItDown class 2024-12-15 17:23:57 +02:00
Ville Puuska
0704b0b6ff pass 'style_map' kwarg to mammoth when converting docx 2024-12-15 16:59:21 +02:00
sakasegawa
0dd4e95584 Remove _is_chart 2024-12-15 21:14:58 +09:00
sakasegawa
93130b5ba5 Add PPTX chart support 2024-12-15 20:42:55 +09:00
Divyansh Singh
52b723724c Fix character decoding issues with text-like files 2024-12-15 10:37:59 +05:30
Josh XT
4987201ef6 test 2024-12-14 08:49:03 -05:00
Josh XT
571c5bbc0e add test 2024-12-14 08:45:51 -05:00
Adam Fourney
1787b83d7d Fix remote tests. 2024-11-13 14:37:47 -08:00
Adam Fourney
f20c964f99 Initial commit. 2024-11-13 13:00:01 -08:00