rong-xyz
f9510656e0
Merge pull request #8 from pathintegral-institute/rong/tech-150-add-kwarg
...
make pydub an optional import
2025-04-24 14:57:24 +08:00
rong-xyz
2ac9cdc120
make pydub an optional import
2025-04-24 06:56:00 +00:00
rong-xyz
24ac33a1a3
Merge pull request #7 from pathintegral-institute/rong/tech-150-add-kwarg
...
fix image type
2025-04-24 14:11:02 +08:00
rong-xyz
da2cb0a796
fix image type
2025-04-24 06:10:24 +00:00
rong-xyz
237753db15
Merge pull request #6 from pathintegral-institute/rong/tech-150-add-kwarg
...
add converters
2025-04-23 20:45:38 +08:00
rong-xyz
39d14472fc
add converters
2025-04-23 12:44:49 +00:00
rong-xyz
f2fc240e28
Update README.md
2025-04-23 17:32:02 +08:00
rong-xyz
41a2c5be78
Merge pull request #5 from pathintegral-institute/rong/tech-149-allow-plugin-for-markitup
...
allow plugin
2025-04-23 17:24:58 +08:00
rong-xyz
87738fd782
allow plugin
2025-04-23 09:23:59 +00:00
rong-xyz
5213b8d22e
Merge pull request #4 from pathintegral-institute/rong/tech-145-add-test
...
Rong/tech 145 add test
2025-04-23 17:03:47 +08:00
rong-xyz
a837ae7808
add readme
2025-04-23 08:56:14 +00:00
rong-xyz
72e89cb368
fix test
2025-04-23 08:55:36 +00:00
rong-xyz
ce9cfff3bf
add readme and test
2025-04-23 07:18:35 +00:00
rong-xyz
8527b09e3f
add readme and test
2025-04-23 07:18:23 +00:00
rong-xyz
ff31c019df
Merge pull request #3 from pathintegral-institute/rong/tech-141-image-and-html-support
...
Rong/tech 141 image and html support
2025-04-23 14:44:36 +08:00
rong-xyz
71e55ba93e
support html
2025-04-23 06:43:37 +00:00
rong-xyz
bc67a318a1
finished webp support
2025-04-23 06:39:24 +00:00
rong-xyz
46b44d3ebd
support images
2025-04-23 06:13:19 +00:00
rong-xyz
cd85971867
Merge pull request #2 from pathintegral-institute/rong/tech-139-modality-conversion
...
Rong/tech 139 modality conversion
2025-04-22 19:30:24 +08:00
rong-xyz
e521dbcf2d
added image in html converter
2025-04-22 11:26:07 +00:00
rong-xyz
4519f9230c
remove xlrd
2025-04-22 10:18:05 +00:00
rong-xyz
c47cd0deec
support xlsx and xls
2025-04-22 10:17:22 +00:00
rong-xyz
f33a0ed922
finished audio transcription
2025-04-22 09:30:07 +00:00
rong-xyz
03f3fa9829
modality
2025-04-22 07:00:30 +00:00
rong-xyz
e729da2b38
Merge pull request #1 from pathintegral-institute/rong/tech-135-markitup-cleanup
...
Rong/tech 135 markitup cleanup
2025-04-22 14:33:09 +08:00
rong-xyz
cda189b8d0
supports images
2025-04-21 12:41:33 +00:00
rong-xyz
cc2ec44a4b
package change
2025-04-21 12:21:35 +00:00
rong-xyz
1e36bd8fc1
file change
2025-04-21 12:18:36 +00:00
rong-xyz
555a849a66
supports pptx
2025-04-21 09:37:43 +00:00
rong-xyz
615975f918
remove files
2025-04-21 08:43:19 +00:00
rong-xyz
9909ae13b8
add uv
2025-04-21 08:21:20 +00:00
rong-xyz
278c1d1c97
further remove
2025-04-21 08:17:21 +00:00
rong-xyz
9f70a124e0
del test files
2025-04-21 08:15:53 +00:00
rong-xyz
b66453a2e8
rename to markitup
2025-04-21 07:13:19 +00:00
createcentury
041be54471
Update README.md (#1187)
...
updated subtle misspelling.
2025-04-13 09:31:40 -07:00
lentil32
ebe2684b3d
chore: fix typo in README.md (#1175)
...
* chore: fix typo in README.md
2025-04-13 09:29:16 -07:00
Turdıbek
8576f1d915
Add CSV to Markdown table conversion - fixes #1144 (#1176)
...
* feat: Add CSV to Markdown table converter
- Add new CsvConverter class to convert CSV files to Markdown tables
- Support text/csv and application/csv MIME types
- Preserve table structure with headers and data rows
- Handle edge cases like empty cells and mismatched columns
- Fix Azure Document Intelligence dependency handling
- Register CsvConverter in MarkItDown class
----
Thanks also to @benny123tw who submitted a very similar PR in #1171
2025-04-13 09:19:00 -07:00
Sathindu
3fcd48cdfc
feat: render math equations in .docx documents (#1160)
...
* feat: math equation rendering in .docx files
* fix: import fix on .docx pre processing
* test: add test cases for docx equation rendering
* docs: add ThirdPartyNotices.md
* refactor: reformatted with black
2025-03-28 15:36:38 -07:00
afourney
9e067c42b6
Make it easier to use AzureKeyCredentials with Azure Doc Intelligence (#1151)
...
* Make it easier to use AzureKeyCredentials with Azure Doc Intelligence
* Fixed mypy type error.
* Added more fine-grained options over types.
* Pass doc intel options further up the stack.
2025-03-26 10:44:11 -07:00
afourney
9a951055f0
Update readme to point to the mcp package. (#1158)
...
* Updated readme with link to the MCP package.
2025-03-25 15:00:04 -07:00
afourney
73b9d57312
Update badges (#1157)
...
* Update badges in subpackages.
2025-03-25 14:52:24 -07:00
afourney
3ca57986ef
Basic SSE MCP Server for MarkItDown (#1155)
...
* Added an initial minimal MCP server for MarkItDown
* Added STDIO default option.
* Added a Dockerfile, and updated the README accordingly. Also added instructions for Claude Desktop
* Pin mcp version.
2025-03-25 14:38:22 -07:00
afourney
c1f9a323ee
Bump version. (#1154)
2025-03-24 23:26:30 -07:00
afourney
e928b43afb
convert_url renamed to convert_uri, and now handles data and file URIs (#1153)
2025-03-24 21:43:04 -07:00
afourney
2ffe6ea591
Bump version. (#1150)
2025-03-22 11:21:32 -07:00
afourney
efc55b260d
Bump version and resolve a console encoding error. (#1149)
2025-03-21 09:27:25 -07:00
Yuzhong Zhang
52432bd228
Add support for preserving base64 encoded images (#1140)
...
* optionally preserve base64 string in markdown _CustomMarkdownify and pptx
* add parameter support to other converters
* fix linter
* Use *kwarg to pass keep_data_uri parameter.
* Add module cli vector tests
* Fixed formatting, and adjusted tests.
2025-03-20 18:50:23 -07:00
afourney
c0a511ecff
Updated docx file to include an image. (#1146)
2025-03-20 12:25:56 -07:00
afourney
cd6aa41361
Adjust warning filters and update dependencies (#1143)
...
Adjusts warning filters to be more contextual
Updates dependencies for magika and youtube-transcript-api
Updates the version to 0.1.0a5 in __about__.py
2025-03-19 22:09:14 -07:00
afourney
716f74dcb9
Consider anything with a charset as plain text-convertible. (#1142)
2025-03-19 20:46:35 -07:00