Commit graph

  • e4b419ba40
    Pin Markdownify version. (#1069) afourney 2025-02-27 23:09:33 -0800
  • c37e3a64d2 Restore error logging behavior. Adam Fourney 2025-02-27 23:04:54 -0800
  • d211a9c97e Pin markdownify version. TODO: update code for compatibility with Markdownify 1.0.0 Adam Fourney 2025-02-27 23:02:52 -0800
  • dae94e63fc Just throw an error to see what the CI does. Adam Fourney 2025-02-27 22:56:18 -0800
  • 4e0a10ecf3 ran unit tests locally kennyzhang/add-file-object-support Kenny Zhang 2025-02-27 16:44:50 -0500
  • 950b135da6 formatting Kenny Zhang 2025-02-27 15:08:10 -0500
  • b671345bb9 updated readme Kenny Zhang 2025-02-27 15:07:46 -0500
  • d9a92f7f06 added file obj unit tests for rss and json Kenny Zhang 2025-02-27 15:05:29 -0500
  • db0c8acbaf added file obj support to rss and plain text converters Kenny Zhang 2025-02-27 14:55:49 -0500
  • 08330c2ac3 added core unit tests for file obj support Kenny Zhang 2025-02-27 11:27:05 -0500
  • ef9e29e49c address self-review Raduan77 2025-02-23 20:39:37 +0100
  • 17fd15369d merge w/ main + resolve merge conflicts Raduan77 2025-02-23 20:37:27 +0100
  • 4afc1fe886 added non-binary example to README Kenny Zhang 2025-02-21 13:31:37 -0500
  • b0044720da updated docs Kenny Zhang 2025-02-20 16:56:47 -0500
  • 07a28d4f00 black formatting Kenny Zhang 2025-02-20 16:49:37 -0500
  • b8b3897952 modify ext guesser Kenny Zhang 2025-02-20 16:47:37 -0500
  • 395ce2d301 close file object after using Kenny Zhang 2025-02-20 13:54:51 -0500
  • 808401a331 added conversion path for file object in central class Kenny Zhang 2025-02-19 17:02:51 -0500
  • e75f3f6f5b local path inputs to MarkitDown class adhere to new converterinput structure Kenny Zhang 2025-02-19 15:16:45 -0500
  • 8e950325d2 refactored remaining converters Kenny Zhang 2025-02-19 14:01:43 -0500
  • 096fef3d5f refactored more converters to support input class Kenny Zhang 2025-02-19 13:34:28 -0500
  • 0d9d22ace6
    Fix UnboundLocalError in MarkItDown._convert André Menezes 2025-02-19 16:55:45 +0000
  • 52cbff061a begin refactoring converter classes Kenny Zhang 2025-02-19 11:48:00 -0500
  • 0c5c6d092f fix(readme): add youtube URLs as markitdown supports Nima 2025-02-18 21:17:20 +0100
  • cc36fe9f0b fix: implement retry logic for YouTube transcript fetching and fix URL decoding issue Nima 2025-02-18 20:20:26 +0100
  • f712b63bf3 fix: improve YouTube transcript extraction reliability Nima 2025-02-18 19:32:19 +0100
  • 8363f419ab fix: improve metadata and description extraction logic Nima 2025-02-18 19:28:31 +0100
  • 8f76393ad8 fix: add error handling, refactor _findKey to use json.items() Nima 2025-02-18 19:22:38 +0100
  • 0027e6d425 added wrapper class for converter file input Kenny Zhang 2025-02-18 12:44:18 -0500
  • 63a7bafadd removed redundant priority setting Kenny Zhang 2025-02-18 12:18:49 -0500
  • f2e116294f add necessary imports Toshiyuki Sakamoto 2025-02-16 13:47:46 +0900
  • d3c3b24640
    Merge branch 'main' into add_whisper_for_audio Ji 2025-02-13 15:31:46 -0800
  • c569a9680d feat: add xsl files Marcos Romero Lamas 2025-02-13 10:34:00 +0100
  • 00e68e6310 This line was accidentally removed and is added back here mpowers 2025-02-12 23:47:22 -0500
  • 388c4889d4 Update to Test PPtx for nested shape mpowers 2025-02-12 23:32:10 -0500
  • f8c14a8928 Adds support for Shape Groups mpowers 2025-02-12 23:24:52 -0500
  • 6f673703e8 style: black files Marcos Romero Lamas 2025-02-12 23:25:13 +0100
  • dbdf2c0c10
    Added CLI tests. (#327) afourney 2025-02-11 20:42:50 -0800
  • f60b41b741 Added CLI tests. Adam Fourney 2025-02-11 17:42:41 -0800
  • edc71dbdba add test audio file. (the moon landing audio) Ji Zhang 2025-02-11 17:31:23 -0800
  • 1f3d3ef524 add test for audio. commented out by default. Ji Zhang 2025-02-11 17:31:11 -0800
  • b8927e5e65 fallback to _transcribe_audio Ji Zhang 2025-02-11 17:30:39 -0800
  • 8301427ab5 add whisper support for audio transcript. only trigger when have llm_client and openai Ji Zhang 2025-02-11 17:26:40 -0800
  • 97eeed5f32
    Doc Intelligence fixes for refactored code (#325) KennyZhang1 2025-02-11 19:01:46 -0500
  • 18d326ff7d removed duplicate priority argument Kenny Zhang 2025-02-11 17:36:55 -0500
  • d6b904d166
    Merge branch 'main' into kennyzhang/docintel-fixes KennyZhang1 2025-02-11 17:29:58 -0500
  • 3adb40e49b formatting Kenny Zhang 2025-02-11 17:04:18 -0500
  • 923d3fbcae fixed analysis features bug for docx Kenny Zhang 2025-02-11 16:51:15 -0500
  • 4ec5982223 added priority flag to doc intel converter constructor Kenny Zhang 2025-02-11 16:50:36 -0500
  • 935da9976c
    Added priority argument to all converter constructors. (#324) afourney 2025-02-11 12:36:32 -0800
  • 049b8f77f1 Fix docstring. Adam Fourney 2025-02-11 12:34:46 -0800
  • f5767c7e46 Merge branch 'set_priorities' of github.com:microsoft/markitdown into set_priorities Adam Fourney 2025-02-11 12:31:32 -0800
  • 540410e5c8 Promote discussion of converter priority to a docstring. Adam Fourney 2025-02-11 12:31:17 -0800
  • 2c89d518ec
    Merge branch 'main' into set_priorities afourney 2025-02-11 11:23:03 -0800
  • 548273543a
    Merge branch 'main' into bytes-to-md Nishith Jain 2025-02-12 00:22:55 +0530
  • d1868f8588 Re-enabled try-catch. Adam Fourney 2025-02-11 10:35:48 -0800
  • 5ce85c236c
    Fix a typo in sample RTF plugin (#320) Ruijun Gao 2025-02-12 02:33:52 +0800
  • 91d6a93bea
    Merge branch 'main' into patch-1 afourney 2025-02-11 10:33:04 -0800
  • 7a546fcd70
    Merge branch 'main' into set_priorities afourney 2025-02-11 10:14:51 -0800
  • 4298cfad8d Added priority argument to all converter constructors. Adam Fourney 2025-02-11 10:13:36 -0800
  • 3a5ca22a8d
    Don't generate md links in 'pre' blocks (#322) Tomasz Kalinowski 2025-02-11 10:13:17 -0500
  • d92cd2b2a7
    Updated pip install from source to single line Nishith Jain 2025-02-11 19:04:45 +0530
  • b86174d8ff
    Update _markitdown.py Nishith Jain 2025-02-11 18:56:20 +0530
  • 1cf8b26577
    a liittle fix Nishith Jain 2025-02-11 18:53:08 +0530
  • 1161c30ba3
    Update _markitdown.py Nishith Jain 2025-02-11 18:46:00 +0530
  • 025812d31c Don't generate md links in 'pre' blocks Tomasz Kalinowski 2025-02-11 05:39:12 -0500
  • 64d8bdc568
    Fix a typo in sample RTF plugin Ruijun Gao 2025-02-11 14:16:26 +0800
  • abe9752438 Bumped version v0.0.1a4 Adam Fourney 2025-02-10 16:01:17 -0800
  • 4b62506451 Small typo in README. Adam Fourney 2025-02-10 15:24:28 -0800
  • c73afcffea
    Cleanup and refactor, in preparation for plugin support. (#318) afourney 2025-02-10 15:21:44 -0800
  • dbb0e7641a Bumped version, and added a note about compatibility. Adam Fourney 2025-02-10 15:15:55 -0800
  • b15e563970 Remove deprecation tests. Adam Fourney 2025-02-10 14:29:19 -0800
  • 4095073fb2 Added another reference to plugin development. Adam Fourney 2025-02-10 14:26:25 -0800
  • cd4f646a2f Updated READMEs Adam Fourney 2025-02-10 14:23:05 -0800
  • d8fd3cb169 Added flags to list and load plugins. Updated READMEs Adam Fourney 2025-02-10 14:11:15 -0800
  • 00717c4fa6
    Merge 3aebc24f2f into 73ba69d8cd Raduan A. 2025-02-10 22:17:07 +0100
  • bfd1a252cb Bump sample version. Adam Fourney 2025-02-10 11:17:15 -0800
  • f188abe9d6 Updated the plugin interface. Adam Fourney 2025-02-10 11:05:20 -0800
  • 87e6fe4e86
    Merge 1c0362f375 into 73ba69d8cd yeungadrian 2025-02-10 19:10:21 +0700
  • 34b33be279
    Merge 95da5fd2ae into 73ba69d8cd Hieu Lam 2025-02-10 12:00:48 +0000
  • 95da5fd2ae chore: delete to separate the code format guess part to another PR Hieu Lam 2025-02-10 19:00:14 +0700
  • a75f1a68fb chore: added short description in README.md about the feature Hieu Lam 2025-01-15 12:50:40 +0700
  • 979cdc6257 feat: support images in table and auto detect code languages (optional) Hieu Lam 2025-01-15 12:37:16 +0700
  • 62313c15f5
    Merge 3047a299e6 into 73ba69d8cd lumin 2025-02-10 17:18:01 +0530
  • 6f916718c8
    Merge 1c9a938a44 into 73ba69d8cd dzemeuksis 2025-02-10 10:24:04 +0100
  • e54f706ae3 Fixed a few typos. Adam Fourney 2025-02-10 00:38:40 -0800
  • 25997a02fa It does not appear the CI is updating. Adam Fourney 2025-02-10 00:34:01 -0800
  • c1020eaaee Attempt to repair the CI Adam Fourney 2025-02-10 00:26:59 -0800
  • 52beb78f8a Added instructions to the README.md Adam Fourney 2025-02-10 00:25:23 -0800
  • ef2d79e879 Point tests to the correct folder. Adam Fourney 2025-02-10 00:17:04 -0800
  • 8e3fba38c6 Added sample plugin. Adam Fourney 2025-02-10 00:13:17 -0800
  • 0f5554d67d Moved everything to a packages subfolder. Adam Fourney 2025-02-09 23:23:53 -0800
  • b40291d747 Significant cleanup and refactor. Adam Fourney 2025-02-09 20:42:58 -0800
  • 38d6fcc001
    Merge 0f948ade40 into 73ba69d8cd ZeyuTeng96 2025-02-09 17:46:35 -0800
  • a795a16ce0 All converters. Adam Fourney 2025-02-09 16:38:25 -0800
  • 6793648d15 More converters. Adam Fourney 2025-02-09 15:45:36 -0800
  • 254946858c More converters. Adam Fourney 2025-02-09 12:24:00 -0800
  • 7a6a08b3a1 More converters. Adam Fourney 2025-02-09 11:38:47 -0800
  • 71fa94e3c9 Work started moving converters to individual files. Adam Fourney 2025-02-09 10:33:42 -0800
  • bc5a57ec6e
    Merge 39d5a088b8 into 73ba69d8cd Raduan A. 2025-02-09 10:47:40 -0600