Commit graph

  • c9c7d98d30
    Merge pull request #11 from simonw/patch-2 gagb 2024-12-16 13:45:05 -0800
  • e7d9b5546a
    Merge branch 'main' into patch-2 gagb 2024-12-16 13:42:28 -0800
  • ed651aeb16
    Fix LLM terminology in code CharlesCNorton 2024-12-16 16:23:52 -0500
  • 3d9f3f3e5b
    Fix LLM terms CharlesCNorton 2024-12-16 16:23:03 -0500
  • a3208f2bd0 feat: Add IpynbConverter Om Gupta 2024-12-17 01:00:41 +0530
  • 7c4a35c30a
    added installation instructions Sarthak Pati 2024-12-16 12:58:04 -0500
  • ad01da308d fix issue #65 Divit 2024-12-16 21:48:33 +0530
  • fb7beff170 removed args as it is not fixing in pycharm, fixed args alignment Aviral Bhardwaj 2024-12-16 20:45:19 +0530
  • 964ee4311b
    Fix syntax error in test_markitdown.py HendricksJudy 2024-12-16 06:21:53 -0800
  • c2cb72122b fixed args alignment Aviral Bhardwaj 2024-12-16 19:39:10 +0530
  • 010f841008
    Ensure hatch is installed before running tests CyberNobie 2024-12-16 18:47:24 +0530
  • dfa7098336 Add binary and Homebrew support for CLI #25 PratibhaK11 2024-12-16 18:03:16 +0530
  • e521eebc66 Add binary and Homebrew support for CLI #25 PratibhaK11 2024-12-16 18:01:28 +0530
  • 5fc03b6415
    Added UID as argument Michele Adduci 2024-12-16 13:11:13 +0100
  • 013b022427
    Added Docker Image for using markitdown in a sandboxed environment Michele Adduci 2024-12-16 13:08:15 +0100
  • 04f00606ab new file Moustafaplusplus 2024-12-16 12:00:31 +0000
  • 5b4dcafb35 Add Homebrew formula for Markitdown PratibhaK11 2024-12-16 17:07:45 +0530
  • 42027aac2d chore: type annot Hew Li Yang 2024-12-16 17:35:44 +0800
  • 533f43f834 Add OneNote support HendricksJudy 2024-12-16 17:05:47 +0800
  • 19dc6a3641 chore: update test excel with a nan Hew Li Yang 2024-12-16 15:44:30 +0800
  • 5de769f1bc chore: excel improvements Hew Li Yang 2024-12-16 15:27:03 +0800
  • 695100d5d8 Support specifying YouTube transcript language narumi 2024-12-16 13:16:00 +0800
  • d66ef5fcca Update README to introduce the customized mlm_prompt Soulter 2024-12-16 12:08:51 +0800
  • c168703d5e Pass the kwargs to _convert method when converting an url file Soulter 2024-12-16 11:41:39 +0800
  • 3548c96dd3
    Create .gitattributes Yeonjun 2024-12-16 09:21:07 +0900
  • 5cbd3ceb6e
    Merge 1ffb875bf6 into 81e3f24acd Alex 2024-12-15 23:57:00 +0000
  • 1ffb875bf6 Proper handling when trying to read a non-existing file Alex 2024-12-16 01:54:43 +0200
  • 1559d9d163 pre-commit ran SH4DOW4RE 2024-12-15 22:15:20 +0100
  • b7f5662ffd PR: Catching pydub's warning of ffmpeg or avconv missing SH4DOW4RE 2024-12-15 17:29:14 +0100
  • 457f5e5034 test case passed 2 Aviral Bhardwaj 2024-12-15 21:49:13 +0530
  • cdea019f9d test case passed Aviral Bhardwaj 2024-12-15 21:46:08 +0530
  • bd25877d5f Adding test file Aviral Bhardwaj 2024-12-15 21:26:49 +0530
  • bb34d93711 Added DOC to Markdown Converter Function - Issue #23 Aviral Bhardwaj 2024-12-15 21:15:32 +0530
  • 0a7203b876
    add style_map prop to MarkItDown class Ville Puuska 2024-12-15 17:23:57 +0200
  • 024778a155 removing all indentation errors that came previously Aviral Bhardwaj 2024-12-15 20:48:38 +0530
  • 0704b0b6ff
    pass 'style_map' kwarg to mammoth when converting docx Ville Puuska 2024-12-15 16:59:21 +0200
  • 81df7599c7 according to this issue ``https://github.com/microsoft/markitdown/issues/23`` added doc DocConverter function Aviral Bhardwaj 2024-12-15 19:34:31 +0530
  • 0dd4e95584 Remove _is_chart sakasegawa 2024-12-15 21:14:58 +0900
  • 93130b5ba5 Add PPTX chart support sakasegawa 2024-12-15 20:42:55 +0900
  • eb09e3701d Add AsyncMarkItDown as a wrapper Raduan77 2024-12-15 12:18:44 +0100
  • 52b723724c Fix character decoding issues with text-like files Divyansh Singh 2024-12-15 10:37:15 +0530
  • a55c3d525c
    Merge branch 'main' into main Josh XT 2024-12-14 23:09:30 -0500
  • 02cc0cef84
    feat: Add OCR fallback when MLM is unavailable for image processing suke 2024-12-15 11:52:58 +0800
  • 970c5e91a5
    Merge 0c25a086e7 into 81e3f24acd gagb 2024-12-14 19:29:55 -0800
  • 81e3f24acd
    Merge pull request #29 from microsoft/gagb-patch-1 gagb 2024-12-14 19:17:54 -0800
  • b84294620a
    Update README.md gagb 2024-12-14 19:05:51 -0800
  • 60c495d609
    Merge branch 'main' into patch-2 gagb 2024-12-14 18:57:11 -0800
  • 71123a4df3
    Merge pull request #7 from microsoft/gagb/improve-readme gagb 2024-12-14 18:54:28 -0800
  • 5753e553fe Fix conflicts gagb 2024-12-14 18:47:34 -0800
  • 752dd897b9
    Merge pull request #28 from pawarbi/main gagb 2024-12-14 18:44:52 -0800
  • 1aa4abe90f
    Merge branch 'gagb/improve-readme' into main gagb 2024-12-14 18:44:33 -0800
  • ea7c6dcc40
    Merge pull request #27 from haesleinhuepf/patch-1 gagb 2024-12-14 18:39:51 -0800
  • a31c0a13e7
    Merge branch 'main' into gagb/improve-readme gagb 2024-12-14 18:34:27 -0800
  • 0c25a086e7
    Merge branch 'main' into gagb/add-github-issue-conversion gagb/add-github-issue-conversion gagb 2024-12-14 18:34:18 -0800
  • 30ab78fe9e
    Update README.md Sandeep Pawar 2024-12-14 19:15:10 -0600
  • 559b1fc62a
    Merge branch 'main' into patch-2 gagb 2024-12-14 15:02:42 -0800
  • df03382218 Improve docustring Josh XT 2024-12-14 17:55:22 -0500
  • 18301edcd0
    Add installation instructions Robert Haase 2024-12-14 23:22:54 +0100
  • 4987201ef6 test Josh XT 2024-12-14 08:49:03 -0500
  • 571c5bbc0e add test Josh XT 2024-12-14 08:45:51 -0500
  • e8ea8b6f3d Update readme Josh XT 2024-12-14 08:41:07 -0500
  • 7e634acf5f import zipfile Josh XT 2024-12-14 08:24:44 -0500
  • 862c39029e add zip handling Josh XT 2024-12-14 06:34:47 -0500
  • 6c46bf6549
    Merge branch 'main' into gagb/add-openai-example gagb 2024-12-13 23:47:10 -0800
  • 70ab149ff1
    Merge pull request #10 from simonw/patch-1 afourney 2024-12-13 21:10:53 -0800
  • afca0a8a87 Add example of using MarkItDown with OpenAI to README gagb 2024-12-13 15:54:34 -0800
  • 8a30fca732 Add support for GH prs as well gagb 2024-12-13 14:57:39 -0800
  • 0b6554738c Move github handling from convert to convert_url gagb 2024-12-13 14:16:56 -0800
  • f1274dca87 Run pre-commit gagb 2024-12-13 13:58:24 -0800
  • 778fca3f70
    Fix code scanning alert no. 1: Incomplete URL substring sanitization gagb 2024-12-13 13:57:03 -0800
  • 7979eecfef SHift to Documentconverter class gagb 2024-12-13 13:52:37 -0800
  • 33ce17954d
    Note about piping Simon Willison 2024-12-13 11:09:03 -0800
  • 6ebef5af0c
    CLI usage instructions Simon Willison 2024-12-13 11:06:11 -0800
  • 3b88696777
    Remove invalid classifiers Simon Willison 2024-12-13 10:53:35 -0800
  • 3f9ba06418 Improve the readme with contributing guidelines gagb 2024-12-12 15:17:18 -0800
  • 8f16f32d53 Add tests gagb 2024-12-12 23:10:23 +0000
  • 28af7ad341 Run pre-commit gagb 2024-12-12 22:39:03 +0000
  • 9d047103d5 Add method to convert GitHub issue to markdown gagb 2024-12-12 13:41:31 -0800
  • b40139652b
    Merge pull request #4 from microsoft/fixes_for_filesurfer v0.0.1a2 afourney 2024-11-25 15:08:44 -0800
  • cc0a039bb0 Small fixes for the filesurfer. Adam Fourney 2024-11-25 13:42:05 -0800
  • 851c7cff96
    Merge pull request #3 from microsoft/add_cli afourney 2024-11-14 10:27:55 -0800
  • 2ad821ae8f Merge branch 'add_cli' of github.com:microsoft/markitdown into add_cli Adam Fourney 2024-11-14 10:24:52 -0800
  • 2eab564c4c Fix continue trying on errors. Adam Fourney 2024-11-14 10:23:40 -0800
  • e3f8cdf1da
    Merge branch 'main' into add_cli afourney 2024-11-14 07:53:37 -0800
  • 997c7af53c Added a simple CLI. Adam Fourney 2024-11-14 07:50:21 -0800
  • 8a2957292c
    Merge pull request #2 from microsoft/update_readme afourney 2024-11-13 16:29:02 -0800
  • 3354904d0d
    Merge branch 'main' into update_readme afourney 2024-11-13 16:26:33 -0800
  • c78412536f Replaced placeholder content in the readme. Adam Fourney 2024-11-13 16:25:54 -0800
  • fdf1102148
    Merge pull request #1 from microsoft/ci_test afourney 2024-11-13 14:44:39 -0800
  • 1787b83d7d Fix remote tests. Adam Fourney 2024-11-13 14:37:47 -0800
  • fc3349185c Testing CI Adam Fourney 2024-11-13 14:33:31 -0800
  • f20c964f99 Initial commit. Adam Fourney 2024-11-13 13:00:01 -0800
  • 67fec84618 SUPPORT.md committed Microsoft Open Source 2024-11-13 11:56:48 -0800
  • 9bc7e2bed3 SECURITY.md committed Microsoft Open Source 2024-11-13 11:56:47 -0800
  • 558adf0253 README.md committed Microsoft Open Source 2024-11-13 11:56:46 -0800
  • bc978beb97 LICENSE committed Microsoft Open Source 2024-11-13 11:56:45 -0800
  • 1e22c8e989 CODE_OF_CONDUCT.md committed Microsoft Open Source 2024-11-13 11:56:44 -0800
  • f454a6d3c8
    Initial commit microsoft-github-operations[bot] 2024-11-13 19:56:40 +0000