Commit graph

  • c0c0533a5e
    slight tidy Casper da Costa-Luis 2024-12-17 05:51:30 +0000
  • 82d84e3edd
    Fixed formatting. (#1098) afourney 2025-03-05 23:30:29 -0800
  • 0c28ebde26 Fixed formatting. Adam Fourney 2025-03-05 23:29:10 -0800
  • 36c4bc9ec3
    Fixed deepcopy failure when passing llm_client (#1089) scalabreseGD 2025-03-06 08:25:37 +0100
  • 13552985a5
    Merge branch 'main' into main afourney 2025-03-05 23:24:11 -0800
  • 80baa5db18
    fix(README): correct pip install command formatting (#1090) Andrea Pietrobon 2025-03-06 08:21:10 +0100
  • 7b64554163
    Merge branch 'main' into patch-1 afourney 2025-03-05 23:19:37 -0800
  • 00a65e8f8b Fixed version in README. Adam Fourney 2025-03-05 23:10:21 -0800
  • 6bedf6d950
    Fixed version. (#1097) v0.1.0a1 afourney 2025-03-05 22:52:52 -0800
  • 70f3e7be03 Fixed version. Adam Fourney 2025-03-05 22:51:09 -0800
  • 9380112892
    Fixed loading of plugins. (#1096) afourney 2025-03-05 22:24:08 -0800
  • 1038174851 Fixed loading of plugins. Adam Fourney 2025-03-05 22:20:00 -0800
  • 784c293579 Bump plugin version. Adam Fourney 2025-03-05 21:55:20 -0800
  • 8eaf5a1da9 Clean up README.md v0.0.1 Adam Fourney 2025-03-05 21:35:08 -0800
  • 38c924793c
    Bump version (#1095) afourney 2025-03-05 21:30:56 -0800
  • fe0168132d Bump version Adam Fourney 2025-03-05 21:30:07 -0800
  • 70e9f8c3c0
    Bump version. (#1094) afourney 2025-03-05 21:26:06 -0800
  • 04df800299 Bump version. Adam Fourney 2025-03-05 21:24:57 -0800
  • 7b4d6eebb2 Bump version. Adam Fourney 2025-03-05 21:20:34 -0800
  • e921497f79
    Update converter API, use streams rather than file paths (#1088) afourney 2025-03-05 21:16:55 -0800
  • 3a588656d1 Update .gitattributes Adam Fourney 2025-03-05 21:07:18 -0800
  • b72ec04c60
    Update _mp3_converter.py to fix issue SigireddyBalasai 2025-03-06 10:35:06 +0530
  • aa86ae9c30 Fixed flow of README. Adam Fourney 2025-03-05 21:03:37 -0800
  • ce792ec731 Fixed typos. Adam Fourney 2025-03-05 21:00:29 -0800
  • ae5fd74821 Updated README Adam Fourney 2025-03-05 20:57:49 -0800
  • a7ae7c53d8 Move priority to outside DocumentConverter, allowing them to be reprioritized, and keeping the DocumentConverter interface simple. Adam Fourney 2025-03-05 20:09:18 -0800
  • 84f8198d8a Fixed many mypy errors. Adam Fourney 2025-03-05 16:41:15 -0800
  • aa94bce6d9 Bumped version. Adam Fourney 2025-03-05 15:15:23 -0800
  • fe1d57a06f Updated DocumentConverter documentation. Adam Fourney 2025-03-05 15:12:13 -0800
  • 1eb8b927c2 Add type hint, resolving circular import. Adam Fourney 2025-03-05 15:04:59 -0800
  • 1ce769e70d Fixed formatting. Adam Fourney 2025-03-05 14:01:53 -0800
  • a96a6a01b5 more formatting Kenny Zhang 2025-03-05 16:57:54 -0500
  • 8c3dd01f2f black formatting Kenny Zhang 2025-03-05 16:54:51 -0500
  • 30e5189581 removed dupe priority setting Kenny Zhang 2025-03-05 16:48:23 -0500
  • c281844c02 ported over unit tests from prev branch Kenny Zhang 2025-03-05 16:44:13 -0500
  • 4d097aa379 Updated markdownify dependency. Adam Fourney 2025-03-05 13:03:48 -0800
  • 88a5af2dfb
    fix(README): correct pip install command formatting Andrea Pietrobon 2025-03-05 21:12:02 +0100
  • cc38144752 Updated project readme with notes about changes, and use-cases. Adam Fourney 2025-03-05 11:50:56 -0800
  • 5f0b63bb95 Remove stale comments. Adam Fourney 2025-03-05 11:38:43 -0800
  • aa57757395 Updated plugin README. Adam Fourney 2025-03-05 11:37:00 -0800
  • 36a49806b5 Updated sample plugin to new Converter interface. Adam Fourney 2025-03-05 11:30:48 -0800
  • b3d6009eb8 Small cleanup. Adam Fourney 2025-03-05 10:42:36 -0800
  • 736e0ae332 Fixed exif warning test. Adam Fourney 2025-03-05 10:39:29 -0800
  • a9ceb13feb Added support for various audio files. Adam Fourney 2025-03-05 10:15:42 -0800
  • 1c746925d2
    Fixed deepcopy failure when passing llm_client scalabreseGD 2025-03-05 15:32:31 +0100
  • c426cb81b3 Most converters are now working. Adam Fourney 2025-03-05 00:24:54 -0800
  • 4a034da269 Stream exiftool. Adam Fourney 2025-03-04 17:18:54 -0800
  • 7879028c98 Added Outlook messages. Adam Fourney 2025-03-04 16:15:07 -0800
  • 4d09a4c6c6 Updating converters. Adam Fourney 2025-03-04 13:57:49 -0800
  • df372fa460 Progress on HTML converter. Adam Fourney 2025-03-04 08:33:50 -0800
  • 4129f30c23 More progress. Adam Fourney 2025-03-04 00:52:57 -0800
  • 7bc6d827ee Experimenting with new signatures. Adam Fourney 2025-03-03 23:01:16 -0800
  • e43632b048 Initial work updating signatures. Adam Fourney 2025-03-03 13:16:15 -0800
  • 1d2f231146
    Fixed property name (#1085) afourney 2025-03-03 09:45:36 -0800
  • 57afba40d6 Fixed property name Adam Fourney 2025-03-03 09:31:48 -0800
  • c5cd659f63
    Exploring ways to allow Optional dependencies (#1079) afourney 2025-03-03 09:06:19 -0800
  • 0de59e68e3
    Merge d3c3b24640 into f01c6c5277 Ji 2025-03-03 21:48:57 +0900
  • 605cc2fc8b
    Merge 548273543a into f01c6c5277 Nishith Jain 2025-03-01 15:56:03 -0700
  • 57fbb3d640 just replaced everywhere SigireddyBalasai 2025-03-01 09:12:57 +0000
  • 18aa3478ba
    Merge branch 'main' into patch-2 afourney 2025-02-28 23:15:27 -0800
  • a056192aaf Move openai to hatch-test environment. Adam Fourney 2025-02-28 23:09:01 -0800
  • 53feead8ae Updated READMEs, and finished remaining feature-categories. Adam Fourney 2025-02-28 22:53:11 -0800
  • a2cf8ee889 Added Ole files. Adam Fourney 2025-02-28 21:55:26 -0800
  • 11ffd2e550 Added pdfs Adam Fourney 2025-02-28 21:35:14 -0800
  • 8362df8e60 Added xlsx and xls Adam Fourney 2025-02-28 21:21:17 -0800
  • e5dc512948 Reuse error messages for missing dependencies. Adam Fourney 2025-02-28 20:28:35 -0800
  • 98698a64ce Added .docx to optional dependencies Adam Fourney 2025-02-28 17:06:59 -0800
  • b9487b6b6d Fix CLI tests.... have them install [all] Adam Fourney 2025-02-28 16:57:19 -0800
  • 10da043ec6 Merge branch 'main' into optional_dependencies Adam Fourney 2025-02-28 16:29:47 -0800
  • f01c6c5277
    Exceptions should subclass Exception not BaseException. (#1082) afourney 2025-02-28 16:28:35 -0800
  • e815a4227b Exceptions should subclass Exception not BaseException. Adam Fourney 2025-02-28 16:26:56 -0800
  • df80df0d1f Merge main. Adam Fourney 2025-02-28 16:25:04 -0800
  • 43bd79adc9
    Print and log better exceptions when file conversions fail. (#1080) afourney 2025-02-28 16:07:47 -0800
  • fcf7892bfb Added unit tests for exceptions. Adam Fourney 2025-02-28 16:05:13 -0800
  • a9670d7a21 Print and log better exceptions when file conversions fail. Adam Fourney 2025-02-28 12:46:53 -0800
  • 7d2e0bd9d4 Exploring ways to enable optional dependencies. Starting with pptx. Adam Fourney 2025-02-28 11:57:51 -0800
  • 0f63a7e28f Merge branch 'main' into optional_dependencies Adam Fourney 2025-02-28 11:08:43 -0800
  • 9182923375
    Don't have ZipConverter accept OOXML files. This will never yield a good result. (#1078) afourney 2025-02-28 09:54:19 -0800
  • c0c48bdf80 Don't have ZipConverter accept OOXML files. This will never yield a good result. Adam Fourney 2025-02-28 09:48:31 -0800
  • 2af4ba861c Renamed exception. Adam Fourney 2025-02-28 08:55:34 -0800
  • 8aa3378cce
    Update README.md SigireddyBalasai 2025-02-28 22:02:44 +0530
  • 9a19fdd134
    Make sure extensions are unique in MarkItDown's convert methods. (#1076) afourney 2025-02-28 07:43:03 -0800
  • 20a85d553a Make sure extensions are unique in MarkItDown's convert methods. Adam Fourney 2025-02-28 07:41:47 -0800
  • b9526d5e47
    Bump version. (#1075) afourney 2025-02-28 07:30:46 -0800
  • 326d17b802 Bump version. v0.0.1a5 Adam Fourney 2025-02-28 07:29:12 -0800
  • 519fe172aa
    Unable to convert HTML to Markdown (#1072) Hieu Lam 2025-02-28 15:57:41 +0700
  • 685e87570b Fixed formatting. Adam Fourney 2025-02-28 00:55:05 -0800
  • adf87dbe9f Fixed extensions getting randomized. Adam Fourney 2025-02-28 00:52:24 -0800
  • d565e77592 fix: reformatted files Hieu Lam 2025-02-28 14:56:11 +0700
  • 74f41bdc1e chore: deduplicate the list of extensions Hieu Lam 2025-02-28 14:43:28 +0700
  • c5e3ab4854 feat: issue where inherited function from markdownify.MarkdownConverter doesn't have current_tags leading to error using kwargs, also set default value for convert_as_inline Hieu Lam 2025-02-28 14:35:04 +0700
  • e82e0c1372
    Add Support For PPTX Shape Groups (Fix in code design to not miss out on slide content) (#331) Matthew Powers 2025-02-28 02:21:51 -0500
  • f5de1bf60d
    Merge branch 'main' into main afourney 2025-02-27 23:20:47 -0800
  • a394cc7c27
    fix: Implement retry logic for YouTube transcript fetching and fix URL decoding issue (#1035) Nima Akbarzadeh 2025-02-28 08:17:54 +0100
  • 4c17039a1c
    Merge branch 'main' into main afourney 2025-02-27 23:16:28 -0800
  • a87fbf01ee
    add necessary imports (#861) tanreinama 2025-02-28 16:16:09 +0900
  • 143a3b2475 Fixed formatting. Adam Fourney 2025-02-27 23:15:11 -0800
  • b2002012f0
    Merge branch 'main' into main afourney 2025-02-27 23:12:08 -0800
  • d0ed74fdf4
    Fix UnboundLocalError in MarkItDown._convert (#1038) André Menezes 2025-02-28 07:11:27 +0000
  • 02ef00ce19
    Merge branch 'main' into patch-1 afourney 2025-02-27 23:09:50 -0800