Commit Graph

  • b5e3397b88 Updating test to account for --original message-- case smitcona 2016-11-22 20:00:31 +00:00
  • 5685a4055a Improved algorithm smitcona 2016-11-22 19:56:57 +00:00
  • 97b72ef767 Adding in_header_block variable for reliability smitcona 2016-11-22 19:06:34 +00:00
  • 31489848be Remove print lines smitcona 2016-11-21 17:36:06 +00:00
  • e5988d447b Add space smitcona 2016-11-21 12:48:29 +00:00
  • adfed748ce split_emails function added, test added smitcona 2016-11-21 12:35:36 +00:00
  • 2444ba87c0 Merge pull request #111 from mailgun/sergey/tagscount v1.3.2 Sergey Obukhov 2016-09-14 11:06:29 -07:00
  • 534457e713 protect html_to_text as well Sergey Obukhov 2016-09-14 09:58:41 -07:00
  • ea82a9730e restrict html processing to a certain number of tags Sergey Obukhov 2016-09-14 09:33:30 -07:00
  • f04b872e14 Merge pull request #108 from mailgun/sergey/html5lib-fix v1.3.1 Sergey Obukhov 2016-08-22 18:10:35 -07:00
  • e61894e425 bump version Sergey Obukhov 2016-08-22 17:34:18 -07:00
  • 35fbdaadac use new parser each time we parse a document Sergey Obukhov 2016-08-22 16:25:04 -07:00
  • 8441bc7328 Merge pull request #106 from mailgun/sergey/html5lib v1.3.0 Sergey Obukhov 2016-08-19 15:58:07 -07:00
  • 37c95ff97b fallback untouched html if we can not parse html tree Sergey Obukhov 2016-08-19 11:38:12 -07:00
  • 5b1ca33c57 fix cssselect Sergey Obukhov 2016-08-16 17:09:07 -07:00
  • ec8e09b34e fix Sergey Obukhov 2016-08-15 20:31:04 -07:00
  • bcf97eccfa use html5lib to parse html Sergey Obukhov 2016-08-15 19:36:21 -07:00
  • f53b5cc7a6 Merge pull request #105 from mailgun/sergey/fromstring v1.2.16 Sergey Obukhov 2016-08-15 13:40:37 -07:00
  • 27adde7aa7 bump version Sergey Obukhov 2016-08-15 13:21:10 -07:00
  • a9719833e0 html with comment that has no parent crashes html_tree_to_text Sergey Obukhov 2016-08-12 17:40:12 -07:00
  • 7bf37090ca Merge pull request #101 from mailgun/sergey/empty-html v1.2.15 Sergey Obukhov 2016-08-12 12:18:50 -07:00
  • 44fcef7123 bump version Sergey Obukhov 2016-08-11 23:59:18 -07:00
  • 69a44b10a1 Merge branch 'master' into sergey/empty-html Sergey Obukhov 2016-08-11 23:58:11 -07:00
  • b085e3d049 Merge pull request #104 from mailgun/sergey/spaces Sergey Obukhov 2016-08-11 23:56:26 -07:00
  • 4b953bcddc fixes mailgun/talon#103 keep newlines when parsing html quotations Sergey Obukhov 2016-08-11 20:17:37 -07:00
  • 315eaa7080 if html stripped off quotations does not have readable text fallback to unparsed html Sergey Obukhov 2016-08-11 19:54:53 -07:00
  • 5a9bc967f1 Merge pull request #100 from mailgun/sergey/restrict v1.2.14 Sergey Obukhov 2016-08-11 16:08:03 -07:00
  • a0d7236d0b bump version and add a comment Sergey Obukhov 2016-08-11 15:49:09 -07:00
  • 21e9a31ffe add test Sergey Obukhov 2016-08-09 17:15:49 -07:00
  • 4ee46c0a97 do not parse html quotations if html is longer then certain threshold Sergey Obukhov 2016-08-09 17:08:58 -07:00
  • 10d9a930f9 Merge pull request #99 from mailgun/sergey/capitalized v1.2.12 Sergey Obukhov 2016-07-20 16:47:12 -07:00
  • a21ccdb21b consider word capitilized only if it is camel case - not all upper case Sergey Obukhov 2016-07-19 16:22:04 -07:00
  • 7cdd7a8f35 Merge pull request #98 from mailgun/sergey/1.2.11 v1.2.11 Sergey Obukhov 2016-07-19 16:22:24 -07:00
  • 01e03a47e0 version bump Sergey Obukhov 2016-07-19 15:51:46 -07:00
  • 1b9a71551a Merge pull request #97 from umairwaheed/strip-talon Sergey Obukhov 2016-07-19 15:46:56 -07:00
  • 911efd1db4 Move encoding detection inside if condition. Umair Khan 2016-07-19 09:44:40 +05:00
  • e61f0a68c4 Add six library to setup.py Umair Khan 2016-07-19 09:40:03 +05:00
  • cefbcffd59 Make tests/text_quotations_test.py compatible with Python 3. Umair Khan 2016-07-13 14:45:26 +05:00
  • 622a98d6d5 Make utils compatible with Python 3. Umair Khan 2016-07-13 13:00:24 +05:00
  • 7901f5d1dc Convert msg_body into unicode in preprocess. Umair Khan 2016-07-13 11:16:39 +05:00
  • 555c34d7a8 Make sure html_to_text processes bytes Umair Khan 2016-07-13 11:11:06 +05:00
  • dcc0d1de20 Convert msg_body to bytes in extract_from_html Umair Khan 2016-07-13 10:32:27 +05:00
  • 7bdf4d622b Only encode if str Umair Khan 2016-07-13 07:51:10 +05:00
  • 4a7207b0d0 Only convert to unicode if str Umair Khan 2016-07-13 07:46:07 +05:00
  • ad9c2ca0e8 Upgrade quotations.py Umair Khan 2016-07-13 00:08:39 +05:00
  • da998ddb60 Run modernizer on the code. Umair Khan 2016-07-12 17:25:46 +05:00
  • 07f68815df Allow installation of ML free version. Umair Khan 2016-07-11 16:03:03 +05:00
  • 35645f9ade Merge pull request #95 from mailgun/sergey/forge v1.2.10 Sergey Obukhov 2016-06-10 15:45:29 -07:00
  • 7c3d91301c open-sourcing email dataset Sergey Obukhov 2016-06-10 14:10:53 -07:00
  • 5bcf7403ad Merge pull request #94 from mailgun/obukhov-sergey-patch-1 v1.2.9 Sergey Obukhov 2016-05-31 20:16:13 -07:00
  • 2d6c092b65 bump version Sergey Obukhov 2016-05-31 18:42:47 -07:00
  • 6d0689cad6 Update README.rst Sergey Obukhov 2016-05-31 18:39:07 -07:00
  • 3f80e93ee0 Merge pull request #93 from mailgun/sergey/version-bump v1.2.8 Sergey Obukhov 2016-05-31 18:15:28 -07:00
  • 1b18abab1d bump Sergey Obukhov 2016-05-31 16:53:41 -07:00
  • 03dd5af5ab Merge pull request #91 from KevinCathcart/patch-1 Sergey Obukhov 2016-05-31 16:50:35 -07:00
  • dfba82b07c Merge pull request #92 from mailgun/obukhov-sergey-kuntzcamera Sergey Obukhov 2016-05-31 15:42:34 -07:00
  • 08ca02c87f Update README.rst Sergey Obukhov 2016-05-31 15:14:32 -07:00
  • b61f4ec095 Support outlook 2007/2010 running in en-us locale Kevin Cathcart 2016-05-23 17:23:53 -04:00
  • 9dbe6a494b Merge pull request #90 from mailgun/sergey/89 v1.2.7 Sergey Obukhov 2016-05-17 16:01:56 -07:00
  • 44e70939d6 fixes mailgun/talon#89 Sergey Obukhov 2016-05-17 15:31:01 -07:00
  • ab6066eafa Merge pull request #87 from mailgun/sergey/1.2.6 v1.2.6 Sergey Obukhov 2016-04-07 17:54:12 -07:00
  • 42258cdd36 bump up version Sergey Obukhov 2016-04-07 17:51:48 -07:00
  • d3de9e6893 Merge pull request #86 from dougkeen/master Sergey Obukhov 2016-04-07 17:47:38 -07:00
  • 333beb94af Fix #85 (exception when stripping gmail quotes) Doug Keen 2016-04-04 14:22:50 -07:00
  • f3c0942c49 Merge pull request #80 from mailgun/sergey/12 v1.2.5 Sergey Obukhov 2016-03-04 13:33:46 -08:00
  • 02adf53ab9 fixes mailgun/talon#12 Sergey Obukhov 2016-03-04 13:14:50 -08:00
  • 3497b5cab4 Merge pull request #79 from mailgun/sergey/version v1.2.4 Sergey Obukhov 2016-02-29 15:13:51 -08:00
  • 9c17dca17c bump version Sergey Obukhov 2016-02-29 14:50:52 -08:00
  • de342d3177 Merge pull request #78 from defkev/master Sergey Obukhov 2016-02-29 14:14:09 -08:00
  • 743b452daf Added Zimbra HTML quotation extraction defkev 2016-02-21 16:56:52 +01:00
  • c762f3c337 Merge pull request #77 from mailgun/sergey/fix-gmail-fwd v1.2.3 Sergey Obukhov 2016-02-19 19:08:37 -08:00
  • 31803d41bc fixes mailgun/talon#18 Sergey Obukhov 2016-02-19 18:30:43 -08:00
  • 2ecd9779fc bump up version v1.2.2 Sergey Obukhov 2016-02-19 18:32:07 -08:00
  • 5a7047233e Merge pull request #76 from mailgun/sergey/fix-date-splitter Sergey Obukhov 2016-02-19 18:28:23 -08:00
  • 999e9c3725 fixes mailgun/talon#19 Sergey Obukhov 2016-02-19 17:53:52 -08:00
  • f6940fe878 bump up version v1.2.1 Sergey Obukhov 2015-12-18 19:15:58 -08:00
  • ce65ff8fc8 Merge pull request #71 from clara-labs/ms-2010-issue Sergey Obukhov 2015-12-18 19:14:13 -08:00
  • eed6784f25 Merge pull request #70 from mailgun/sergey/gmail v1.2.0 Sergey Obukhov 2015-12-18 19:00:13 -08:00
  • 3d9ae356ea add more tests, make standard reply tests more relaxed Sergey Obukhov 2015-12-18 18:56:41 -08:00
  • f688d074b5 First pass at handling issue with ms outlook 2010 with unenclosed quoted text. Carlos Correa 2015-12-10 19:14:51 -08:00
  • 41457d8fbd fixes mailgun/talon#38 mailgun/talon#20 Sergey Obukhov 2015-12-05 00:37:02 -08:00
  • 2c416ecc0e Merge pull request #62 from tgwizard/better-support-for-scandinavian-languages v1.1.0 Sergey Obukhov 2015-10-14 21:48:10 -07:00
  • 3ab33c557b Merge pull request #65 from mailgun/sergey/cssselect Sergey Obukhov 2015-10-14 20:34:02 -07:00
  • 8db05f4950 add cssselect to dependencies Sergey Obukhov 2015-10-14 20:31:26 -07:00
  • 3d5bc82a03 Merge pull request #61 from tgwizard/fix-for-apple-mail Sergey Obukhov 2015-10-14 12:38:06 -07:00
  • 14e3a0d80b Add better support for Scandinavian languages Adam Renberg 2015-09-21 21:41:59 +02:00
  • fcd9e2716a Add fix for Apple Mail email format Adam Renberg 2015-09-21 21:33:55 +02:00
  • d62d633215 bump up version Sergey Obukhov 2015-09-21 09:55:51 -07:00
  • 3b0c9273c1 Merge pull request #60 from mailgun/sergey/26 Sergey Obukhov 2015-09-21 09:54:35 -07:00
  • e4c1c11845 remove print Sergey Obukhov 2015-09-21 09:52:47 -07:00
  • ae508fe0e5 fixes mailgun/talon#26 Sergey Obukhov 2015-09-21 09:51:26 -07:00
  • 2cb9b5399c bump up version Sergey Obukhov 2015-09-18 05:23:29 -07:00
  • 134c47f515 Merge pull request #59 from mailgun/sergey/43 Sergey Obukhov 2015-09-18 05:20:51 -07:00
  • d328c9d128 fixes mailgun/talon#43 Sergey Obukhov 2015-09-18 05:19:59 -07:00
  • 77b62b0fef Merge pull request #58 from mailgun/sergey/52 Sergey Obukhov 2015-09-18 04:48:50 -07:00
  • ad09b18f3f fixes mailgun/talon#52 Sergey Obukhov 2015-09-18 04:47:23 -07:00
  • b5af9c03a5 bump up version v1.0.7 Sergey Obukhov 2015-09-11 10:42:26 -07:00
  • 176c7e7532 Merge pull request #57 from mailgun/sergey/to_unicode Sergey Obukhov 2015-09-11 10:40:52 -07:00
  • 15976888a0 use precise encoding when converting to unicode Sergey Obukhov 2015-09-11 10:38:28 -07:00
  • 9bee502903 bump up version Sergey Obukhov 2015-09-11 06:27:12 -07:00