Phanindra Ramesh Challa
|
e756d55abf
|
Fixes issue #123
|
2016-12-27 13:53:40 +05:30 |
|
Sergey Obukhov
|
015c8d2a78
|
Merge pull request #120 from mailgun/sergey/talon-1.3.3
bump talon version
v1.3.3
|
2016-11-30 18:28:39 -08:00 |
|
Sergey Obukhov
|
5af846c13d
|
bump talon version
|
2016-11-30 12:56:06 -08:00 |
|
Sergey Obukhov
|
e69a9c7a54
|
Merge pull request #119 from conapart3/master
Addition of new split_email method for issue:115
|
2016-11-30 12:51:32 -08:00 |
|
conapart3
|
23cb2a9a53
|
Merge pull request #1 from conapart3/issue-115-date-split-in-headers
split_emails function added, test added
|
2016-11-22 20:02:54 +00:00 |
|
smitcona
|
b5e3397b88
|
Updating test to account for --original message-- case
|
2016-11-22 20:00:31 +00:00 |
|
smitcona
|
5685a4055a
|
Improved algorithm
|
2016-11-22 19:56:57 +00:00 |
|
smitcona
|
97b72ef767
|
Adding in_header_block variable for reliability
|
2016-11-22 19:06:34 +00:00 |
|
smitcona
|
31489848be
|
Remove print lines
|
2016-11-21 17:36:06 +00:00 |
|
smitcona
|
e5988d447b
|
Add space
|
2016-11-21 12:48:29 +00:00 |
|
smitcona
|
adfed748ce
|
split_emails function added, test added
|
2016-11-21 12:35:36 +00:00 |
|
Sergey Obukhov
|
2444ba87c0
|
Merge pull request #111 from mailgun/sergey/tagscount
restrict html processing to a certain number of tags
v1.3.2
|
2016-09-14 11:06:29 -07:00 |
|
Sergey Obukhov
|
534457e713
|
protect html_to_text as well
|
2016-09-14 09:58:41 -07:00 |
|
Sergey Obukhov
|
ea82a9730e
|
restrict html processing to a certain number of tags
|
2016-09-14 09:33:30 -07:00 |
|
Sergey Obukhov
|
f04b872e14
|
Merge pull request #108 from mailgun/sergey/html5lib-fix
use new parser each time we parse a document
v1.3.1
|
2016-08-22 18:10:35 -07:00 |
|
Sergey Obukhov
|
e61894e425
|
bump version
|
2016-08-22 17:34:18 -07:00 |
|
Sergey Obukhov
|
35fbdaadac
|
use new parser each time we parse a document
|
2016-08-22 16:25:04 -07:00 |
|
Sergey Obukhov
|
8441bc7328
|
Merge pull request #106 from mailgun/sergey/html5lib
use html5lib to parse html
v1.3.0
|
2016-08-19 15:58:07 -07:00 |
|
Sergey Obukhov
|
37c95ff97b
|
fallback untouched html if we can not parse html tree
|
2016-08-19 11:38:12 -07:00 |
|
Sergey Obukhov
|
5b1ca33c57
|
fix cssselect
|
2016-08-16 17:11:41 -07:00 |
|
Sergey Obukhov
|
ec8e09b34e
|
fix
|
2016-08-15 20:31:04 -07:00 |
|
Sergey Obukhov
|
bcf97eccfa
|
use html5lib to parse html
|
2016-08-15 19:36:21 -07:00 |
|
Sergey Obukhov
|
f53b5cc7a6
|
Merge pull request #105 from mailgun/sergey/fromstring
html with comment that has no parent crashes html_tree_to_text
v1.2.16
|
2016-08-15 13:40:37 -07:00 |
|
Sergey Obukhov
|
27adde7aa7
|
bump version
|
2016-08-15 13:21:10 -07:00 |
|
Sergey Obukhov
|
a9719833e0
|
html with comment that has no parent crashes html_tree_to_text
|
2016-08-12 17:40:12 -07:00 |
|
Sergey Obukhov
|
7bf37090ca
|
Merge pull request #101 from mailgun/sergey/empty-html
if html stripped off quotations does not have readable text fallback …
v1.2.15
|
2016-08-12 12:18:50 -07:00 |
|
Sergey Obukhov
|
44fcef7123
|
bump version
|
2016-08-11 23:59:18 -07:00 |
|
Sergey Obukhov
|
69a44b10a1
|
Merge branch 'master' into sergey/empty-html
|
2016-08-11 23:58:11 -07:00 |
|
Sergey Obukhov
|
b085e3d049
|
Merge pull request #104 from mailgun/sergey/spaces
fixes mailgun/talon#103 keep newlines when parsing html quotations
|
2016-08-11 23:56:26 -07:00 |
|
Sergey Obukhov
|
4b953bcddc
|
fixes mailgun/talon#103 keep newlines when parsing html quotations
|
2016-08-11 20:17:37 -07:00 |
|
Sergey Obukhov
|
315eaa7080
|
if html stripped off quotations does not have readable text fallback to unparsed html
|
2016-08-11 19:55:23 -07:00 |
|
Sergey Obukhov
|
5a9bc967f1
|
Merge pull request #100 from mailgun/sergey/restrict
do not parse html quotations if html is longer than certain threshold
v1.2.14
|
2016-08-11 16:08:03 -07:00 |
|
Sergey Obukhov
|
a0d7236d0b
|
bump version and add a comment
|
2016-08-11 15:49:09 -07:00 |
|
Sergey Obukhov
|
21e9a31ffe
|
add test
|
2016-08-09 17:15:49 -07:00 |
|
Sergey Obukhov
|
4ee46c0a97
|
do not parse html quotations if html is longer than certain threshold
|
2016-08-09 17:08:58 -07:00 |
|
Sergey Obukhov
|
10d9a930f9
|
Merge pull request #99 from mailgun/sergey/capitalized
consider word capitalized only if it is camel case - not all upper case
v1.2.12
|
2016-07-20 16:47:12 -07:00 |
|
Sergey Obukhov
|
a21ccdb21b
|
consider word capitalized only if it is camel case - not all upper case
|
2016-07-19 17:37:36 -07:00 |
|
Sergey Obukhov
|
7cdd7a8f35
|
Merge pull request #98 from mailgun/sergey/1.2.11
version bump
v1.2.11
|
2016-07-19 16:22:24 -07:00 |
|
Sergey Obukhov
|
01e03a47e0
|
version bump
|
2016-07-19 15:51:46 -07:00 |
|
Sergey Obukhov
|
1b9a71551a
|
Merge pull request #97 from umairwaheed/strip-talon
Strip down Talon
|
2016-07-19 15:46:56 -07:00 |
|
Umair Khan
|
911efd1db4
|
Move encoding detection inside if condition.
|
2016-07-19 09:44:40 +05:00 |
|
Umair Khan
|
e61f0a68c4
|
Add six library to setup.py
|
2016-07-19 09:40:03 +05:00 |
|
Umair Khan
|
cefbcffd59
|
Make tests/text_quotations_test.py compatible with Python 3.
|
2016-07-13 14:45:26 +05:00 |
|
Umair Khan
|
622a98d6d5
|
Make utils compatible with Python 3.
|
2016-07-13 13:00:24 +05:00 |
|
Umair Khan
|
7901f5d1dc
|
Convert msg_body into unicode in preprocess.
|
2016-07-13 11:18:10 +05:00 |
|
Umair Khan
|
555c34d7a8
|
Make sure html_to_text processes bytes
|
2016-07-13 11:18:10 +05:00 |
|
Umair Khan
|
dcc0d1de20
|
Convert msg_body to bytes in extract_from_html
|
2016-07-13 11:18:06 +05:00 |
|
Umair Khan
|
7bdf4d622b
|
Only encode if str
|
2016-07-13 08:01:47 +05:00 |
|
Umair Khan
|
4a7207b0d0
|
Only convert to unicode if str
|
2016-07-13 08:01:47 +05:00 |
|
Umair Khan
|
ad9c2ca0e8
|
Upgrade quotations.py
|
2016-07-13 08:01:44 +05:00 |
|