Sergey Obukhov
8138ea9a60
fix text with Date: misclassified as quotations splitter
2019-01-18 16:49:39 +03:00
Sergey Obukhov
c171f9a875
Merge pull request #169 from Savageman/patch-2
...
Use regex match to detect outlook 2007, 2010, 2013
2018-11-05 10:43:20 +03:00
Sergey Obukhov
3f97a8b8ff
Merge branch 'master' into patch-2
2018-11-05 10:42:00 +03:00
Esperat Julian
1147767ff3
Fix regression: windows mail format was left forgotten
...
Missing a | at the end of the regex, so next lines are part of the global search.
2018-11-04 19:42:12 +01:00
Sergey Obukhov
6a304215c3
Merge pull request #177 from mailgun/obukhov-sergey-patch-1
...
Update Readme with how to retrain on your own data
v1.4.6
2018-11-02 15:22:18 +03:00
Sergey Obukhov
31714506bd
Update Readme with how to retrain on your own data
2018-11-02 15:21:36 +03:00
Sergey Obukhov
403d80cf3b
Merge pull request #161 from glaand/master
...
Fix: Unicode strings with encoding declaration are not supported. Please use bytes input or XML fragments without declaration.
2018-11-02 15:03:02 +03:00
Sergey Obukhov
7cf20f2877
Merge branch 'master' into master
2018-11-02 14:52:38 +03:00
Sergey Obukhov
afff08b017
Merge branch 'master' into patch-2
2018-11-02 09:13:42 +03:00
Sergey Obukhov
685abb1905
Merge pull request #171 from gabriellima95/Add-Portuguese-Language
...
Add Portuguese language to quotations
2018-11-02 09:12:43 +03:00
Sergey Obukhov
41990727a3
Merge branch 'master' into Add-Portuguese-Language
2018-11-02 09:11:07 +03:00
Sergey Obukhov
b113d8ab33
Merge pull request #172 from ad-m/patch-1
...
Fix catastrophic backtracking in regexp
2018-11-02 09:09:49 +03:00
Adam Dobrawy
7bd0e9cc2f
Fix catastrophic backtracking in regexp
...
Co-Author: @Nipsuli
2018-09-21 22:00:10 +02:00
gabriellima95
1e030a51d4
Add Portuguese language to quotations
2018-09-11 15:27:39 -03:00
Esperat Julian
238a5de5cc
Use regex match to detect outlook 2007, 2010, 2013
...
I encountered a variant of the outlook quotations with a space after the semicolon.
To prevent multiplying the number of rules, I implemented a regex match instead (I found how to here: https://stackoverflow.com/a/34093801/211204 ).
I documented all the different variants as cleanly as I could.
2018-08-31 12:39:52 +02:00
André Glatzl
53b24ffb3d
Cut out first some encoding html tags such as xml and doctype for avoiding conflict with unicode decoding
2017-12-19 15:15:10 +01:00
Sergey Obukhov
a7404afbcb
Merge pull request #155 from mailgun/sergey/appointment
...
fix appointments in text
v1.4.5
2017-10-23 16:34:08 -07:00
Sergey Obukhov
0e6d5f993c
fix appointments in text
2017-10-23 16:32:42 -07:00
Sergey Obukhov
60637ff13a
Merge pull request #152 from mailgun/sergey/v1.4.4
...
bump version
v1.4.4
2017-08-24 16:00:05 -07:00
Sergey Obukhov
df8259e3fe
bump version
2017-08-24 15:58:53 -07:00
Sergey Obukhov
aab3b1cc75
Merge pull request #150 from ezrapagel/fix_greedy_dash_regex
...
android_wrote regex incorrectly matching
2017-08-24 15:52:29 -07:00
Sergey Obukhov
9492b39f2d
Merge branch 'master' into fix_greedy_dash_regex
2017-08-24 15:39:28 -07:00
Sergey Obukhov
b9ac866ea7
Merge pull request #151 from mailgun/sergey/reshape
...
reshape data as suggested by sklearn
v1.4.3
2017-08-24 12:04:58 -07:00
Sergey Obukhov
678517dd89
reshape data as suggested by sklearn
2017-08-24 12:03:47 -07:00
Ezra Pagel
221774c6f8
android_wrote regex was incorrectly iterating characters in 'wrote', resulting in greedy regex that
...
matched many strings with dashes
2017-08-21 12:47:06 -05:00
Sergey Obukhov
a2aa345712
Merge pull request #148 from mailgun/sergey/v1.4.2
...
bump version after adding support for Vietnamese format
v1.4.2
2017-07-10 11:44:46 -07:00
Sergey Obukhov
d998beaff3
bump version after adding support for Vietnamese format
2017-07-10 11:42:52 -07:00
Sergey Obukhov
a379bc4e7c
Merge pull request #147 from hnx116/master
...
add support for Vietnamese reply format
2017-07-10 11:40:04 -07:00
Hung Nguyen
b8e1894f3b
add test case
2017-07-10 13:28:33 +07:00
Hung Nguyen
0b5a44090f
add support for Vietnamese reply format
2017-07-10 11:18:57 +07:00
Sergey Obukhov
b40835eca2
Merge pull request #145 from mailgun/sergey/outlook-2013-version-bump
...
bump version after merging outlook 2013 support PR
v1.4.1
2017-06-18 22:56:16 -07:00
Sergey Obukhov
b38562c7cc
bump version after merging outlook 2013 support PR
2017-06-18 22:55:15 -07:00
Sergey Obukhov
70e9fb415e
Merge pull request #139 from Savageman/patch-1
...
Added Outlook 2013 rules
2017-06-18 22:53:18 -07:00
Sergey Obukhov
64612099cd
Merge branch 'master' into patch-1
2017-06-18 22:51:46 -07:00
Sergey Obukhov
45c20f979d
Merge pull request #144 from mailgun/sergey/python3-support-version-bump
...
bump version after merging python 3 support PR
v1.4.0
2017-06-18 22:49:20 -07:00
Sergey Obukhov
743c76f159
bump version after merging python 3 support PR
2017-06-18 22:48:12 -07:00
Sergey Obukhov
bc5dad75d3
Merge pull request #141 from yfilali/master
...
Python 3 compatibility up to 3.6.1
2017-06-18 22:44:07 -07:00
Yacine Filali
4acf05cf28
Only use load compat if we can't load the classifier
2017-05-24 13:29:59 -07:00
Yacine Filali
f5f7264077
Can now handle read only classifier data as well
2017-05-24 13:22:24 -07:00
Yacine Filali
4364bebf38
Added exception checking for pickle format conversion
2017-05-24 10:26:33 -07:00
Yacine Filali
15e61768f2
Encoding fixes
2017-05-23 16:17:39 -07:00
Yacine Filali
dd0a0f5c4d
Python 2.7 backward compat
2017-05-23 16:10:13 -07:00
Yacine Filali
086f5ba43b
Updated talon for Python 3
2017-05-23 15:39:50 -07:00
Esperat Julian
e16dcf629e
Added Outlook 2013 rules
...
Only the border color changes (compared to Outlook 2007, 2010) from `#B5C4DF` to `#E1E1E1`.
2017-04-27 11:34:01 +02:00
Sergey Obukhov
f16ae5110b
Merge pull request #138 from mailgun/sergey/v1.3.7
...
bumped talon version
v1.3.7
2017-04-25 11:49:29 -07:00
Sergey Obukhov
ab5cbe5ec3
bumped talon version
2017-04-25 11:43:55 -07:00
Sergey Obukhov
be5da92f16
Merge pull request #135 from esetnik/polymail_support
...
Polymail Quote Support
2017-04-25 11:34:47 -07:00
Sergey Obukhov
95954a65a0
Merge branch 'master' into polymail_support
2017-04-25 11:30:53 -07:00
Sergey Obukhov
0b55e8fa77
Merge pull request #137 from mailgun/sergey/chardet
...
loosen the encoding requirement for detect_encoding
v1.3.6
2017-04-25 11:29:06 -07:00
Sergey Obukhov
6f159e8959
loosen the encoding requirement for detect_encoding
2017-04-25 11:19:01 -07:00