Sergey Obukhov
2c416ecc0e
Merge pull request #62 from tgwizard/better-support-for-scandinavian-languages
...
Add better support for Scandinavian languages
2015-10-14 21:48:10 -07:00
Adam Renberg
14e3a0d80b
Add better support for Scandinavian languages
...
This is a port of https://github.com/tictail/claw/pull/6 by @simonflore.
2015-09-21 21:42:01 +02:00
Adam Renberg
fcd9e2716a
Add fix for Apple Mail email format
...
Handles messages that have an initial > on the "date line".
2015-09-21 21:33:57 +02:00
Sergey Obukhov
ae508fe0e5
fixes mailgun/talon#26
2015-09-21 09:51:26 -07:00
Sergey Obukhov
d328c9d128
fixes mailgun/talon#43
2015-09-18 05:19:59 -07:00
Sergey Obukhov
ad09b18f3f
fixes mailgun/talon#52
2015-09-18 04:47:23 -07:00
Sergey Obukhov
15976888a0
use precise encoding when converting to unicode
2015-09-11 10:38:28 -07:00
Sergey Obukhov
385285e5de
process first 1000 lines for long messages, support for German and Dutch
2015-09-11 06:17:14 -07:00
Sergey Obukhov
cc98befba5
Merge pull request #50 from Easy-D/preserve-regular-blockquotes
...
Preserve regular blockquotes
2015-09-11 04:49:36 -07:00
Easy-D
ed6b861a47
add failing test that shows how regular blockquotes are removed
2015-07-16 21:24:49 +02:00
Oliver Song
7ea773e6a9
Fix iPhone test
2015-07-02 21:49:09 -04:00
Scott MacVicar
8b1f87b1c0
Get this building and passing tests
...
Changes:
* add .DS_Store to .gitignore
* Decode base64 encoded emails for tests
* Pick a version of scikit since the pickled classifiers are based on that
* Add missing numpy and scipy dependencies
2015-07-02 21:49:09 -04:00
Alex Riina
215e36e9ed
allow higher version of regex library
2015-07-02 21:49:09 -04:00
Alex Riina
e3ef622031
remove unused regex
2015-07-02 21:49:09 -04:00
Alex Riina
f16760c466
Remove flanker and replace PyML with scikit-learn
...
I was never able to install PyML successfully, and its SourceForge
distribution and lack of Python 3 support convinced me that scikit-learn would
be a fine substitute. Flanker was also difficult for me to install and seemed
to be used only in the tests, so I removed it as well to get into a position
where I could run the tests. As of this commit, only one test is not passing
(test_standard_replies with android.eml), though I'm not familiar with the
`email` library yet.
2015-07-02 21:49:09 -04:00
Alex Riina
b36287e573
clean up style and extra imports
2015-07-02 21:49:09 -04:00
Alex Riina
4df7aa284b
remove extra imports
2015-07-02 21:49:09 -04:00
Simon
072a440837
Test cases for new patterns
2015-04-15 13:55:17 +02:00
szymonsobczak
3c9ef4653f
some more French formats
2015-02-24 12:18:54 +01:00
szymonsobczak
b16060261a
support some Polish and French formats
2015-02-24 11:39:12 +01:00
Jeremy Schlatter
3768d7ba31
make a separate test function for each language
2014-12-30 14:41:20 -08:00
Jeremy Schlatter
613d1fc815
Add extra splitter expressions and tests for German and Danish.
...
Also some refactoring to make it a bit easier to add more languages.
2014-12-23 15:44:04 -08:00
Sergey Obukhov
170f11038b
initial commit
2014-07-23 21:12:54 -07:00