5 Commits

Author SHA1 Message Date
Sergey Obukhov
35645f9ade Merge pull request #95 from mailgun/sergey/forge
open-sourcing email dataset
2016-06-10 15:45:29 -07:00
Sergey Obukhov
7c3d91301c open-sourcing email dataset 2016-06-10 14:10:53 -07:00
Sergey Obukhov
5bcf7403ad Merge pull request #94 from mailgun/obukhov-sergey-patch-1
Update README.rst
2016-05-31 20:16:13 -07:00
Sergey Obukhov
2d6c092b65 bump version 2016-05-31 18:42:47 -07:00
Sergey Obukhov
6d0689cad6 Update README.rst 2016-05-31 18:39:07 -07:00
2 changed files with 5 additions and 4 deletions

View File

@@ -117,9 +117,9 @@ or
train(init(), EXTRACTOR_DATA, EXTRACTOR_FILENAME) train(init(), EXTRACTOR_DATA, EXTRACTOR_FILENAME)
Open-source Dataset Open-source Dataset
------------------------- -------------------
Recently we started a `kuntzcamera`_ project to create an open-source, annotated dataset of raw emails. In the project we Recently we started a `forge`_ project to create an open-source, annotated dataset of raw emails. In the project we
used a subset of `ENRON`_ data, cleansed of private, health and financial information by `EDRM`_. At the moment over 190 used a subset of `ENRON`_ data, cleansed of private, health and financial information by `EDRM`_. At the moment over 190
emails are annotated. Any contribution and collaboration on the project are welcome. Once the dataset is ready we plan to emails are annotated. Any contribution and collaboration on the project are welcome. Once the dataset is ready we plan to
start using it for talon. start using it for talon.
@@ -127,7 +127,8 @@ start using it for talon.
.. _scikit-learn: http://scikit-learn.org .. _scikit-learn: http://scikit-learn.org
.. _ENRON: https://www.cs.cmu.edu/~enron/ .. _ENRON: https://www.cs.cmu.edu/~enron/
.. _EDRM: http://www.edrm.net/resources/data-sets/edrm-enron-email-data-set .. _EDRM: http://www.edrm.net/resources/data-sets/edrm-enron-email-data-set
.. _kuntzcamera: https://github.com/mailgun/kuntzcamera .. _forge: https://github.com/mailgun/forge
Research Research
-------- --------

View File

@@ -2,7 +2,7 @@ from setuptools import setup, find_packages
setup(name='talon', setup(name='talon',
version='1.2.8', version='1.2.10',
description=("Mailgun library " description=("Mailgun library "
"to extract message quotations and signatures."), "to extract message quotations and signatures."),
long_description=open("README.rst").read(), long_description=open("README.rst").read(),