| 
							
							
								 smitcona | 984c036b6e | Set the marker back to 'm' rather than 't' if it matches the QUOT_PATTERN. Updated test case. | 2017-02-01 18:28:19 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | a403ecb5c9 | Adding two level indentation test | 2017-02-01 18:09:35 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | a44713409c | Added additional case for testing new functionality of split_emails() | 2017-02-01 17:40:59 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | 567467b8ed | Update comment | 2017-02-01 17:29:05 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | 139edd6104 | Add new method which marks as splitlines, lines which are splitlines but start with email quotation indents ("> ") | 2017-02-01 17:16:30 +00:00 |  | 
			
				
					| 
							
							
								 Phanindra Ramesh Challa | e756d55abf | Fixes issue #123 | 2016-12-27 13:53:40 +05:30 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 015c8d2a78 | Merge pull request #120 from mailgun/sergey/talon-1.3.3 bump talon versionv1.3.3 | 2016-11-30 18:28:39 -08:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 5af846c13d | bump talon version | 2016-11-30 12:56:06 -08:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | e69a9c7a54 | Merge pull request #119 from conapart3/master Addition of new split_email method for issue:115 | 2016-11-30 12:51:32 -08:00 |  | 
			
				
					| 
							
							
								 conapart3 | 23cb2a9a53 | Merge pull request #1 from conapart3/issue-115-date-split-in-headers split_emails function added, test added | 2016-11-22 20:02:54 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | b5e3397b88 | Updating test to account for --original message-- case | 2016-11-22 20:00:31 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | 5685a4055a | Improved algorithm | 2016-11-22 19:56:57 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | 97b72ef767 | Adding in_header_block variable for reliability | 2016-11-22 19:06:34 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | 31489848be | Remove print lines | 2016-11-21 17:36:06 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | e5988d447b | Add space | 2016-11-21 12:48:29 +00:00 |  | 
			
				
					| 
							
							
								 smitcona | adfed748ce | split_emails function added, test added | 2016-11-21 12:35:36 +00:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 2444ba87c0 | Merge pull request #111 from mailgun/sergey/tagscount restrict html processing to a certain number of tagsv1.3.2 | 2016-09-14 11:06:29 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 534457e713 | protect html_to_text as well | 2016-09-14 09:58:41 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | ea82a9730e | restrict html processing to a certain number of tags | 2016-09-14 09:33:30 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | f04b872e14 | Merge pull request #108 from mailgun/sergey/html5lib-fix use new parser each time we parse a documentv1.3.1 | 2016-08-22 18:10:35 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | e61894e425 | bump version | 2016-08-22 17:34:18 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 35fbdaadac | use new parser each time we parse a document | 2016-08-22 16:25:04 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 8441bc7328 | Merge pull request #106 from mailgun/sergey/html5lib use html5lib to parse htmlv1.3.0 | 2016-08-19 15:58:07 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 37c95ff97b | fallback untouched html if we can not parse html tree | 2016-08-19 11:38:12 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 5b1ca33c57 | fix cssselect | 2016-08-16 17:11:41 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | ec8e09b34e | fix | 2016-08-15 20:31:04 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | bcf97eccfa | use html5lib to parse html | 2016-08-15 19:36:21 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | f53b5cc7a6 | Merge pull request #105 from mailgun/sergey/fromstring html with comment that has no parent crashes html_tree_to_textv1.2.16 | 2016-08-15 13:40:37 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 27adde7aa7 | bump version | 2016-08-15 13:21:10 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | a9719833e0 | html with comment that has no parent crashes html_tree_to_text | 2016-08-12 17:40:12 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 7bf37090ca | Merge pull request #101 from mailgun/sergey/empty-html if html stripped off quotations does not have readable text fallback …v1.2.15 | 2016-08-12 12:18:50 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 44fcef7123 | bump version | 2016-08-11 23:59:18 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 69a44b10a1 | Merge branch 'master' into sergey/empty-html | 2016-08-11 23:58:11 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | b085e3d049 | Merge pull request #104 from mailgun/sergey/spaces fixes mailgun/talon#103 keep newlines when parsing html quotations | 2016-08-11 23:56:26 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 4b953bcddc | fixes mailgun/talon#103 keep newlines when parsing html quotations | 2016-08-11 20:17:37 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 315eaa7080 | if html stripped off quotations does not have readable text fallback to unparsed html | 2016-08-11 19:55:23 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 5a9bc967f1 | Merge pull request #100 from mailgun/sergey/restrict do not parse html quotations if html is longer then certain thresholdv1.2.14 | 2016-08-11 16:08:03 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | a0d7236d0b | bump version and add a comment | 2016-08-11 15:49:09 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 21e9a31ffe | add test | 2016-08-09 17:15:49 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 4ee46c0a97 | do not parse html quotations if html is longer then certain threshold | 2016-08-09 17:08:58 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 10d9a930f9 | Merge pull request #99 from mailgun/sergey/capitalized consider word capitilized only if it is camel case - not all upper casev1.2.12 | 2016-07-20 16:47:12 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | a21ccdb21b | consider word capitilized only if it is camel case - not all upper case | 2016-07-19 17:37:36 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 7cdd7a8f35 | Merge pull request #98 from mailgun/sergey/1.2.11 version bumpv1.2.11 | 2016-07-19 16:22:24 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 01e03a47e0 | version bump | 2016-07-19 15:51:46 -07:00 |  | 
			
				
					| 
							
							
								 Sergey Obukhov | 1b9a71551a | Merge pull request #97 from umairwaheed/strip-talon Strip down Talon | 2016-07-19 15:46:56 -07:00 |  | 
			
				
					| 
							
							
								 Umair Khan | 911efd1db4 | Move encoding detection inside if condition. | 2016-07-19 09:44:40 +05:00 |  | 
			
				
					| 
							
							
								 Umair Khan | e61f0a68c4 | Add six library to setup.py | 2016-07-19 09:40:03 +05:00 |  | 
			
				
					| 
							
							
								 Umair Khan | cefbcffd59 | Make tests/text_quotations_test.py compatible with Python 3. | 2016-07-13 14:45:26 +05:00 |  | 
			
				
					| 
							
							
								 Umair Khan | 622a98d6d5 | Make utils compatible with Python 3. | 2016-07-13 13:00:24 +05:00 |  | 
			
				
					| 
							
							
								 Umair Khan | 7901f5d1dc | Convert msg_body into unicode in preprocess. | 2016-07-13 11:18:10 +05:00 |  |