Commit Graph

  • 8f61ce34e1 Merge 5261c83dc7 into 2285531110 Mišo Belica 2024-05-09 18:23:57 +0200
  • 2285531110 Add support for "lxml[html_clean]" v5.2 module master Mišo Belica 2024-05-09 16:13:40 +0000
  • 03376c4636 Add support for "lxml[html_clean]" v5.2 module Mišo Belica 2024-05-09 16:13:40 +0000
  • 5261c83dc7 Change docopt to the maitained one fix/unmaitaned-docopt Mišo Belica 2023-02-21 08:35:13 +0000
  • f6ea017975 Updating to drop testing 3.3 and 3.4 Python develop Craig Maloney 2019-08-02 09:26:17 -0400
  • 8b51105410 Remove the comment about Python versions Craig Maloney 2019-08-02 09:25:14 -0400
  • 9858d69ab4 Updating to drop testing 3.3 and 3.4 Python Craig Maloney 2019-08-02 09:14:59 -0400
  • d7634db822 Adding Python 3.7 to tests Craig Maloney 2019-08-02 08:30:53 -0400
  • a5be4ac2f1 Merge d99b82134c into 95a364c43b pictuga 2018-03-31 17:42:07 +0000
  • 0d8ff2d449 Merge a383c56c34 into 95a364c43b Jeffrey Guo 2018-03-31 17:40:23 +0000
  • 95a364c43b Fixing README link for TravisCI image Craig Maloney 2018-03-31 12:39:24 -0400
  • b18e4fbcef Removing moribund versions of Python Craig Maloney 2018-03-31 12:24:36 -0400
  • ea182eceb7 Adding tox.ini Craig Maloney 2018-03-31 12:21:22 -0400
  • abc3c0bbb9 Merge pull request #35 from bookieio/port-pytest Craig Maloney 2018-03-31 12:16:02 -0400
  • 501c35c8bc Pass tests for Python 3.7 Mišo Belica 2018-03-31 13:10:20 +0200
  • 0751fe0c97 Fixed failing tests Mišo Belica 2018-03-31 12:38:36 +0200
  • aa83825334 Tests migrated into pytest style Mišo Belica 2018-03-31 12:33:13 +0200
  • 48acf389b1 Prefer pytest over nosetest runner Mišo Belica 2018-03-31 11:25:22 +0200
  • d4fcb5053a Use new directive for wheel format Mišo Belica 2018-03-31 11:21:08 +0200
  • 2c123008fa Added new Python versions into TravisCI Mišo Belica 2018-03-31 11:19:54 +0200
  • 7cb038166a Ignore files starting with dot (hidden files anyway) Mišo Belica 2018-03-31 11:15:13 +0200
  • a383c56c34 fix-bug: missing-pick-sentences 郭江伟 2018-03-20 20:24:49 +0800
  • d99b82134c shrink_text is the same as normalize_whitespace pictuga 2015-04-07 17:58:28 +0800
  • 1cfa1090ae .strip() is useless before normalize_whitespace pictuga 2015-04-07 17:57:16 +0800
  • 58718c7dbd Make normalize_whitespace faster pictuga 2015-04-07 17:52:53 +0800
  • d91236681e Fix travis file for tests Richard Harding 2014-04-20 19:05:10 -0400
  • 35630387f2 Merge pull request #29 from jelmer/install-scripts Rick Harding 2014-04-20 19:01:15 -0400
  • c9c3b1f3a0 Merge pull request #28 from jelmer/manpage Rick Harding 2014-04-20 18:59:31 -0400
  • 55b6da8a57 Fix whatis. Jelmer Vernooij 2014-04-21 00:25:10 +0200
  • be2da44269 Fix installation of scripts. Jelmer Vernooij 2014-04-21 00:15:51 +0200
  • 1a3a7495b1 Add basic manual page. Jelmer Vernooij 2014-04-21 00:14:56 +0200
  • c1e2e529a9 Update tests to be py.test Richard Harding 2014-04-13 22:54:26 -0400
  • d7038a0845 Add 3.4 to the travis builds Richard Harding 2014-04-13 22:22:02 -0400
  • 6d747a312a Update to 0.1.20, remove tests from build Richard Harding 2014-04-13 22:08:13 -0400
  • 5e8d9b46be Update for version 0.1.19 Richard Harding 2014-04-13 21:14:12 -0400
  • badf625184 Merge 6f912830c0 into e2f3391dc3 Jelmer Vernooij 2014-04-09 01:43:59 +0000
  • 6f912830c0 Use chardet rather than charade. Jelmer Vernooij 2014-04-09 03:42:37 +0200
  • e2f3391dc3 Better decoding page into unicode Mišo Belica 2014-03-29 15:41:23 +0100
  • a4813821cf Merge 15bfc80898 into 5cb028ec93 Jeffrey Nappi 2014-03-29 15:40:01 +0000
  • 2b59b34f39 Merge 849f9cc914 into 5cb028ec93 Mišo Belica 2014-03-29 15:08:54 +0000
  • 5cb028ec93 Tests are executable with pytest framework Mišo Belica 2014-03-29 16:07:51 +0100
  • 6918eca90b Debug logging is less verbose Mišo Belica 2014-03-29 15:45:43 +0100
  • 14f1845b4e Monk was here :) Mišo Belica 2014-03-29 15:43:37 +0100
  • 849f9cc914 Better decoding page into unicode Mišo Belica 2014-03-29 15:41:23 +0100
  • 15bfc80898 Ignore invalid characters Jeffrey Nappi 2014-02-27 16:16:39 -0700
  • 6d8a76a2b9 Merge pull request #21 from miso-belica/upstream-sync Rick Harding 2014-01-23 13:47:01 -0800
  • 66022e2503 Updated dependecies and tests Mišo Belica 2014-01-23 21:56:46 +0100
  • e42cfbe487 Cleanups Mišo Belica 2014-01-23 21:38:54 +0100
  • d40a89a683 Use nose collector for tests Mišo Belica 2014-01-23 18:01:15 +0100
  • e6b3567417 Be ready for wheel binary packaging Mišo Belica 2014-01-23 17:59:41 +0100
  • 687d2ecfdf Merge branch 'master' of https://github.com/bookieio/breadability into upstream-sync Mišo Belica 2014-01-23 17:57:52 +0100
  • 6549a6c307 Added alternative "newspaper" into README Mišo Belica 2014-01-23 17:01:44 +0100
  • 6906f3b2fa Update logging to drop WARN to INFO Richard Harding 2014-01-22 21:32:11 -0500
  • 347f3ea0b5 Lint Richard Harding 2014-01-02 21:24:37 -0500
  • 17270db5f0 Add test for title Richard Harding 2014-01-02 20:46:24 -0500
  • 19d3ee634c Update readme to note py3 ready Richard Harding 2013-11-29 13:47:28 -0500
  • ca8bee0a7b Update to 0.1.15 Richard Harding 2013-11-29 13:43:27 -0500
  • 1fc153d850 Rename it back. Respect others Richard Harding 2013-11-29 13:34:22 -0500
  • 4cbde9cb5a Don't need the old versions any more Richard Harding 2013-11-29 12:25:34 -0500
  • f4fa0c1040 Working on merging/updating changelog, news, and makefile Richard Harding 2013-11-29 12:21:22 -0500
  • dc0493f99b Update to catch back up to craig's image helper Richard Harding 2013-11-29 12:08:26 -0500
  • 433195e122 Update sycning with the other branch Richard Harding 2013-11-29 11:58:34 -0500
  • e9485b6fdf Tests working, makefile back into play Richard Harding 2013-11-29 11:51:57 -0500
  • d6317cd2ce Sync up with the fork Richard Harding 2013-11-29 11:32:22 -0500
  • 5f1b39fe0b Cleanups [ci skip] Mišo Belica 2013-11-28 11:56:20 +0100
  • 09b4040578 Append sibling node only when it doesn't already exist Mišo Belica 2013-11-28 11:50:48 +0100
  • 3746ee5bb5 Treat images a little differently so they get more inclusion Mišo Belica 2013-11-28 11:39:12 +0100
  • 02160fe2ae Cleanup Mišo Belica 2013-11-28 11:06:41 +0100
  • 573a05f940 Added alternative "python-goose" into README Mišo Belica 2013-11-28 11:03:45 +0100
  • d138b6394e Cleanups Mišo Belica 2013-11-28 11:03:25 +0100
  • e5401d7ab2 Added URL into User-Agent string Mišo Belica 2013-11-28 10:51:48 +0100
  • d530acb8c6 I discovered maintainer meta-data parameter Mišo Belica 2013-11-28 10:46:12 +0100
  • c091249162 Changed execution of nosetests Mišo Belica 2013-11-28 10:45:22 +0100
  • 042779bd12 Update version to 0.1.14 Richard Harding 2013-11-07 21:01:39 -0500
  • 05e13a4834 Update to only append sibling if we don't already have it Richard Harding 2013-11-07 21:00:06 -0500
  • 952ea273c5 Update to version 0.1.13 origin/test_25_hour Richard Harding 2013-08-31 15:51:22 -0400
  • 9b9ec5b0e6 Treat images a little differently so they get more inclusion. Craig Maloney 2013-08-31 13:18:08 -0400
  • 2c463a754c Merge 0d22d12eb5 into 37c6c41d29 Craig Maloney 2013-08-31 12:38:46 -0700
  • 0d22d12eb5 Copy / Paste Craig Maloney 2013-08-31 15:38:34 -0400
  • 983861d1b0 Adding sweetshark test from issue #1 Craig Maloney 2013-08-31 15:28:20 -0400
  • 0b55c071a9 Added test for Business Insider article Craig Maloney 2013-08-31 13:25:55 -0400
  • 5703783de7 Treat images a little differently so they get more inclusion Craig Maloney 2013-08-31 13:18:08 -0400
  • 471db19a43 Added BTE tool into similar tools to readme Mišo Belica 2013-08-21 01:39:05 +0200
  • 43cc38dc7b Cleanup Mišo Belica 2013-08-21 01:38:24 +0200
  • 37c6c41d29 Update versions for 0.1.12 Richard Harding 2013-07-28 10:55:31 -0400
  • db7890639f Merge 05f2131df5 into 4f2b744a3a macmenot 2013-07-27 09:02:39 -0700
  • 4f2b744a3a Set urllib useragent string. macmenot 2013-07-15 21:41:10 +0100
  • 05f2131df5 Set urllib useragent string. macmenot 2013-07-15 21:41:10 +0100
  • 81ba7aec3c Create console scripts with python version suffix Mišo Belica 2013-05-04 23:02:06 +0200
  • 51df29f05d Write readable content into temp file in binary mode Mišo Belica 2013-05-04 22:21:10 +0200
  • 42530d4af7 Use py3k compatible urllib with own User-Agent header Mišo Belica 2013-05-04 22:19:13 +0200
  • 9ed02047dd Added string representation for empty scored node Mišo Belica 2013-04-22 19:01:27 +0200
  • 7630237b86 Added missing empty line Mišo Belica 2013-04-14 22:00:31 +0200
  • c34bc53d9e Updated list of similar tools Mišo Belica 2013-04-14 21:58:51 +0200
  • bf6cfef556 Renamed '_py3k.py' -> '_compat.py' Mišo Belica 2013-04-07 19:35:00 +0200
  • bd084a8e28 Fixed named argument name 'fragment' Mišo Belica 2013-04-07 19:30:32 +0200
  • 8f3ebf0950 Removed file with version number Mišo Belica 2013-04-07 19:29:11 +0200
  • 8c775fee7f Added new test article Mišo Belica 2013-04-05 22:38:19 +0200
  • c9afc38c49 Cleanups for function 'clean_document' Mišo Belica 2013-03-27 00:13:31 +0100
  • 5c20673d45 Don't remove h1/h2 elements from readable article Mišo Belica 2013-03-26 23:55:55 +0100