mirror of https://github.com/kemayo/leech synced 2025-12-15 21:05:59 +01:00
Commit graph

375 commits

Author SHA1 Message Date
IdanDor
6d7b5ffcf0 Removed trailing whitespace. 2021-01-23 13:30:03 +02:00
IdanDor
1afac50437 Made arbitrary sites no longer leak memory and fixed worm epub.
Each `Chapter` object held a reference to the entire page tree, so the program's RAM usage grew substantially as chapters accumulated.

Switched Worm to use next_selector so the chapters are correctly ordered, E.2 is not skipped, and the download does not crash due to the `?share=twitter` URL that was matched before.

Fixed Worm titles.
2021-01-23 12:12:48 +02:00
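The leak described in this commit can be illustrated with a minimal sketch. It is not the project's actual code: `PageTree`, `LeakyChapter`, `LeanChapter`, and `tree_survives` are hypothetical names, and a plain object stands in for the parsed page. The point is only that a chapter holding a node keeps the whole tree alive, while a chapter holding extracted text does not.

```python
import gc
import weakref


class PageTree:
    """Hypothetical stand-in for a parsed HTML page."""

    def __init__(self, html: str) -> None:
        self.html = html

    def text(self) -> str:
        # Pretend this extracts the chapter body as a plain string.
        return self.html


class LeakyChapter:
    """Keeps the parsed tree reachable for as long as the chapter lives."""

    def __init__(self, title: str, tree: PageTree) -> None:
        self.title = title
        self.tree = tree


class LeanChapter:
    """Stores only the extracted text, so the tree can be freed."""

    def __init__(self, title: str, contents: str) -> None:
        self.title = title
        self.contents = contents


def tree_survives(make_chapter) -> bool:
    """Return True if the tree is still alive while the chapter exists."""
    tree = PageTree("<p>Some chapter text</p>")
    tree_ref = weakref.ref(tree)
    chapter = make_chapter(tree)
    del tree
    gc.collect()
    alive = tree_ref() is not None
    del chapter
    return alive


leaky = tree_survives(lambda t: LeakyChapter("One", t))
lean = tree_survives(lambda t: LeanChapter("One", t.text()))
print(leaky, lean)
```

With hundreds of chapters, each pinning its own page tree, the first pattern would plausibly account for the growth in memory the commit message describes.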
Kevin Turner
dc13946a77 have Travis CI test more things
- run the --help command
- try making a cover
- build the distribution and show what it made
2021-01-14 21:32:27 -08:00
Kevin Turner
f909472509 provide "leech" as a runnable script 2021-01-14 21:21:31 -08:00
David Lynch
377e6ff76c Update dependencies on attrs and Pillow to latest versions 2020-11-01 21:13:20 -06:00
David Lynch
4429049ade Fix the Vacant Throne example as well 2020-09-08 23:07:21 -05:00
David Lynch
c208e33752 Arbitrary: strip all namespaced elements
This is `fb:like` and similar, which break some epub readers.

Refs: #41, #43
2020-09-08 23:04:47 -05:00
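As a rough illustration of what stripping namespaced elements means, here is a sketch using only the standard library's `html.parser`. It is an assumption-laden stand-in, not the project's implementation: the class name `NamespacedStripper` and the helper `strip_namespaced` are hypothetical, an element counts as namespaced when its tag name contains a colon, and comments and doctypes are simply dropped.

```python
from html.parser import HTMLParser


class NamespacedStripper(HTMLParser):
    """Rebuilds markup while dropping elements such as <fb:like>."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=False)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a namespaced element

    def handle_starttag(self, tag, attrs):
        if ":" in tag:
            self.skip_depth += 1
        elif not self.skip_depth:
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if ":" in tag:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif not self.skip_depth:
            self.out.append(f"</{tag}>")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags never change the nesting depth.
        if ":" not in tag and not self.skip_depth:
            self.out.append(self.get_starttag_text())

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(data)

    def handle_entityref(self, name):
        if not self.skip_depth:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_depth:
            self.out.append(f"&#{name};")


def strip_namespaced(html: str) -> str:
    parser = NamespacedStripper()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


cleaned = strip_namespaced(
    '<p>Hello <fb:like href="x">like me</fb:like>world &amp; all<br/></p>'
)
print(cleaned)
```

Dropping the element together with its contents is a design choice of this sketch; a different approach could keep the inner text and remove only the namespaced tags.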
David Lynch
9c9877ed26 Example of a smarter approach to books with a "next" link
Authors are often strangely bad at having an accurate table of
contents. (E.g. practical guide has at least one mislinked chapter in
there.) Show how to follow a "next" link, stopping when hitting a
certain URL.

For practical guide, this also has the benefit of dropping in the extra
chapters where they were originally experienced.
2020-09-08 22:15:44 -05:00
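The "next" link approach described in this commit can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's code: `follow_next_links`, its parameters, and the in-memory `SITE` mapping are all hypothetical, and the stop URL is treated as exclusive (the walk ends before fetching it).

```python
from typing import Callable, Iterator, Optional, Tuple

# (chapter title, URL of the "next" link, or None on the last page)
Page = Tuple[str, Optional[str]]


def follow_next_links(
    start_url: str,
    fetch: Callable[[str], Page],
    stop_url: Optional[str] = None,
    max_pages: int = 1000,
) -> Iterator[Tuple[str, str]]:
    """Yield (url, title) pairs by walking "next" links from start_url."""
    seen = set()
    url: Optional[str] = start_url
    # Stop at the end of the chain, at stop_url, on a loop, or at the page cap.
    while url and url != stop_url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        title, next_url = fetch(url)
        yield url, title
        url = next_url


# Toy site: a real fetch function would download and parse each page.
SITE = {
    "/ch1": ("Chapter 1", "/ch2"),
    "/ch2": ("Chapter 2", "/extra1"),
    "/extra1": ("Extra Chapter", "/ch3"),
    "/ch3": ("Chapter 3", "/book2-prologue"),
    "/book2-prologue": ("Book 2 Prologue", None),
}

chapters = list(
    follow_next_links("/ch1", SITE.__getitem__, stop_url="/book2-prologue")
)
for url, title in chapters:
    print(url, title)
```

Because the walk follows reading order rather than a table of contents, the extra chapter appears between Chapter 2 and Chapter 3, which matches the benefit the commit message mentions. The `seen` set and `max_pages` cap are safeguards added for this sketch in case a site links back to an earlier page.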
David Lynch
91747edb53 Deathworlders example is also now non-functional
Site seems to have moved... and now has an official epub download anyway
2020-09-08 21:29:38 -05:00
David Lynch
9da5c60deb Remove non-functional example
Fixes #40
2020-09-08 21:25:19 -05:00
David Lynch
988368bb66 Better xenforo blockquote chrome removal 2020-08-18 13:21:01 -05:00
David Lynch
ffc6dccaac Update urllib3 dependency 2020-08-10 09:41:30 -05:00
David Lynch
0256516702
Merge pull request #39 from kemayo/poetry
Use poetry rather than pip
2020-07-16 09:18:22 -05:00
David Lynch
a8c47cda08 Use poetry rather than pip 2020-07-15 12:08:42 -05:00
David Lynch
983152e408 Further examples 2020-07-15 11:18:09 -05:00
David Lynch
2103f37cfb AO3: fallback for single-chapter works 2020-05-04 00:31:19 -05:00
David Lynch
64fbdac4cd A few arbitrary examples 2020-04-29 17:55:20 -05:00
David Lynch
6fbdc8843d Make arbitrary site chapter-title selectors more resilient 2020-04-29 17:55:20 -05:00
David Lynch
6631095726 Fiction.live: niche URLs
* occasional stories with "Sci-fi" in the URL instead of "stories"
* rare cases of `-` in the work id

Fixes #31
2019-11-14 14:45:19 -06:00
David Lynch
a856f9d0f8 Fiction.live: account for a weird rare bug/possibility in votes
Also, add a bunch of error handling / logging to the section-parsing to
avoid this in the future.

Fixes #30
2019-11-07 09:34:39 -06:00
David Lynch
f89f5163b5 Fiction.live: Fix choices array check
Fixes #29
2019-11-05 15:02:09 -06:00
David Lynch
685120b17d Add .venv to .gitignore 2019-10-31 00:43:23 -05:00
David Lynch
4861ffbd7e Fiction.live can have votes for absent choices
Fixes #28.
2019-10-29 08:17:01 -05:00
David Lynch
dc10e4cf17 FFN: less-destructive attribute clearing 2019-10-17 22:29:01 -05:00
David Lynch
6cc1620fb3 Spec-compliance: metadata shouldn't be compressed 2019-10-17 22:28:46 -05:00
David Lynch
5c9fe18157 Travis config: bump python version 2019-10-15 11:17:49 -05:00
David Lynch
7208cfdaaf Minor readability improvement: use f-strings 2019-10-15 11:14:27 -05:00
David Lynch
c584988994 Update dependencies 2019-10-14 00:40:34 -05:00
David Lynch
ac3ba8db77 Fix sparse object passed for cover options 2019-10-14 00:29:40 -05:00
David Lynch
9d0b5f1d3a
Merge pull request #26 from thegrinner/no-vote-fictionlive
Fix FictionLive download failure on missing vote node
2019-10-14 00:07:34 -05:00
David Lynch
d782928e0e Spacebattles is now on XenForo2 2019-10-12 10:51:22 -05:00
thegrinner
4e4f16e7cc Appease flake8 2019-10-03 17:48:45 -04:00
thegrinner
d0402daa7b Add handling for votes that don't have a votes kvp 2019-10-03 17:36:43 -04:00
David Lynch
5e034a7d65 Xenforo: let non-first-category threadmarks work
Currently this just requires passing a link to the reader view of a particular
category. In the future I might want to support more variants on this -- a
flag to pull down all the threadmark categories, for instance.
2019-08-06 17:29:53 -05:00
David Lynch
532a7c6682 Fix typo of title_element in arbitrary
Fixes #25
2019-07-30 09:37:03 -05:00
David Lynch
f002064352 Xenforo2 title labels 2019-07-24 23:29:12 -05:00
David Lynch
a148fa8c43 Flake8 errors 2019-07-13 13:17:54 -05:00
David Lynch
3443304ab1 XenForo: handle SV's XenForo2 changes 2019-07-13 11:42:22 -05:00
David Lynch
b1b51bdc8f Xenforo: clean out title prefixes 2019-06-17 16:13:09 -05:00
David Lynch
c8f5b3f8d8 XenForo should use reader-view if available
Much like 40b4856 greatly sped up AO3, this greatly speeds up XenForo
2019-05-29 01:56:39 -05:00
David Lynch
2bd5d77715 Helper for URL-joining 2019-05-29 01:55:35 -05:00
David Lynch
4d93f84efa flake8 should extend_ignore not ignore 2019-05-25 20:04:54 -05:00
David Lynch
66576048da Fix flake8 errors 2019-05-25 20:03:17 -05:00
David Lynch
e8f9c6a085 Update requirements 2019-05-25 15:38:01 -05:00
David Lynch
40b4856a14 Optimize AO3: use full_work URL 2019-05-25 15:31:39 -05:00
David Lynch
617ee5ebfd Practical Guide: extra chapters 2019-05-16 23:59:54 -05:00
David Lynch
4414b8d063
Merge pull request #24 from ad1217/fix-filename
Fix typo in PGTE5 example file name
2019-05-10 00:35:08 -05:00
Adam Goldsmith
8b9b5281a9 Fix typo in PGTE5 example file name 2019-05-10 01:28:59 -04:00
David Lynch
0c26da3dab
Merge pull request #22 from codetheweb/patch-1
Add book 5 of APGTE
2019-03-30 14:48:21 -05:00
Max Isom
77fb154368
Add book 5 of APGTE
Thanks for your work, it's been great being able to read web serials on my Kindle. :)
2019-03-30 00:10:15 +00:00