mirror of https://github.com/kemayo/leech synced 2025-12-06 08:22:56 +01:00
Commit graph

393 commits

Author SHA1 Message Date
David Lynch
0f176f5a7c session.cookies should stay a RequestsCookieJar 2021-02-12 15:50:18 -06:00
David Lynch
e66adc171f Add example for getting all of Practical Guide in one ebook
...this has been sitting uncommitted since I did all the `next` stuff,
so I think I just forgot to commit it then.
2021-02-12 10:46:16 -06:00
David Lynch
406e88acfa Add an example for Heretical Edge 2, to show how to exclude chapters
Refs #46
2021-02-12 10:43:54 -06:00
David Lynch
5e8d42e256 JSON indentation should be spaces 2021-02-12 10:37:14 -06:00
David Lynch
28cc1fbcc7 Arbitrary should store contents as a string, not a bs4 Tag
It coincidentally works by being string-like for previous uses, but it's
not string-like enough for the new unicode stuff.

Fixes #54
2021-02-05 19:58:47 -06:00
David Lynch
533c14f0d7 Normalize fancy unicode characters by default
Kindle can't display the "𝖙𝖍𝖚𝖌 𝖑𝖎𝖋𝖊" mathematical bold fraktur codepoints,
so NFKC-normalize them (and anything else) into their plain equivalents.

Can be disabled by running with `--no-normalize` if needed.
2021-02-05 01:59:20 -06:00
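
A minimal sketch of the normalization described in 533c14f0d7, using only the standard library. The function name `normalize_text` and its `enabled` flag are hypothetical stand-ins for whatever `--no-normalize` toggles; they are not taken from the leech source.

```python
import unicodedata


def normalize_text(text: str, enabled: bool = True) -> str:
    """Return text in NFKC form, or unchanged when normalization is disabled.

    `enabled` is a hypothetical parameter mirroring the `--no-normalize` flag.
    """
    if not enabled:
        return text
    # NFKC applies compatibility decomposition followed by canonical
    # composition, which maps mathematical alphanumeric symbols to plain letters.
    return unicodedata.normalize("NFKC", text)


fancy = "𝖙𝖍𝖚𝖌 𝖑𝖎𝖋𝖊"
plain = normalize_text(fancy)
print(plain)  # thug life
```

NFKC also folds other compatibility characters, such as the "ﬁ" ligature, which is presumably why the commit offers an opt-out.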
David Lynch
ba77390f27 Remove .travis.yml 2021-01-26 14:43:11 -06:00
David Lynch
3a8137a099 eclint restrictions 2021-01-26 14:42:07 -06:00
David Lynch
3ec075cf2a cover.py isn't executable so different call needed 2021-01-26 14:37:29 -06:00
David Lynch
74a07cb2f8 But without the typo in the YAML this time 2021-01-26 13:53:37 -06:00
David Lynch
112565099c Tweak action 2021-01-26 13:53:00 -06:00
David Lynch
b1e9350a23 Create python-package-poetry.yml 2021-01-26 13:49:26 -06:00
David Lynch
7c7d536019 Bump lock packages 2021-01-26 13:12:39 -06:00
David Lynch
ae1b77da2f Wattpad: use API instead
Their on-page HTML sometimes uses JS to load parts of the story
2021-01-26 13:11:56 -06:00
David Lynch
23c7a1496c Quick take on wattpad 2021-01-26 01:56:41 -06:00
David Lynch
7c040c08a0 Merge pull request #49 from keturn/feat/travis-build
have Travis CI test more things
2021-01-23 13:00:29 -06:00
David Lynch
7af94af17b Merge pull request #48 from keturn/feat/leech-script
provide "leech" as a runnable script
2021-01-23 12:59:18 -06:00
David Lynch
2465a64fe3 Merge pull request #51 from IdanDor/master
Made arbitrary sites no longer leak memory and fixed worm epub.
2021-01-23 12:57:45 -06:00
IdanDor
6d7b5ffcf0 Removed trailing whitespace. 2021-01-23 13:30:03 +02:00
IdanDor
1afac50437 Made arbitrary sites no longer leak memory and fixed worm epub.
Each `Chapter` object held a reference to the entire page tree, so the program's RAM usage grew substantially.

Switched Worm to use next_selector so that the chapters are correctly ordered, E.2 is not skipped, and the download no longer crashes on the `?share=twitter` URL that was previously matched.

Fixed Worm titles.
2021-01-23 12:12:48 +02:00
Kevin Turner
dc13946a77 have Travis CI test more things
- run the --help command
- try making a cover
- build the distribution and show what it made
2021-01-14 21:32:27 -08:00
Kevin Turner
f909472509 provide "leech" as a runnable script 2021-01-14 21:21:31 -08:00
David Lynch
377e6ff76c Update dependencies on attrs and Pillow to latest versions 2020-11-01 21:13:20 -06:00
David Lynch
4429049ade Fix the Vacant Throne example as well 2020-09-08 23:07:21 -05:00
David Lynch
c208e33752 Arbitrary: strip all namespaced elements
This is `fb:like` and similar, which break some epub readers.

Refs: #41, #43
2020-09-08 23:04:47 -05:00
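
A rough sketch of the idea in c208e33752: drop every element whose tag name contains a namespace prefix, such as `fb:like`. It uses the standard library's `html.parser` so it is self-contained; this is an assumption for illustration, not how leech does it. The class name `NamespacedStripper` and the helper `strip_namespaced` are hypothetical.

```python
from html.parser import HTMLParser


class NamespacedStripper(HTMLParser):
    """Re-emit HTML while skipping elements whose tag name contains ':'."""

    def __init__(self):
        # Keep character references as written so they can be re-emitted as-is.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a namespaced element

    def handle_starttag(self, tag, attrs):
        if ":" in tag:
            self.skip_depth += 1
        elif not self.skip_depth:
            self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags never open a skipped region.
        if ":" not in tag and not self.skip_depth:
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if ":" in tag:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif not self.skip_depth:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(data)

    def handle_entityref(self, name):
        if not self.skip_depth:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_depth:
            self.out.append(f"&#{name};")


def strip_namespaced(html: str) -> str:
    parser = NamespacedStripper()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


cleaned = strip_namespaced('<p>Hi</p><fb:like href="x">like</fb:like><p>End</p>')
print(cleaned)  # <p>Hi</p><p>End</p>
```

One limitation of this sketch: a namespaced start tag that is never closed would swallow everything after it, so real markup may need a more forgiving parser.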
David Lynch
9c9877ed26 Example of a smarter approach to books with a "next" link
Authors are often strangely bad at having an accurate table of
contents. (E.g. practical guide has at least one mislinked chapter in
there.) Show how to follow a "next" link, stopping when hitting a
certain URL.

For practical guide, this also has the benefit of dropping in the extra
chapters where they were originally experienced.
2020-09-08 22:15:44 -05:00
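
The "next"-link approach from 9c9877ed26 can be sketched roughly as follows. `follow_next`, `get_next`, and `stop_url` are hypothetical names for illustration; whether the stop URL itself is included is an assumption (here it is excluded), and a real run would fetch each page and read its "next" link rather than use a dict.

```python
from typing import Callable, Iterator, Optional


def follow_next(
    start: str,
    get_next: Callable[[str], Optional[str]],
    stop_url: Optional[str] = None,
) -> Iterator[str]:
    """Yield chapter URLs by following "next" links from `start`.

    Stops when there is no next link, when `stop_url` is reached
    (it is not yielded), or when a URL repeats (guards against loops).
    """
    seen = set()
    url: Optional[str] = start
    while url and url != stop_url and url not in seen:
        seen.add(url)
        yield url
        url = get_next(url)


# Stand-in for fetching pages: maps each page to the page its "next" link points at.
links = {"ch1": "ch2", "ch2": "extra1", "extra1": "ch3", "ch3": "epilogue", "epilogue": None}
chapters = list(follow_next("ch1", links.get, stop_url="epilogue"))
print(chapters)  # ['ch1', 'ch2', 'extra1', 'ch3']
```

Because the walk follows reading order rather than a table of contents, extra chapters appear where the author linked them.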
David Lynch
91747edb53 Deathworlders example is also now non-functional
Site seems to have moved... and now has an official epub download anyway
2020-09-08 21:29:38 -05:00
David Lynch
9da5c60deb Remove non-functional example
Fixes #40
2020-09-08 21:25:19 -05:00
David Lynch
988368bb66 Better xenforo blockquote chrome removal 2020-08-18 13:21:01 -05:00
David Lynch
ffc6dccaac Update urllib3 dependency 2020-08-10 09:41:30 -05:00
David Lynch
0256516702 Merge pull request #39 from kemayo/poetry
Use poetry rather than pip
2020-07-16 09:18:22 -05:00
David Lynch
a8c47cda08 Use poetry rather than pip 2020-07-15 12:08:42 -05:00
David Lynch
983152e408 Further examples 2020-07-15 11:18:09 -05:00
David Lynch
2103f37cfb AO3: fallback for single-chapter works 2020-05-04 00:31:19 -05:00
David Lynch
64fbdac4cd A few arbitrary examples 2020-04-29 17:55:20 -05:00
David Lynch
6fbdc8843d Make arbitrary site chapter-title selectors more resilient 2020-04-29 17:55:20 -05:00
David Lynch
6631095726 Fiction.live: niche URLs
* occasional stories with "Sci-fi" in the URL instead of "stories"
* rare cases of `-` in the work id

Fixes #31
2019-11-14 14:45:19 -06:00
David Lynch
a856f9d0f8 Fiction.live: account for a weird rare bug/possibility in votes
Also, add a bunch of error handling / logging to the section-parsing to
avoid this in the future.

Fixes #30
2019-11-07 09:34:39 -06:00
David Lynch
f89f5163b5 Fiction.live: Fix choices array check
Fixes #29
2019-11-05 15:02:09 -06:00
David Lynch
685120b17d Add .venv to .gitignore 2019-10-31 00:43:23 -05:00
David Lynch
4861ffbd7e Fiction.live can have votes for absent choices
Fixes #28.
2019-10-29 08:17:01 -05:00
David Lynch
dc10e4cf17 FFN: less-destructive attribute clearing 2019-10-17 22:29:01 -05:00
David Lynch
6cc1620fb3 Spec-compliance: metadata shouldn't be compressed 2019-10-17 22:28:46 -05:00
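
The commit message does not say which file 6cc1620fb3 touched. Assuming it concerns the EPUB container rule that the `mimetype` entry be stored uncompressed as the first entry in the archive, a minimal sketch with the standard `zipfile` module might look like this. The function name `write_epub_skeleton` and the `content.opf` path are illustrative assumptions.

```python
import io
import zipfile

# Minimal container file; the rootfile path is a placeholder for illustration.
CONTAINER_XML = """<?xml version="1.0"?>
<container version="1.0" xmlns="urn:oasis:names:tc:opendocument:xmlns:container">
  <rootfiles>
    <rootfile full-path="content.opf" media-type="application/oebps-package+xml"/>
  </rootfiles>
</container>
"""


def write_epub_skeleton(fileobj) -> None:
    """Write the first two entries of an EPUB archive to `fileobj`."""
    with zipfile.ZipFile(fileobj, "w", compression=zipfile.ZIP_DEFLATED) as epub:
        # mimetype goes first and is stored, overriding the archive default.
        epub.writestr("mimetype", "application/epub+zip", compress_type=zipfile.ZIP_STORED)
        # Everything else may use the archive's default (deflate) compression.
        epub.writestr("META-INF/container.xml", CONTAINER_XML)


buffer = io.BytesIO()
write_epub_skeleton(buffer)
with zipfile.ZipFile(buffer) as check:
    entries = check.infolist()
```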
David Lynch
5c9fe18157 Travis config: bump python version 2019-10-15 11:17:49 -05:00
David Lynch
7208cfdaaf Minor readability improvement: use f-strings 2019-10-15 11:14:27 -05:00
David Lynch
c584988994 Update dependencies 2019-10-14 00:40:34 -05:00
David Lynch
ac3ba8db77 Fix sparse object passed for cover options 2019-10-14 00:29:40 -05:00
David Lynch
9d0b5f1d3a Merge pull request #26 from thegrinner/no-vote-fictionlive
Fix FictionLive download failure on missing vote node
2019-10-14 00:07:34 -05:00
David Lynch
d782928e0e Spacebattles is now on XenForo2 2019-10-12 10:51:22 -05:00
thegrinner
4e4f16e7cc Appease flake8 2019-10-03 17:48:45 -04:00