mirror of https://github.com/kemayo/leech synced 2025-12-06 08:22:56 +01:00
Commit graph

36 commits

Author SHA1 Message Date
David Lynch
5cb887f767 Move image processing into sites
The epub-builder still downloads the image, but all the html-mangling
is done in the extraction process now.

Turns footnotes into a chapter-object, for easier processing later on.
2025-03-22 19:39:16 -05:00
David Lynch
9ed2d54db7 Make the _soup method able to cope with being given a html string 2025-03-04 23:14:51 -06:00
Max Isom
53bc2045f0 Use lxml (>40% faster) 2025-03-04 22:23:50 -06:00
David Lynch
9a2b574b4b Missed a call to _soup in ao3 2024-12-23 21:02:09 -06:00
David Lynch
21834bb5ed _clean takes a base argument and reformats image srcs into absolute urls 2024-11-23 15:30:57 -06:00
David Lynch
a0a057c48c _soup always returns a base URL 2024-11-23 15:15:29 -06:00
David Lynch
1f57cd6f07 Basic success-testing on logins 2024-05-14 22:07:05 -05:00
David Lynch
6895a0eb61 AO3 single-chapter story bugs 2023-03-31 23:51:27 -05:00
David Lynch
d81eefa7f3 AO3: use new form helper so this shouldn't break again if fields change 2022-05-13 11:04:25 -05:00
David Lynch
56bc2b941c AO3: utf8 field no longer in login form 2022-04-16 18:26:26 -05:00
David Lynch
f05bfb51ef AO3: work if www is present in the URL 2021-08-10 17:15:18 -05:00
David Lynch
d1caf85883 Extract tags when present
Supported currently on Xenforo and AO3
2021-05-01 16:35:49 -05:00
David Lynch
37cb0332b7 AO3: fix issue that could occur if the work had gaps in chapter numbers 2021-04-05 19:55:46 -05:00
David Lynch
f25befc237 Decode cloudflare email address protection
Makes a generic _clean function on Site that can be called. Will
probably want to migrate some other generic bits into there after
analysis of what's *really* generic.
2021-03-27 10:46:39 -05:00
David Lynch
dfa298dd3b Better error message for restricted AO3 stories 2021-03-21 23:17:29 -05:00
David Lynch
2103f37cfb AO3: fallback for single-chapter works 2020-05-04 00:31:19 -05:00
David Lynch
7208cfdaaf Minor readability improvement: use f-strings 2019-10-15 11:14:27 -05:00
David Lynch
2bd5d77715 Helper for URL-joining 2019-05-29 01:55:35 -05:00
David Lynch
66576048da Fix flake8 errors 2019-05-25 20:03:17 -05:00
David Lynch
40b4856a14 Optimize AO3: use full_work URL 2019-05-25 15:31:39 -05:00
David Lynch
f64fce0286 AO3: login form changed 2018-12-29 21:00:02 -06:00
David Lynch
e78ffdb85b Method to get a site-key for config
Means that things like XenForoIndex and AO3Series don't require separate
config entries.
2018-10-11 15:42:59 -05:00
David Lynch
cdcd110c50 AO3: change title detection for logged-in only 2018-10-11 15:42:36 -05:00
David Lynch
2042f813d0 Allow AO3 logins for member-only stories 2018-01-19 14:15:43 -06:00
David Lynch
f8d494283c Proper URL normalization for AO3 chapters 2018-01-19 13:19:45 -06:00
David Lynch
7bb6da382c Oh hey, another missing Section URL 2017-11-04 00:30:59 -05:00
David Lynch
6d52c72c99 Use logging instead of print
Fixes #10
2017-11-04 00:09:09 -05:00
Will Oursler
1c577b6f67 Fix lint errors 2017-10-12 10:07:22 -04:00
Will Oursler
5bd07a5b90 Splits out ebook generation logic into a seperate module, in anticipation of maybe supporting multiple output formats. 2017-10-12 09:49:32 -04:00
David Lynch
ea3432bc99 AO3: author notes should be included 2017-05-28 01:15:04 -05:00
David Lynch
5b4b9a0dc3 Canonicalize URLs 2017-02-23 15:03:23 -06:00
David Lynch
86e36715f1 Set up for Travis 2017-02-08 13:20:14 -06:00
David Lynch
e6343cb1c9 Stories are now made of nested sections/chapters
This is prep-work for improving epub TOC generation a bit.
2017-01-10 00:23:24 -08:00
David Lynch
7addf4c3d1 AO3: handle series, imperfectly 2016-12-28 03:06:43 -06:00
David Lynch
24fa9aa22d Use a namedtuple for chapters 2016-09-23 13:11:52 -05:00
David Lynch
008eb8e63d Support ArchiveOfOurOwn 2016-04-03 21:30:29 -05:00