No description
Find a file
wsuetholz 33d1fabd11 Changeset Tag Branch User Date Summary
53:c93e07566456			wsuetholz	2010-11-10 10:11	Ficwad wants you to login in order to view some stories..  They old ficwad.py got around that by starting with the first chapter instead of the story index page.  Since I needed the story index page I had changed it to switch to that page, and then scrape the chapter information from there, which doesn't work if the chapters are blocked.  While it still won't work if you pass in the URL for the story index page, I now switch back to the page that you passed in originally when looking for the chapters to download..  The one problem I have with this, is I change the self.url to the story index page, which should probably remain so that we have a consistent self.url even if the user starts with chapter 9 this time instead of chapter 1.
2010-11-10 10:18:46 -06:00
books Allow one-shot fics for ffnet. 2010-09-24 23:36:05 -05:00
__init__.py Whole lot of fixes related to appengine 2009-12-18 14:51:53 +00:00
adapter.py Add a title page to the resulting EPUB file. This required scraping more information from the web pages in order to populate the new fields. This change includes a change to the way that the output.py uses the adapters. It now gets passed in the adapter and then calls functions from the adapter in order to retrieve the scraped information. This will make it easier down the road when adding more information, or even pictures. 2010-11-09 16:35:46 -06:00
BeautifulSoup.py Starting to use source control again after a long break :-) 2010-07-17 14:44:14 +01:00
constants.py Need to reverse sort entities list to get entities with ';' ahead of versions without. Like '"' and '&quot'. Otherwise '"' becomes '";'. 2010-11-09 23:02:33 -06:00
downloader.py Add a title page to the resulting EPUB file. This required scraping more information from the web pages in order to populate the new fields. This change includes a change to the way that the output.py uses the adapters. It now gets passed in the adapter and then calls functions from the adapter in order to retrieve the scraped information. This will make it easier down the road when adding more information, or even pictures. 2010-11-09 16:35:46 -06:00
ffnet.py Add a title page to the resulting EPUB file. This required scraping more information from the web pages in order to populate the new fields. This change includes a change to the way that the output.py uses the adapters. It now gets passed in the adapter and then calls functions from the adapter in order to retrieve the scraped information. This will make it easier down the road when adding more information, or even pictures. 2010-11-09 16:35:46 -06:00
fictionalley.py Add a title page to the resulting EPUB file. This required scraping more information from the web pages in order to populate the new fields. This change includes a change to the way that the output.py uses the adapters. It now gets passed in the adapter and then calls functions from the adapter in order to retrieve the scraped information. This will make it easier down the road when adding more information, or even pictures. 2010-11-09 16:35:46 -06:00
ficwad.py Changeset Tag Branch User Date Summary 2010-11-10 10:18:46 -06:00
hpfiction.py Had left in a debugging statements that should have been removed.. 2010-11-09 18:18:42 -06:00
html2text.py Starting to use source control again after a long break :-) 2010-07-17 14:44:14 +01:00
html_constants.py Whole lot of fixes related to appengine 2009-12-18 14:51:53 +00:00
output.py Need to reverse sort entities list to get entities with ';' ahead of versions without. Like '"' and '&quot'. Otherwise '"' becomes '";'. 2010-11-09 23:02:33 -06:00
readme.txt Adding a twipassword.py and a quick readme.txt. 2010-10-20 14:58:48 -05:00
twilighted.py Add a title page to the resulting EPUB file. This required scraping more information from the web pages in order to populate the new fields. This change includes a change to the way that the output.py uses the adapters. It now gets passed in the adapter and then calls functions from the adapter in order to retrieve the scraped information. This will make it easier down the road when adding more information, or even pictures. 2010-11-09 16:35:46 -06:00
twipassword.py Adding a twipassword.py and a quick readme.txt. 2010-10-20 14:58:48 -05:00
zipdir.py Fix zip routine so it checks for file mimetype doesn't just assume it exists. 2010-10-16 11:30:09 -05:00

To use, do:

python downaloder.py <url> (epub|html)

Eg:

python downaloder.py http://www.fanfiction.net/s/5192986/1/A_Fox_in_Tokyo epub

This tool uses Python 2.5.2, but should work with newer versions.