Hello, I am an Engineering Manager at Facebook with 13+ years in Ad Technology, Natural Language Processing and Data mining.
by Pravin Paratey

RediffBlog Crawler

Wouldn’t it be great if you could save your rediffblog to disk? You could then browse through your posts and comments even when you weren’t connected to the internet. You could give your whole blog to a friend, move to another blog host, or even delete your blog while keeping a copy of it. That is where BlogCrawler comes in. BlogCrawler lets you save your entire blog to disk, along with all the comments!

Installation and Uninstallation

Download BlogCrawler 0.1 (requires the .NET 2.0 Framework)

To install, double-click the setup.exe file and follow the instructions. BlogCrawler comes with an uninstaller that you can access either through the Start Menu or through the Add/Remove Programs section of the Control Panel.

User Guide

Setting up your blog for crawling

The first step is to add unique tags before and after your post in the Template. This tells BlogCrawler where each post starts and ends. Inside the <rediffBlog> ... </rediffBlog> block, place the string <!-- BlogCrawler Start --> before the post and the string <!-- BlogCrawler End --> after it.

Remember to save your template and publish your blog after you’re done.
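
To illustrate why the markers matter, here is a minimal Python sketch of how a crawler could pull posts out of a published page once the two strings are in place. This is not BlogCrawler's actual code (BlogCrawler is a .NET application whose internals are not shown here); the function name `extract_posts` and the sample page are assumptions made up for this example.

```python
import re

# The two marker strings from the template setup step.
START = "<!-- BlogCrawler Start -->"
END = "<!-- BlogCrawler End -->"


def extract_posts(html, start=START, end=END):
    """Return the text found between each start/end marker pair.

    Hypothetical helper for illustration only, not part of BlogCrawler.
    """
    # re.escape keeps the markers literal; the non-greedy group stops at
    # the first end marker, and DOTALL lets a post span several lines.
    pattern = re.compile(re.escape(start) + r"(.*?)" + re.escape(end), re.DOTALL)
    return [body.strip() for body in pattern.findall(html)]


# Made-up sample page with two marked posts and some unmarked sidebar markup.
page = """
<html><body>
<!-- BlogCrawler Start -->
<h2>First post</h2><p>Hello world</p>
<!-- BlogCrawler End -->
<div>sidebar</div>
<!-- BlogCrawler Start -->
<h2>Second post</h2><p>More text</p>
<!-- BlogCrawler End -->
</body></html>
"""

posts = extract_posts(page)
print(len(posts))  # number of marked posts found on the page
```

Anything outside a marker pair, such as the sidebar in the sample page, is ignored, which is why the strings need to wrap each post exactly.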

Running BlogCrawler

  1. First, fire up BlogCrawler.
  2. Click on Next to go to Step 2. Fill in the fields.
  3. Enter the start and end strings.
  4. Wait while your blog is downloaded.
  5. That’s it! You’re done. Open the containing folder to see your posts with comments :-)
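
For readers curious about what the final step amounts to, the following Python sketch shows one way a list of extracted posts could be written to a folder, one file per post. It is only an assumption about the general idea: the function name `save_posts`, the `post-001.html` naming scheme, and the sample posts are hypothetical and do not describe BlogCrawler's real output layout.

```python
import tempfile
from pathlib import Path


def save_posts(posts, out_dir):
    """Write each post to its own numbered HTML file and return the paths.

    Hypothetical helper for illustration only, not part of BlogCrawler.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for index, body in enumerate(posts, start=1):
        # Zero-padded numbers keep the files sorted in posting order.
        path = out / f"post-{index:03d}.html"
        path.write_text(body, encoding="utf-8")
        paths.append(path)
    return paths


# Made-up posts saved into a temporary folder for demonstration.
demo_dir = tempfile.mkdtemp()
saved = save_posts(["<h2>First post</h2>", "<h2>Second post</h2>"], demo_dir)
print([p.name for p in saved])
```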

Sample Output

[Screenshot: BlogCrawler output]

Troubleshooting

In case of issues, feel free to get in touch: pravinp -at- gmail -dot- com

Awards

Softpedia clean