I was thinking what makes my eyes find out a list of items on a webpage as a list. either there is a set of lines or the text is arranged in a way that shows a similar pattern, so if I write an algorithm which follows the same way my eyes recognize a property list, then my application can do it too.
I actually wrote a small script which points out repeated html tags inside a page. so if there is a table of many ""s it finds it out.
not so intelligent but still it's helpful to write crawlers for each single property list.
* funny thing is I code a crawler for a Real Estate website today, tomorrow the site disappears. WTF!!!
June 2011
November 2009: Monthly Archives
Categories
Monthly Archives
- June 2011 (5)
- May 2011 (1)
- April 2011 (1)
- March 2011 (1)
- February 2011 (5)
- January 2011 (1)
- December 2010 (3)
- October 2010 (2)
- September 2010 (2)
- August 2010 (1)
- July 2010 (3)
- June 2010 (1)
- May 2010 (3)
- April 2010 (1)
- March 2010 (3)
- February 2010 (7)
- January 2010 (2)
- November 2009 (1)
- August 2009 (3)
- July 2009 (1)
- May 2009 (2)
- April 2009 (1)
- March 2009 (2)
- February 2009 (4)
- September 2008 (2)
- August 2008 (4)
- July 2008 (2)
- June 2008 (4)
- May 2008 (1)
- April 2008 (1)
- March 2008 (3)
- February 2008 (8)
Pages
Search
About this Archive
This page is a archive of entries in the Sedubai category from August 2008.
Sedubai: July 2008 is the previous archive.
Sedubai: February 2009 is the next archive.
Find recent content on the main index or look in the archives to find all content.
Powered by Movable Type Open Source
