Hello sparkfun,
For a school assignment I have to make a nice simple list of different available sensors. They also want to know which (online) shops sell what sensors.
Now that would be a Pain in the behind to maintain that manually. So I thought up of the following: make a very simple crawler, specific for a few sites (but able to be expanded for other sites as well).
So it starts at the basics,
<?php
$doc = new DOMDocument()l
$doc->loadHTMLFile("http://www.sparkfun.com/");
?>
But that gives me:
Warning: DOMDocument::loadHTMLFile(http://www.sparkfun.com/) [function.DOMDocument-loadHTMLFile]: failed to open stream: HTTP request failed! HTTP/1.1 500 Internal Server Error in D:\webserver\htdocs\school\core\crawler.class.php on line 19
Warning: DOMDocument::loadHTMLFile() [function.DOMDocument-loadHTMLFile]: I/O warning : failed to load external entity "http://www.sparkfun.com/" in D:\webserver\htdocs\school\core\crawler.class.php on line 19
So, before continuing, I would like to know AM I allowed to crawl your site for this purpose (never bad to ask for such a thing, right? :P) secondly, why do I get the 500status >_< I tried google, no problem although it gives a sh*t load of tag errors
And, perhaps a more rude question, would it be possible for you to supply a datafile with a certain dump of your products database? Would make it even more easy
Thanks in advance,
Daan Timmer
p.s. Why can I post this topic as a sticky? bug?