EasyAdPost.com Oh just great, now we're getting spam adveristing blogs. This is progress? In f'ing HTMl no less. -Bill Kearney Received: from sbapp3...
I'm working on a LiSA[1] parser in Perl and have completed a set of test cases for RSS 0.91[2] which I thought might be of interest to other parser developers....
I actually think that the appearance of the Christian Science Monitor and Computerworld feeds (thanks to both providers) are a huge landmark for RSS and for...
Hi all, I came across an interesting situation with regard to the RSS feed validator. The feed in question: http://www.aquarionics.com/meta/all.rss It's using...
Thanks to everyone who has been so helpful with creation and publicizing of the Christian Science Monitor RSS feeds. I'm trying to get the tech resources to...
... I very much encourage you to do this. IMO, this is much of the value of RSS - being able to track a specific subject, despite, or even because it is...
Hi all, I recently came across [1] a feed [2] that attempts to use white space for formatting within it's description element. This was raised during an...
Hi folks, I've taken to using a robots.txt file to ban (polite) web crawlers from my site because I've got memory problems and can't handle them thrashing my...
... No, RSS tools don't look at the robots.txt files. The RSS files are, most of the time, served as if they were just regular web pages. So they'll neither ...
... I think the assumption is that the text itself determines the width. This would, of course, be a problem for an HTML oriented display environment. ...
... I agree. If someone were to post a BDG, like the one [1] Simon Fell posted for Etags, I would put together support for it in Radio (assuming it was easy,...
And what would you expect the aggregator to do? There's nothing in a robots.txt file that would 'help' an aggregator manage it's use of a resource. The FIRST...
... How so? How is the robots.txt file germaine to a reader's behavior? http://www.robotstxt.org/wc/faq.html#what The only way a robots.txt file is going to...
... If they're that dysfunctional they're certainly not going to respect/understand how to parse out a robots.txt file. Asking the readers to engage in extra...
... You may want to deny access to the feed to certain aggregators, perhaps ones that aggressively fetch it every 10 minutes. Although, since robots.txt is...
... when i said 'aggregator', i wasn't necessarily talking about end-user tools like radio, but more about large-scale tools and sites that collect mountains...
... my own experience with such hostile user-agents is that the odds of them honoring robots.txt is roughly zero. they usually end up being custom perl or php...
... eh, but having more than one way to do the same kind of thing has never hurt *anyone*. :) ... not everyone gets to use apache, even when they want to....
... Oh which there's what, a half-dozen or so? ... So if it's not a lightweight newsfeed don't list it. Or, as we've recently implemented for Newsisfree and...
... Indeed, that was my tack on the in-band redirection arguments. That's /still/ mired in useless arguing. ... Again, I'm right there with you. This is my...
... The only things you will succeed in blocking are going to mainstream applications. It won't even make anyone who is really intent on getting the RSS even...
... Then how about setting up a sticky trap that opens a session that stays open? Or sending them a godzilla-gram of megabytes of pseudo-random XML data?...
... There's a very low tech solution that doesn't get used enough. Just put an item in, explaining in plain text that the feed is moving and give it a bit of...
Would be interesting if people started using the Retry-After header with 503... 14.37 Retry-After The Retry-After response-header field can be used with a 503...