HTML Scraping with Javascript -
i use simple javascript script, in batch file, download audio , video - radio , tv shows - bbc iplayer.
part of script extracts data bbc's xml pages.
i want try extracting data html page. can point me javascript method extracting data ordinary .htm or .html page?
i'm anxious keep things simple, having javascript routine can include in html page on website, i'm interested in javascript solutions. thanks.
edit, 24 aug -
the bbc's html pages don't respond javascript scripts parse xml pages.
i use simple javascript interrogate xml, based on -
function loadxml() { xmldoc = new activexobject("microsoft.xmldom"); xmldoc.async = false; xmldoc.onreadystatechange = readxml; xmldoc.load(url); }
your question kinda vague. think there may 2 ways done: 1. apply regexp match patterns 2. import html dom simulator , walk tree find data ( assume using nodejs )
Comments
Post a Comment