HTML Scraping with JavaScript


I use a simple JavaScript script, run from a batch file, to download audio and video (radio and TV shows) from BBC iPlayer.

Part of the script extracts data from the BBC's XML pages.

I want to try extracting data from an HTML page. Can anyone point me to a JavaScript method for extracting data from an ordinary .htm or .html page?

I'm anxious to keep things simple, and I'd like a JavaScript routine that I can include in an HTML page on my website, so I'm only interested in JavaScript solutions. Thanks.

Edit, 24 Aug:

The BBC's HTML pages don't respond to the JavaScript scripts that parse the XML pages.

I use simple JavaScript to interrogate the XML, based on:

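    function loadXML() {
        // ActiveX XML parser (Internet Explorer / Windows Script Host only)
        xmlDoc = new ActiveXObject("Microsoft.XMLDOM");
        xmlDoc.async = false;                 // load synchronously
        xmlDoc.onreadystatechange = readXML;  // handler defined elsewhere
        xmlDoc.load(url);                     // url is set elsewhere
    }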

Your question is kind of vague. I think there are two ways this could be done:

1. Apply a regexp to match patterns.
2. Import the HTML into a DOM simulator and walk the tree to find the data (I assume you are using Node.js).
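As a rough illustration of the first approach (regexp matching), here is a minimal sketch that pulls the link targets and link text out of an HTML string. The markup in `sampleHtml` and the function name `extractLinks` are made up for this example and are not taken from any BBC page. A regexp like this only copes with simple, predictable markup.

```javascript
// Hypothetical sample markup; a real page would have to be downloaded first.
const sampleHtml = `
<html><body>
  <h1 class="title">Example Programme</h1>
  <a href="/episode/one">Episode 1</a>
  <a href='/episode/two'>Episode 2</a>
</body></html>`;

// Collect the href value and inner text of every <a> element.
// Assumes the href attribute is quoted and the link text contains no nested <a>.
function extractLinks(html) {
  const linkPattern = /<a\s[^>]*href=["']([^"']*)["'][^>]*>([\s\S]*?)<\/a>/gi;
  const links = [];
  let match;
  // exec() with the g flag resumes after the previous match on each call.
  while ((match = linkPattern.exec(html)) !== null) {
    links.push({ href: match[1], text: match[2].trim() });
  }
  return links;
}

console.log(extractLinks(sampleHtml));
```

For anything beyond simple patterns, the second approach is usually more robust. In a browser, `new DOMParser().parseFromString(html, "text/html")` returns a document that can be queried with `querySelectorAll`. Under Node.js a separate DOM library would be needed.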

