**------------------------------------------------------------------------------------------------ * @header_start * WebGrab+Plus ini for grabbing EPG data from TvGuide websites * @Site: aasthatv.com * @MinSWversion: V1.1.1/53.16 * @Revision 1 - [01/07/2014] Jan van Straaten * - july 2014 version * @Revision 0 - [12/05/2014] Jan van Straaten * - creation * @Remarks: Must be revised each beginning of the month. This one is for June 2014 * @header_end **------------------------------------------------------------------------------------------------ * site {url=aasthatv.com|timezone=UTC+05:30|maxdays=6|cultureinfo=en-GB|charset=UTF-8|titlematchfactor=90} * urldate.format {datestring|dd-MMM-yyyy} scope.range {(urlindex)|end} index_variable_element.modify {calculate(format=date,vba)|'urldate'} * the config_site_id is the daynumber offset for the channel * the last part of the next line : |number - 'config_site_id' +} * how to calculate the number : * start with 01/06/2014 ... the vba datenumber = 41791 * calculate this vba datenumber for the 1st day of the month you want e.g. July 2014 41791+31 = 41821 * determine which channel has the lowest daynumber (for June 2014 that was aastha-india = 2780 * the number = vbadatenumber - lowest daynumber = for June = 41791 - 2780 = 39011 * it is also needed togenerate a new channel list each month (see at the end) * for July VBA number = 42821 (see above) * lowest daynumber = 3049 (aastha-india) * the number 41821 - 3049 = 39772 index_variable_element.modify {calculate(format=F0)|38772 - 'config_site_id' +} * calculates the daynumber url_index{url|http://www.aasthatv.com/index.php/schedules/aastha-india/####-|urldate|} url_index.modify {replace()|####|'index_variable_element'} *example *http://www.aasthatv.com/index.php/schedules/aastha-india/2584-13-may-2014 url_index.headers {customheader=Accept-Encoding=gzip,deflate} * to speedup the downloading of the index pages end_scope * * july 2014: index_showsplit.scrub {multi()|||
} * index_start.scrub {regex()||(\d{2} \d{2} [apm]{2})||} index_start.modify {replace| am|am} index_start.modify {replace| pm|pm} index_start.modify {replace| |:} index_title.scrub {regex()||width=\"535\">.+?

(.+?)

||} * this was valid until june *index_start.scrub {regex()||(\d+:\d{2} [apm]{2})||} *index_title.scrub {regex()||(.+)||} *index_title.modify {cleanup(tags="<"">")} * ** _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ ** ##### CHANNEL FILE CREATION (only to create the xxx-channel.xml file) ** ** @auto_xml_channel_start *index_site_id.scrub {|} *index_site_id.modify {addstart|0\|30\|60\|90\|120\|150} * sequence for June 2014 (a month with 30 days) *index_site_id.modify {addstart|0\|30\|61\|92\|123\|154} * sequence for July 2014 (a month with 31 days) *index_site_channel.modify {addstart|aastha-india\|aastha-uk\|aastha-usa-canada\|aastha-bhajan-india\|aastha-bhajan-usa\|vedic-india} * July 2014 ** @auto_xml_channel_end *For May 2014 the following urls and day values were found: *http://www.aasthatv.com/index.php/schedules/aastha-india/2584-13-may-2014 *http://www.aasthatv.com/index.php/schedules/aastha-uk/2615-13-may-2014 *http://www.aasthatv.com/index.php/schedules/aastha-usa-canada/2646-13-may-2014 *http://www.aasthatv.com/index.php/schedules/vedic-india/2677-13-may-2014 *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-india/2708-13-may-2014 *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-usa/2739-13-may-2014 * *index_site_id.modify {addstart|0\|31\|62\|93\|124\|155} * sequence for May 2014 (a month with 31 days) *index_site_channel.modify {addstart|aastha-india\|aastha-uk\|aastha-usa-canada\|vedic-india\|aastha-bhajan-india\|aastha-bhajan-usa} * May 2014 * *For June 2014 *http://www.aasthatv.com/index.php/schedules/aastha-india/2780-01-june-2014 *http://www.aasthatv.com/index.php/schedules/aastha-uk/2810-01-june-2014 *http://www.aasthatv.com/index.php/schedules/aastha-usa-canada/2840-01-june-2014 *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-india/2870-01-june-2014 *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-usa/2900-01-june-2014 *http://www.aasthatv.com/index.php/schedules/vedic-india/2930-01-june-2014 *index_site_id.modify {addstart|0\|30\|60\|90\|120\|150} * sequence for June 2014 (a month with 30 days) *index_site_channel.modify {addstart|aastha-india\|aastha-uk\|aastha-usa-canada\|aastha-bhajan-india\|aastha-bhajan-usa\|vedic-india} * June 2014 vedic is now the last! *For July 2014 *Following are the new links July. I am mostly interest in aastha-india. Any hope of getting the logic? *http://www.aasthatv.com/index.php/schedules/aastha-india/2809-30-june-2014 *http://www.aasthatv.com/index.php/schedules/aastha-india/3049-01-july-2014 * *http://www.aasthatv.com/index.php/schedules/aastha-uk/2839-30-june-2014 *http://www.aasthatv.com/index.php/schedules/aastha-uk/3079-01-july-2014 * *http://www.aasthatv.com/index.php/schedules/aastha-usa-canada/2869-30-ju... *http://www.aasthatv.com/index.php/schedules/aastha-usa-canada/3111-01-ju... * *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-india/2899-30-... *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-india/3142-01-... * *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-usa/2929-30-ju... *http://www.aasthatv.com/index.php/schedules/aastha-bhajan-usa/3173-01-ju... * *http://www.aasthatv.com/index.php/schedules/vedic-india/2959-30-june-2014 *http://www.aasthatv.com/index.php/schedules/vedic-india/3204-01-july-2014 * *No logic between subsequent month , the start number 3049 (aastha-india) *Strange is also that the difference between 1juli aastha-india and the next , aastha UK is only 30 while the month juli has 31 days *That should go wrong , probably on 31 july because then the number for aastha-india (3079) is then the same as for asstha-uk on 1 july *index_site_id.modify {addstart|0\|30\|61\|92\|123\|154} * sequence for July 2014 (a month with 31 days) *index_site_channel.modify {addstart|aastha-india\|aastha-uk\|aastha-usa-canada\|aastha-bhajan-india\|aastha-bhajan-usa\|vedic-india} *e.g *http://www.aasthatv.com/index.php/schedules/aastha-india/2602-31-may-2014 is still valid *http://www.aasthatv.com/index.php/schedules/aastha-india/2603-01-jun-2014 is invalid, error 404 *For each channel the daynumber is 31 (the number of days in the month) higher. This offset is entered in the site_id. *It is unknown how this daynumber will be for the start of each month. * we ask the user lkospa7@gmail.com to keep record of the url's