**------------------------------------------------------------------------------------------------ * @header_start * WebGrab+Plus ini for grabbing EPG data from TvGuide websites * @Site: dagenstv.com * @MinSWversion: V1.1.1/55.27 * none * @Revision 9 - [16/05/2016] Blackbear199 * fixed start/stop scrub * @Revision 8 - [30/10/2015] Jan van Straaten * fixed date error and adapted to latest standards * @Revision 7 - [20/12/2013] Jan van Straaten * added cleanup of director * @Revision 6 - [09/11/2012] Jan van Straaten * removes the second show having the same start time * @Revision 5 - [] * replaces #x20 (non breaking space) in site_channel with #x20 (space) /added nopageoverlap/added index_stop * @Revision 4 - [] * fix in episode (removes faulty values longer than 4 words) * @Revision 3 - [] * director multi value, director remove [....] * @Revision 2 - [] * bug fixed in description * @Revision 1 - [] * adapted for V1.0.5 * @Revision 0 - [] * none * @Remarks: * This site is index-only with a lot of details in it * @header_end **------------------------------------------------------------------------------------------------ site {url=dagenstv.com|timezone=Europe/Stockholm|maxdays=12|cultureinfo=sv-SE|charset=ISO-8859-1|titlematchfactor=90|nopageoverlaps|episodesystem=onscreen} * url_index{url|http://www.dagenstv.com/se/chart/?cha=|channel|&dat=|urldate} urldate.format {datestring|yyyy-MM-dd} index_showsplit.scrub {multi||} * index_start.scrub {regex||} * used to remove actors from description index_director.scrub {single(separator="Skådespelare""[" include=first)|
} index_presenter.scrub {single(separator=".""[" include=first)|
} * scope.range {(indexshowdetails)|end} index_director.modify {cleanup} * episode filter: must start with Del or Säsong index_episode.modify {cleanup} index_temp_1.modify {calculate(type=element format=F0)|'index_episode' #} loop {('index_temp_1' > "0")|end} index_temp_1.modify {calculate(format=F0)|1 -} index_temp_2.modify {substring(type=element)|'index_episode' 'index_temp_1' 1} * index_temp_2.modify {substring(type=word)|0 1} * first word index_temp_3.modify {clear} * reset index_temp_3.modify {addstart('index_temp_2' "Del")|!OK!!} * condition start with Del index_temp_3.modify {addstart('index_temp_2' "Säsong")|!OK!!} * condition start with Säsong index_episode.modify {remove('index_temp_3' not "!OK!!" type=element)|'index_episode' 'index_temp_1' 1} *remove if !start with Del | Säsong end_loop * index_stop.modify {remove|)} * * the condition when two shows with same start time , we remove the last: index_temp_1.modify {addstart('index_start' = 'previous_start')|remove} index_start.modify {clear('index_temp_1' ~ "remove")} index_stop.modify {clear('index_temp_1' ~ "remove")} * index_episode.modify {replace|\||. } index_description.modify {remove('index_episode' not "")|'index_episode'.} index_description.modify {remove|Producerat år 'index_productiondate'.} index_director.modify {cleanup} index_description.modify {remove|Regissör: 'index_director'} index_description.modify {remove|Skådespelare: 'index_temp_1'.} index_description.modify {remove|Expertkommentator: '{single(separator="Kommentator"" ""[" include=first)|
}'} index_description.modify {remove|presenteras av '{single(separator="." include=first)|
"0" type=char)|'index_director' 'index_temp_1'} index_director.modify {replace|, |\|} * make multi value index_presenter.modify {remove|.} index_presenter.modify {remove|,} * * rebuild the original episode * *Säsong 1. Avsnitt 9 av 23 = S1E9/23 *index_episode.modify {replace|S|Säsong } *index_episode.modify {replace(~ "Säsong")|E|. Avsnitt } *index_episode.modify {replace(not ~ "Säsong")|Avsnitt }| *index_episode.modify {replace|/| av } * * optional: ** Add episode to description *index_description.modify {addend('index_episode' not "")|\nEpisode: 'index_episode'} ** Remove episode *index_episode.modify {clear} end_scope * ** _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ ** ##### CHANNEL FILE CREATION (only to create the xxx-channel.xml file) ** ** @auto_xml_channel_start *index_site_channel.scrub {multi|kanal...|">||} *index_site_id.scrub {multi|kanal...|