73 lines
4.7 KiB
INI
Executable File
73 lines
4.7 KiB
INI
Executable File
**------------------------------------------------------------------------------------------------
|
|
* @header_start
|
|
* WebGrab+Plus ini for grabbing EPG data from TvGuide websites
|
|
* @Site: ontv.dk
|
|
* @MinSWversion: V0
|
|
* none
|
|
* @Revision 2 - [25/09/2012] Willy de Wilde/Jesper Schumacher
|
|
* none
|
|
* @Remarks:
|
|
* Replaced by international: en.timefor.tv
|
|
* Corrected by Jesper Schuamcher June 2012 to fit MediaPortal XMLTV_ns
|
|
* removed duplicate actors Jan van Straaten 25/09/2012
|
|
* in episode : is replaced with /
|
|
* In episode Sæson is removed and added to Category due to MP not able to read episode as "episode no./of episode/season"
|
|
* Added all info to find on hostpage
|
|
* @header_end
|
|
**------------------------------------------------------------------------------------------------
|
|
|
|
site {url=ontv.dk|timezone=UTC+01:00|maxdays=14|cultureinfo=da-DK|charset=UTF-8|titlematchfactor=90}
|
|
url_index{url(debug)|http://ontv.dk/tv-guide/|channel|/|urldate|}
|
|
urldate.format {datestring|yyyy-MM-dd}
|
|
*
|
|
index_showsplit.scrub {multi()|<td style="width:40px;|||</tr>}
|
|
index_urlshow {url ()|http://ontv.dk|<td><p>|href="|" programid="|</a>}
|
|
index_date.scrub {single(force include=last)|<div class="content" id="content">|/20|" style="text-decoration:underline;">|</table>}
|
|
index_start.scrub {single()|text-align:right;">|<p>|:</p>|</p>}
|
|
index_title.scrub {single()|<a href="|summary">|</a>|}
|
|
*
|
|
*stop.scrub {single()|<div class="content" id="content">| - | |</p>} * 144 finder stop tid til test **********************
|
|
|
|
title.scrub {single(separator=" <img" include=first)|<div class="content" id="content">|<td><h1>|</h1><p>|</h1><p>} *line 139 and 144
|
|
titleoriginal.scrub {single(lang=en)|<div class="content" id="content">|Original titel:</strong></p></td><td><p>|</p></td></tr>|} *linje 193 finds original title
|
|
subtitle.scrub {single () |<div class="content" id="content">|</div><p><strong>|</strong>|</strong>} * line 139 finds subtitle
|
|
description.scrub {single (separator=" Læs mere " exclude="Læs den komplette programomtale" "ontv.dk")|<meta name="Description" content="||.">} * line 6
|
|
category.scrub {single()|<div class="content" id="content">|Genre:</strong></p></td><td><p>|</p></td></tr>|} * Finds genre eg. komedie, action, drama etc.
|
|
productiondate.scrub {single()|<div class="content" id="content">|Fra:</strong></p></td><td><p>|</p></td></tr>|} * finds productionyear
|
|
category.scrub {single()|<div class="content" id="content">|Type:</strong></p></td><td><p>|</p></td></tr>|} * Finds more Typeinfo eg. Film, serie etc.
|
|
episode.scrub {single()|<div class="content" id="content">|Episode:</strong></p></td><td><p>|</p></td></tr>|} * Extracts all episode info incl. sæson-info
|
|
temp_1.scrub {single()|Episode:</strong></p></td><td><p>|sæson |</p></td></tr>|</tr></table><br>} * line 139 finds the "sæson" number, if it is a sæson
|
|
*temp_1.modify {substring(type=word)|0 1} * clears everything but the first word (number), because sæson can occur in description
|
|
actor.scrub {multi()|<p>Medvirkende</p></div>|<strong>|</p>|</table>} * finds actor list, HTML junk is removed with modify later
|
|
director.scrub {single()|<img src="/imgs/icons/director.png">|<strong>|</strong>|</strong><br>Instruktør} * finds director
|
|
director.modify {addend()| - Instruktør} * adds " - Instruktør"
|
|
director.modify {remove()|.} * remove dot
|
|
* operations:
|
|
*producer.modify {addstart()| 'temp_1'} * writes temp_1 (sæson number) in producer to see contest - only for test
|
|
temp_1.modify {addstart(notnull)|sæson } *adds sæson if it is not 'blank'
|
|
category.modify {addend()| 'temp_1'} * adds sæson and number to category (movietype)
|
|
episode.modify {remove()|'temp_1'} * Removes "sæson" and number in episode
|
|
episode.modify {addstart()|#} * Helper
|
|
episode.modify {addend()|#} * Helper
|
|
episode.modify {remove()|:#} * To remove if it ends on :
|
|
episode.modify {remove()|#:} * To remove : if it starts with a :
|
|
episode.modify {remove()|#} * remove helper
|
|
episode.modify {remove()| } * removes spaces
|
|
episode.modify {replace()|:|/} * replace : with /
|
|
actor.modify {replace()|</strong><br>| - } * removes HTML junk in actorlist
|
|
actor.modify {addend()|#} * Helper
|
|
actor.modify {remove()| - #} * removes - if it is the end fo string
|
|
actor.modify {remove()|#} * remove helper
|
|
actor.modify {remove()|.} * remove dot
|
|
actor.modify {remove()|'director'} * Removes director from actorlist
|
|
*
|
|
temp_1.modify {substring(type=element)|'actor' -1 1} * the last actor
|
|
temp_2.modify {substring(type=element)|'actor' -2 1} * the fore-last actor
|
|
actor.modify {remove('temp_1' = 'temp_2' type=element)|-1 1} * remove if equal
|
|
*
|
|
description.modify {remove()|..}
|
|
index_date.modify {addstart()|20}
|
|
subtitle.modify {remove() |Annonce:}
|
|
description.modify {cleanup}
|
|
subtitle.modify {cleanup}
|