**------------------------------------------------------------------------------------------------
* @header_start
* WebGrab+Plus ini for grabbing IMDB data from TvGuide websites
* @MinSWversion :
* @Site: allocine.fr
* @Revision 1 - [04/06/2013] Francis De Paemeleere
* - add correct title scrub
* - max=5 for mdb_commentsummary
* @Revision 0 - [03/06/2013] Francis De Paemeleere
* - creation
* @Remarks: primary search with google.com
* @header_end
**------------------------------------------------------------------------------------------------
*
site {url=allocine.fr|cultureinfo=fr-FR|charset=UTF-8|matchfactor=60|searchsite=google}
* primary search:
url_primarysearch {url(urlencode=1,2,3,4)|http://www.google.com/search?num=10&q=|site:allocine.fr "|'title'|" "|'productiondate'|"}
show_id.scrub {multi()|primary|allocine.fr/film/fichefilm_gen_cfilm=||.html|.html}
*
* imdb url's:
url_mdb_p1 {url|primary|http://allocine.fr/film/fichefilm_gen_cfilm=|show_id|.html}
url_mdb_p2 {url|primary|http://www.allocine.fr/film/fichefilm-|show_id|/casting}
url_mdb_p3 {url|primary|http://www.allocine.fr/film/fichefilm-|show_id|/critiques/presse/}
url_mdb_p4 {url|primary|http://www.allocine.fr/film/fichefilm-|show_id|/critiques/spectateurs/membres-critiques/}
*
* imdb elements
mdb_title.scrub {single()|p1|
Titre original
|| | |} * original title when redirected
mdb_title.scrub {single()|p1||" />} * normal title
mdb_title.scrub {multi()|p1|Ce film dans d'autres pays|||} * aka's
mdb_title.modify {cleanup()}
mdb_productiondate.scrub {single()|p1|Année de production
|| | |} * original title when redirected
mdb_actor.scrub {multi()|p2|itemprop="actors"|||}
mdb_director.scrub {multi()|p2|itemprop="director"|||}
mdb_starrating.scrub {single()|p1||\|}
mdb_starrating.modify {substring(type=element)|-1 1} * get the last part (should be the rating)
mdb_starratingvotes.scrub {single()|p1|||}
mdb_description.modify {cleanup(tags="<"">")}
mdb_commentsummary.scrub {multi(max=5)|p4||
|} * comments spectateur
mdb_commentsummary.modify {cleanup(tags="<"">")}
mdb_review.scrub {multi()|p3||
|
|} * comments presse
mdb_review.modify {cleanup(tags="<"">")}
* remove emty comments = containing "Voir la critique sur"
******mdb_temp_6.modify {addstart()|'mdb_review'}
******mdb_temp_1.modify {calculate(debug type=element format=F0)|'mdb_review' #} * number of comments
*******mdb_review.modify {clear}
*******loop {('mdb_temp_1' > "0" max=50)|end}
******mdb_temp_1.modify {calculate(format=F0)|1 -} * decrease index
******mdb_temp_2.modify {substring(debug type=element)|'mdb_temp_6' 'mdb_temp_1' 1}
******mdb_temp_3.modify {addstart(debug 'mdb_temp_2' not~ "Voir la critique sur")|####'mdb_temp_2'} * only add non-empty comments
*******end_loop
******
******mdb_review.modify {addstart(debug)|'mdb_temp_3'} * make multi
******mdb_review.modify {replace(debug)|####|\|} * make multi