**------------------------------------------------------------------------------------------------
* @header_start
* WebGrab+Plus ini for grabbing IMDB data from TvGuide websites
* @MinSWversion :
* @Site: allocine.fr
* @Revision 1 - [04/06/2013] Francis De Paemeleere
*   - add correct title scrub
*   - max=5 for mdb_commentsummary
* @Revision 0 - [03/06/2013] Francis De Paemeleere
*   - creation
* @Remarks: primary search with google.com
* @header_end
**------------------------------------------------------------------------------------------------
*
site {url=allocine.fr|cultureinfo=fr-FR|charset=UTF-8|matchfactor=60|searchsite=google}
* primary search:
url_primarysearch {url(urlencode=1,2,3,4)|http://www.google.com/search?num=10&q=|site:allocine.fr "|'title'|" "|'productiondate'|"}
show_id.scrub {multi()|primary|allocine.fr/film/fichefilm_gen_cfilm=||.html|.html}
*
* imdb url's:
url_mdb_p1 {url|primary|http://allocine.fr/film/fichefilm_gen_cfilm=|show_id|.html}
url_mdb_p2 {url|primary|http://www.allocine.fr/film/fichefilm-|show_id|/casting}
url_mdb_p3 {url|primary|http://www.allocine.fr/film/fichefilm-|show_id|/critiques/presse/}
url_mdb_p4 {url|primary|http://www.allocine.fr/film/fichefilm-|show_id|/critiques/spectateurs/membres-critiques/}
*
* imdb elements
mdb_title.scrub {single()|p1|
Titre original
|||} * original title when redirected
mdb_title.scrub {single()|p1||" />} * normal title
mdb_title.scrub {multi()|p1|Ce film dans d'autres pays|||} * aka's
mdb_title.modify {cleanup()}
mdb_productiondate.scrub {single()|p1|
Année de production
|||} * production year
mdb_actor.scrub {multi()|p2|itemprop="actors"|||}
mdb_director.scrub {multi()|p2|itemprop="director"|||}
mdb_starrating.scrub {single()|p1||\|}
mdb_starrating.modify {substring(type=element)|-1 1} * get the last part (should be the rating)
mdb_starratingvotes.scrub {single()|p1||

|

}
mdb_description.modify {cleanup(tags="<"">")}
mdb_commentsummary.scrub {multi(max=5)|p4|

|

|

} * viewer comments
mdb_commentsummary.modify {cleanup(tags="<"">")}
mdb_review.scrub {multi()|p3|
|

|

|

} * press comments
mdb_review.modify {cleanup(tags="<"">")}
* remove empty comments = containing "Voir la critique sur"
******mdb_temp_6.modify {addstart()|'mdb_review'}
******mdb_temp_1.modify {calculate(debug type=element format=F0)|'mdb_review' #} * number of comments
*******mdb_review.modify {clear}
*******loop {('mdb_temp_1' > "0" max=50)|end}
******mdb_temp_1.modify {calculate(format=F0)|1 -} * decrease index
******mdb_temp_2.modify {substring(debug type=element)|'mdb_temp_6' 'mdb_temp_1' 1}
******mdb_temp_3.modify {addstart(debug 'mdb_temp_2' not~ "Voir la critique sur")|####'mdb_temp_2'} * only add non-empty comments
*******end_loop
******
******mdb_review.modify {addstart(debug)|'mdb_temp_3'} * make multi
******mdb_review.modify {replace(debug)|####|\|} * make multi