Tried my own script but no luck

merc_slk · Post by **merc_slk** » Mon Feb 25, 2008 9:58 pm

Hello,

I tried this script on the website but I think I missed something, can't figure it out.

if string.find(HTML.Url,'IDType=prd',1,true) then
local s1,e1=string.find(HTML.Url,'SetIDWebObject\?ID=%w*');
if (s1 and e1) then
product=string.sub(HTML.Url,s1+19,e1);
local url='http://www.glamonweb.com/GlamOnWeb/images/catalog/';
url=url .. 'prd' .. product .. '_F.jpg';
AddLink(url,HTML.Url,productname,'',HTML.TaskID,HTML.Level);
local url='http://www.glamonweb.com/GlamOnWeb/images/catalog/';
url=url .. 'prd' .. product .. '_R.jpg';
AddLink(url,HTML.Url,productname,'',HTML.TaskID,HTML.Level);
end;
end;
DefaultParser(HTML.Content,HTML.Url,HTML.Title,HTML.TaskID,HTML.Level);

It seems the parser doesn't enter into my script. BTW I checked get JPG but PL gets also the GIF, don't know why?

KoalaBear · Post by **KoalaBear** » Tue Feb 26, 2008 1:08 pm

1.this site requist you to choose your location at first, we can use PicaLoader's Manual Login feature at here: set http://www.glamonweb.com/GlamOnWeb/index.jsp as manual login URL, when start downloading, click your location (e.g United Kingdom) at manual login dialog, then click OK button at the top left conner.
2.check the start URL with IE, we found it links to a collection page first that URL look like xxxxxxxIDType=ser, and then, from this collection page, links to the product page that URL look like xxxxxxIDType=prd, so we set Page URL Include Filters:IDType=prd$;IDType=ser$
3.the big pics's URL look like this: xxxxxxxx_P.jpg, so we set Picture URL Include Filters:_P\.jpg$
4.this site using Adobe Flash to show big pics, so we have to using PicaLoader's HTML Parser Script to add pics's URL, there are a few error at your script, it should be like this:

if string.find(HTML.Url,'IDType=prd',1,true) then
local s1,e1=string.find(HTML.Url,'SetIDWebObject%?ID=%d+');
if (s1 and e1) then
product=string.sub(HTML.Url,s1+18,e1);
productname=HTML.Title;
s1,e1=string.find(HTML.Content,'<title%b><');
if (s1 and e1) then
productname=string.sub(HTML.Content,s1+7,e1-1);
end;
local url='http://www.glamonweb.com/GlamOnWeb/images/catalog/';
url=url .. 'prd' .. product .. 'f_P.jpg';
AddLink(url,HTML.Url,productname,'',HTML.TaskID,HTML.Level);
end;
end;
DefaultParser(HTML.Content,HTML.Url,HTML.Title,HTML.TaskID,HTML.Level);

merc_slk · Post by **merc_slk** » Sat Mar 01, 2008 9:07 am

I rewrote the script and some changes in the filter expressions.

The picture to be downloaded can be found in HTML.Content

value="/GlamOnWeb/images/catalog/prd8131f.jpg"

and than some annex are added like _P,_R,_F,_PR or nothing

So I changed the script like this :

if string.find(HTML.Url,'IDType=prd',1,true) then
local s1,e1=string.find(HTML.Content,'value="/GlamOnWeb/images/catalog/prd+\%w+');
if (s1 and e1) then
product=string.sub(HTML.Content,s1+34,e1);
local views=",_P,_R,_F,_PR";
for view in string.gmatch(views,[%u_]*) do
local url='http://www.glamonweb.com/GlamOnWeb/images/catalog/';
url=url .. product .. view .. '.jpg';
AddLink(url,HTML.Url,HTML.Title,'',HTML.TaskID,HTML.Level);
end;
end;
end;
DefaultParser(HTML.Content,HTML.Url,HTML.Title,HTML.TaskID,HTML.Lev

But after opening and analyzing first page nothing happens. The Href's with "prd$" or "ser$" aren't even downloaded, so it will not get into the lua script. This problem I've very often and than suddenly without changing something it runs marvelous???
P.S. In which cases do I select "support J/VB/JAVASCRIPT"

KoalaBear · Post by **KoalaBear** » Sun Mar 02, 2008 6:35 pm

merc_slk wrote:I rewrote the script and some changes in the filter expressions.

The picture to be downloaded can be found in HTML.Content

value="/GlamOnWeb/images/catalog/prd8131f.jpg"

and than some annex are added like _P,_R,_F,_PR or nothing

So I changed the script like this :

if string.find(HTML.Url,'IDType=prd',1,true) then
local s1,e1=string.find(HTML.Content,'value="/GlamOnWeb/images/catalog/prd+\%w+');
should be:
local s1,e1=string.find(HTML.Content,'value="/GlamOnWeb/images/catalog/prd%w+');
(prd+\%w+ will match prd+\32344f, '\' is special character for C/C++, not for lua)
if (s1 and e1) then
product=string.sub(HTML.Content,s1+34,e1);
should be:
product=string.sub(HTML.Content,s1+33,e1);
local views=",_P,_R,_F,_PR";
for view in string.gmatch(views,[%u_]*) do
should be:
for view in string.gmatch(views,'[%u_]+') do
(* means match 0 or more times.)
local url='http://www.glamonweb.com/GlamOnWeb/images/catalog/';
url=url .. product .. view .. '.jpg';
AddLink(url,HTML.Url,HTML.Title,'',HTML.TaskID,HTML.Level);
end;
end;
end;
DefaultParser(HTML.Content,HTML.Url,HTML.Title,HTML.TaskID,HTML.Level);

But after opening and analyzing first page nothing happens. The Href's with "prd$" or "ser$" aren't even downloaded, so it will not get into the lua script. This problem I've very often and than suddenly without changing something it runs marvelous???
please visit the start URL with IE, you will find that this site will skip to it's index page and let you select your language.
you can try to let IE show the correct content of start URL before run PicaLoader, and keep IE open, PicaLoader will auto using IE's cookie.
P.S. In which cases do I select "support J/VB/JAVASCRIPT"
If the web site using JavaScript links that PicaLoader's default HTML parser can deal with and these links is useful for us, please select this.
for this site, it's not necessary.

and your Picture URL Include Filters:
prd+\w+\.jpg$
should be:
prd[\w_]+\.jpg$

merc_slk · Post by **merc_slk** » Sun Mar 02, 2008 10:03 pm

Hi KoalaBear,

Fisrt of all thanks for the explination and the help, also with the other scripts of other sites. I get now a better idea how PL is working.

I changed the scipt like you advised and kept IE open before opening and running the project in PL. The only thing what happend was "loading the URL, waiting ...., no analysing and after a pause the error 1 web page loaded no pictures. I cleared the script, the "ser" and "prd" URL's are downloaded. I put the script back in, same story nothing happens.

Don't know what is happening. The only thing I noticed is that the line

local s1,e1=string.find(HTML.Content,'value="/GlamOnWeb/images/catalog/prd%w+');

is split in 3 lines in my HTMP Parser Script List box.

Is the script loaded and analized before the task is ran?

Many thanks.

merc_slk · Post by **merc_slk** » Mon Mar 10, 2008 9:53 pm

Hello,

Has anybody tried this modified script?? I cann't get it running. Only when I clear the Lua script box the included URL's are downloaded. So far it works. Script loaded, nothing is downloaded.

Any answer or solution is welcome.

Merc

KoalaBear · Post by **KoalaBear** » Tue Mar 11, 2008 7:25 am

my settings

merc_slk · Post by **merc_slk** » Tue Mar 11, 2008 10:13 pm

The first script did it's work but did not download all the pictures.
It loaded the "ser" and "prd" url from "marchi.jsp"and some pictures.

Second we changed the script and nothing else happened, only if I cleared the script, the "ser" and "prd" url were downloaded again, but not the required pictures.

Third The URL to start the download from is changed and the modified script works.

So I don't understand what caused the problem. One script works together with the url marchi.jsp but not the modified one??? Now the modified works with the modified start URL!! A litle flaw in the program? It sure would be nice If I saw what the parser was doing or creating a log registering what the parser is doing.So if there's a problem like here it would it make more understandable.

But anyway thanks again for the help KoalaBear. Thanks a lot.

Jef

Batch Download Pictures From Website

Tried my own script but no luck

Tried my own script but no luck

Changed script, Only analyzes first page but does nothing

Re: Changed script, Only analyzes first page but does nothin

Still won't run

Who has tried

Let's summarize.