How to extract/download content from HTTPS page?


Posted on 16th Feb 2014 07:03 pm by admin

Hello to all the Members of this forum, Im Shoiab, A novice programmer in php.. for my first job I have been recently assigned a project, in which I have got to extract/download the contents of the webpage (of my clients website) from HTTPS webpage using cURL. In other words I want to extract the same exact webpage to my local host.

Let me tell you, what all I have done so far, I am able to download the web content from "www.virginholidays.co.uk" here is the link to book a resort
"http://www.virginholidays.co.uk/brochures/florida/holidays/orlando/kissimmee/champions_world_resort" when i click on BOOK THE HOLIDAY BUTTON, it takes me to "https webpage" from which im not able to download (https://www.virginholidays.co.uk/book/start)

Im using windows XP, IE 5, php 5.2 and fiddler.

Here is my code:

$req1="GET /book/start HTTP/1.0rn";
$req1.='Accept: */*';
$req1.="rnAccept-Encoding: gzip, deflate
Cookie: _#lc=#; 90225614_clogin=l=1259059733&v=1&e=1259062485781;

__utmc=262657675;
CoreID6=60127103647212586967853;

__utma=262657675.233062282.1258696796.1259047752.1259059734.14;
__utmz=262657675.1258696796.1.1.utmccn=(direct)|utmcsr=(direct)

|utmcmd=(none);
_#uid=1258696798931.315033071.3223127.1883.436744734.051;

_#srchist=11611%3A1%3A20091221055958;
_#sess=1%7C20091120062958%7C1; _#vdf=11611%7C1%7C20091221055958;

__utmb=262657675;

ASP.NET_SessionId=zpn5ftje1xxodv55f1h3yg45; cmTPSet=Y;

cookie_complete=Region%3DFlorida%26Resort%3D2018.OR;

_csoot=1259036845125;

ememberedSearch=GeographyArea=Florida&GeographyResort=329.OR&Depart

ureAirport=MAN&DepartureDate=Fri 11 Dec

2009&Duration=7&AdultPax=2&ChildPax=0&InfantPax=0&ChildAge1=&ChildA

ge2=&ChildAge3=&ChildAge4=&ChildAge5=&ChildAge6=&ChildAge7=&ChildAg

e8=&SearchType=complete; _csuid=X47174a9c82f607;

cmRS=t3=1259060790328&pi=Hotel%20Options%20-%20Atop

User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1;

InfoPath.2; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR

3.5.30729)

Host: http://www.virginholidays.co.uk
Connection: Keep-Alive
Accept-Language: en-us";

$header[0] = "Accept:

text/xml,application/xml,application/xhtml+xml,application/json,";
$header[0] .=

"text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5";
$header[] = "Cache-Control: public";
$header[] = "Connection: keep-alive";
$header[] = "Keep-Alive: 300";
$header[] = "Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7";
$header[] = "Accept-Language: en-us,en;q=0.5";
$header[] = "Pragma: "; // browsers keep this blank.
$cookie="#lc=#; 90225614_clogin=l=1259059733&v=1&e=1259062485781;

__utmc=262657675;

CoreID6=60127103647212586967853;

__utma=262657675.233062282.1258696796.1259047752.1259059734.14;
__utmz=262657675.1258696796.1.1.utmccn=(direct)|utmcsr=(direct)

|utmcmd=(none);
_#uid=1258696798931.315033071.3223127.1883.436744734.051;

_#srchist=11611%3A1%3A20091221055958;
_#sess=1%7C20091120062958%7C1; _#vdf=11611%7C1%7C20091221055958;

__utmb=262657675;
ASP.NET_SessionId=zpn5ftje1xxodv55f1h3yg45; cmTPSet=Y;

cookie_complete=Region%3DFlorida%26Resort%3D2018.OR;

_csoot=1259036845125;

RememberedSearch=GeographyArea=Florida&GeographyResort=329.OR&Depar

tureAirport=MAN&DepartureDate=Fri 11 Dec

2009&Duration=7&AdultPax=2&ChildPax=0&InfantPax=0&ChildAge1=&ChildA

ge2=&ChildAge3=&ChildAge4=&ChildAge5=&ChildAge6=&ChildAge7=&ChildAg

e8=&SearchType=complete; _csuid=X47174a9c82f607;

cmRS=t3=1259060790328&pi=Hotel%20Options%20-%20Atop";

$ch = curl_init();
curl_setopt($ch,

CURLOPT_URL,"https://www.virginholidays.co.uk/book/start");
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 0);
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, 0);
curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, FALSE);
curl_setopt ($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt ($ch, CURLOPT_HTTPHEADER, $header);
curl_setopt($ch, CURLOPT_COOKIESESSION, TRUE);
curl_setopt($ch, CURLOPT_POST, 0);
curl_setopt($ch, CURLOPT_HEADER, 1);
curl_setopt($ch, CURLOPT_ENCODING, 'gzip,deflate');
curl_setopt ($ch, CURLOPT_COOKIE, $cookie);
$response1=curl_exec($ch);
curl_close($ch);
echo $response1;

$response = str_replace

("/_assets/","http://www.virginholidays.co.uk/_assets/",$response);
$response = str_replace

("/brochures/","http://www.virginholidays.co.uk/brochures/",$respon

se);
$response = str_replace

("/dynamichtag.aspx","http://www.virginholidays.co.uk/dynamichtag.a

spx",$response);
echo $response;

Could you please help me download the content of https webpage? Im not sure what is the issue? Is the cookie or session expired? Or I need to write a different code..?

Please help,
Thanks

No comments posted yet

Your Answer:

Login to answer
189 Like 5 Dislike
Previous forums Next forums
Other forums

Filtering an Array Based on Value
I have a very simple script set up that pulls data from a database and is output using this code:

How to update this array?
I am saving a string that looks this: username1, username2, username3...
into a single cell in a

Taking an HTML form and a PHP program and making it into one working file.
Hello,

So I would like to display everything INSIDE a specific directory. In other words, I

Which practice of iteration through containers is preferred
In the "real world" what kind of loop do most people use to iterate through a container like a vecto

Local file browser with php
Halo..

So this is what i want to create. I have a folder that i share and it has many movies.

Is it possible to stop reservation creation in PM Order?
Hi All,

Is it possible to stop reservation creation in PM Order?

Thanks in adv

Quick Question about echo value
Hey guys, I'm trying to get something to show differently in one of my scripts. I'm trying to make

Undefined variables
hi
----------------------------------------------------------------------------------------------

need help with this contact form
I made this form and I need it to read the data and write the data to a table and its not working ca

Undefined Index Notice In Internet Explorer Only
I am getting the following notice:

QuoteNotice: Undefined index: jrox in /home/ycsn/public_h

Sign up to write
Sign up now if you have flare of writing..
Login   |   Register
Follow Us
Indyaspeak @ Facebook Indyaspeak @ Twitter Indyaspeak @ Pinterest RSS



Play Free Quiz and Win Cash