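Getting web page data

I need to retrieve the tracking information for a tracking number.
Tracking number: RA081572108CN
URL: http://17track.net/IndexEn.html?nums=RA081572108CN

The problem is that the site displays the result inside a frame, <iframe id="frmResult" name="frmResult" scrolling="no" frameborder="0" src="Result.htm"></iframe>, and the data I need is what appears in Result.htm.

Could someone suggest an approach or share some code? I would be very grateful. I mainly develop in C#.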

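-------------------- Programming Q&A -------------------- I did not capture any data that matches RA081572108CN literally, so my guess is that the number is encrypted in some way before it is submitted. Have a look at the JS files yourself and see whether that leads anywhere. Here is the captured traffic: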

GET /Resources/Scripts/Zh_Cn/InfoState.js?ref=1303900166754 HTTP/1.1
Host: 17track.net
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.2.16) Gecko/20110319 Firefox/3.6.16
Accept: application/json, text/javascript, */*; q=0.01
Accept-Language: zh-cn,zh;q=0.5
Accept-Encoding: gzip,deflate
Accept-Charset: GB2312,utf-8;q=0.7,*;q=0.7
Keep-Alive: 115
Connection: keep-alive
X-Requested-With: XMLHttpRequest
Referer: http://17track.net/Result.htm
Cookie: __utma=103993024.1693448300.1303899609.1303899609.1303899609.1; __utmb=103993024.12.10.1303899609; __utmc=103993024; __utmz=103993024.1303899609.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none); cnzz_a2909656=9; sin2909656=; rtime=0; ltime=1303900024137; cnzz_eid=91050212-1303899764-



HTTP/1.1 200 OK
Content-Length: 188
Content-Type: application/x-javascript
Content-Encoding: gzip
Last-Modified: Tue, 15 Mar 2011 06:54:10 GMT
Accept-Ranges: bytes
ETag: "0951dc9dde2cb1:526"
Vary: Accept-Encoding
Server: Microsoft-IIS/6.0
X-Powered-By: ASP.NET
Date: Wed, 27 Apr 2011 10:31:39 GMT

(gzip-compressed binary body, 188 bytes, not shown)

GET /Resources/Scripts/Zh_Cn/PackageState.js?ref=1303900166936 HTTP/1.1
Host: 17track.net
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.2.16) Gecko/20110319 Firefox/3.6.16
Accept: application/json, text/javascript, */*; q=0.01
Accept-Language: zh-cn,zh;q=0.5
Accept-Encoding: gzip,deflate
Accept-Charset: GB2312,utf-8;q=0.7,*;q=0.7
Keep-Alive: 115
Connection: keep-alive
X-Requested-With: XMLHttpRequest
Referer: http://17track.net/Result.htm
Cookie: __utma=103993024.1693448300.1303899609.1303899609.1303899609.1; __utmb=103993024.12.10.1303899609; __utmc=103993024; __utmz=103993024.1303899609.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none); cnzz_a2909656=9; sin2909656=; rtime=0; ltime=1303900024137; cnzz_eid=91050212-1303899764-



HTTP/1.1 200 OK
Content-Length: 123
Content-Type: application/x-javascript
Content-Encoding: gzip
Last-Modified: Fri, 21 Jan 2011 08:02:18 GMT
Accept-Ranges: bytes
ETag: "021dc8541b9cb1:526"
Vary: Accept-Encoding
Server: Microsoft-IIS/6.0
X-Powered-By: ASP.NET
Date: Wed, 27 Apr 2011 10:31:39 GMT

(gzip-compressed binary body, 123 bytes, not shown)

GET /Resources/Scripts/Zh_Cn/CommonRes.js?ref=1303900167029 HTTP/1.1
Host: 17track.net
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.2.16) Gecko/20110319 Firefox/3.6.16
Accept: application/json, text/javascript, */*; q=0.01
Accept-Language: zh-cn,zh;q=0.5
Accept-Encoding: gzip,deflate
Accept-Charset: GB2312,utf-8;q=0.7,*;q=0.7
Keep-Alive: 115
Connection: keep-alive
X-Requested-With: XMLHttpRequest
Referer: http://17track.net/Result.htm
Cookie: __utma=103993024.1693448300.1303899609.1303899609.1303899609.1; __utmb=103993024.12.10.1303899609; __utmc=103993024; __utmz=103993024.1303899609.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none); cnzz_a2909656=9; sin2909656=; rtime=0; ltime=1303900024137; cnzz_eid=91050212-1303899764-



HTTP/1.1 200 OK
Content-Length: 1072
Content-Type: application/x-javascript
Content-Encoding: gzip
Last-Modified: Wed, 13 Apr 2011 06:11:46 GMT
Accept-Ranges: bytes
ETag: "0bdc0aaa1f9cb1:526"
Vary: Accept-Encoding
Server: Microsoft-IIS/6.0
X-Powered-By: ASP.NET
Date: Wed, 27 Apr 2011 10:31:39 GMT

(gzip-compressed binary body, 1072 bytes, not shown)

POST /Code/Actions/GetResults.ashx?ref=1303900167266 HTTP/1.1
Host: 17track.net
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.2.16) Gecko/20110319 Firefox/3.6.16
Accept: */*
Accept-Language: zh-cn,zh;q=0.5
Accept-Encoding: gzip,deflate
Accept-Charset: GB2312,utf-8;q=0.7,*;q=0.7
Keep-Alive: 115
Connection: keep-alive
Content-Type: application/x-www-form-urlencoded; charset=UTF-8
X-Requested-With: XMLHttpRequest
Referer: http://17track.net/Result.htm
Content-Length: 31
Cookie: __utma=103993024.1693448300.1303899609.1303899609.1303899609.1; __utmb=103993024.12.10.1303899609; __utmc=103993024; __utmz=103993024.1303899609.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none); cnzz_a2909656=9; sin2909656=; rtime=0; ltime=1303900024137; cnzz_eid=91050212-1303899764-
Pragma: no-cache
Cache-Control: no-cache

number=UkEwODE1NzIxMDhDTg%3D%3D
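
A note on the captured POST body above: the `number` value looks like URL-encoded Base64 rather than encryption. Decoding `UkEwODE1NzIxMDhDTg==` as Base64 gives back the tracking number from the question. A minimal C# sketch of that round trip follows; the names `TrackingNumberCodec`, `ToFormValue` and `FromFormValue` are made up for illustration, and UTF-8 is assumed only because the request declares `charset=UTF-8`.

```csharp
using System;
using System.Text;

static class TrackingNumberCodec
{
    // Tracking number -> UTF-8 bytes -> Base64 -> URL-encoded form value.
    // This mirrors what the captured request appears to send; it is an
    // observation from one capture, not documented behaviour of the site.
    public static string ToFormValue(string trackingNumber)
    {
        string base64 = Convert.ToBase64String(Encoding.UTF8.GetBytes(trackingNumber));
        return Uri.EscapeDataString(base64); // '=' padding becomes %3D
    }

    // Reverse direction: URL-decode, then Base64-decode back to text.
    public static string FromFormValue(string formValue)
    {
        string base64 = Uri.UnescapeDataString(formValue);
        return Encoding.UTF8.GetString(Convert.FromBase64String(base64));
    }
}

static class Program
{
    static void Main()
    {
        // Value copied from the captured POST body.
        Console.WriteLine(TrackingNumberCodec.FromFormValue("UkEwODE1NzIxMDhDTg%3D%3D"));
        // Rebuild the body from the plain tracking number.
        Console.WriteLine("number=" + TrackingNumberCodec.ToFormValue("RA081572108CN"));
    }
}
```

The rebuilt body is 31 characters long, which agrees with the `Content-Length: 31` header in the capture.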
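-------------------- Programming Q&A -------------------- Load the page you need in a WebBrowser control, then read the data you want from the loaded page. This is the quickest route, because you do not have to analyze any of the encryption.

What you see is what you get.

-------------------- Programming Q&A -------------------- It is not encrypted, it is just compressed.

Do not send Accept-Encoding: gzip with your request.

-------------------- Programming Q&A -------------------- As the previous poster said, it is compressed. The request asks for it with this header:
Accept-Encoding: gzip,deflate

-------------------- Programming Q&A -------------------- Both the parameter and the response are encrypted, which is obviously meant to prevent scraping.
The JS is obfuscated as well; they really went to a lot of trouble.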
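
The compression advice in the replies can be sketched in C# as follows. Note that the unreadable bodies in the capture belong to the three `.js` files; the response to the `GetResults.ashx` POST is not shown anywhere in the thread, so this sketch simply returns it as raw text. It assumes the endpoint still behaves as it did in the 2011 capture, and it leaves out the `ref` query parameter on the assumption that it is only a cache-busting timestamp. `ResultFetcher`, `BuildRequest` and `Fetch` are illustrative names.

```csharp
using System;
using System.IO;
using System.Net;
using System.Text;

static class ResultFetcher
{
    // Builds the POST request with the headers seen in the capture.
    public static HttpWebRequest BuildRequest()
    {
        var request = (HttpWebRequest)WebRequest.Create(
            "http://17track.net/Code/Actions/GetResults.ashx");
        request.Method = "POST";
        request.ContentType = "application/x-www-form-urlencoded; charset=UTF-8";
        request.Referer = "http://17track.net/Result.htm";
        request.Headers["X-Requested-With"] = "XMLHttpRequest";
        // Let the framework inflate gzip/deflate bodies so the caller reads
        // plain text. The alternative from the replies is to leave this
        // unset, in which case no Accept-Encoding header is sent.
        request.AutomaticDecompression =
            DecompressionMethods.GZip | DecompressionMethods.Deflate;
        return request;
    }

    // Sends the tracking number as URL-encoded Base64, matching the capture.
    public static string Fetch(string trackingNumber)
    {
        HttpWebRequest request = BuildRequest();
        string base64 = Convert.ToBase64String(Encoding.UTF8.GetBytes(trackingNumber));
        byte[] body = Encoding.ASCII.GetBytes("number=" + Uri.EscapeDataString(base64));
        request.ContentLength = body.Length;

        using (Stream requestStream = request.GetRequestStream())
        {
            requestStream.Write(body, 0, body.Length);
        }

        using (var response = (HttpWebResponse)request.GetResponse())
        using (var reader = new StreamReader(response.GetResponseStream(), Encoding.UTF8))
        {
            return reader.ReadToEnd(); // response format is unknown here
        }
    }
}

static class Program
{
    static void Main()
    {
        try
        {
            Console.WriteLine(ResultFetcher.Fetch("RA081572108CN"));
        }
        catch (WebException ex)
        {
            Console.WriteLine("Request failed: " + ex.Message);
        }
    }
}
```

If the server also checks cookies or other state set by the outer page, this direct request may not be enough, and loading the page in a WebBrowser control as suggested above remains the fallback.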
Category: .NET technology, C#