This is the list page:
http://zjnews.zjol.com.cn/05zjnews/system/more/3060000/0001/3060000_00000111.shtml
This is the content page:
http://zjnews.zjol.com.cn/05zjnews/system/2008/08/22/009864616.shtml
Viewing the page source, I found that the comment <!--function content() parse begin--> appears in the content page,
as follows:
<div id="oZoom" style="zoom:100%;">
<!--function content() parse begin-->
{content=*}
<!--function: content() parse end 0ms cost! -->
</div>
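To illustrate the extraction the rule above is meant to perform, here is a minimal Python sketch. It is not the gather tool's own code; the function name `extract_content` and the sample article body are made up for illustration. It treats the two HTML comments as plain text delimiters, since comments are part of the page source like any other markup. It assumes the "0ms" timing in the end marker can differ from page to page, so only the fixed prefix of that marker is matched.

```python
import re
from typing import Optional

# Match everything between the begin marker and the fixed prefix of the end marker.
# Assumption: the "0ms cost!" part of the end marker may vary, so it is not matched.
CONTENT_PATTERN = re.compile(
    r"<!--function content\(\) parse begin-->(.*?)<!--function: content\(\) parse end",
    re.DOTALL,
)

def extract_content(html: str) -> Optional[str]:
    """Return the text between the two marker comments, or None if they are absent."""
    match = CONTENT_PATTERN.search(html)
    return match.group(1).strip() if match else None

# Hypothetical sample page; the real page would hold the article body here.
sample = """<div id="oZoom" style="zoom:100%;">
<!--function content() parse begin-->
<p>Article body</p>
<!--function: content() parse end 0ms cost! -->
</div>"""

print(extract_content(sample))
```

If a sketch like this finds the content in the downloaded page source, the comments themselves are not hiding anything from a scraper.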
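When gathering, I get a 774 error,
along with the message:
"The server cannot retrieve the remote file information, so the gather failed."
This is the rule I wrote: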
INSERT+INTO+%60p8_gather_rule%60+%28%60id%60%2C+%60type%60%2C+%60fixsystem%60%2C+%60filetype%60%2C+%60webname%60%2C+%60listurl%60%2C+%60firstpage%60%2C+%60page_begin%60%2C+%60page_end%60%2C+%60page_step%60%2C+%60title_minleng%60%2C+%60listmoreurl%60%2C+%60link_include_word%60%2C+%60link_noinclude_word%60%2C+%60link_replace_word%60%2C+%60title_replace_word%60%2C+%60list_begin_code%60%2C+%60list_end_code%60%2C+%60list_begin_preg%60%2C+%60list_end_preg%60%2C+%60gatherthesame%60%2C+%60show_begin_preg%60%2C+%60show_end_preg%60%2C+%60show_endfile_preg%60%2C+%60show_begin_code%60%2C+%60show_end_code%60%2C+%60show_replace_word%60%2C+%60show_morepage%60%2C+%60show_firstpage%60%2C+%60show_spe2page%60%2C+%60posttime%60%2C+%60list%60%2C+%60copypic%60%2C+%60sort%60%2C+%60file_type%60%2C+%60file_minleng%60%2C+%60file_minsize%60%2C+%60file_includeword%60%2C+%60file_noincludeword%60%2C+%60file_explode%60%2C+%60file_picwidth%60%2C+%60file_star_string%60%2C+%60title_rule%60%2C+%60content_rule%60%2C+%60title_morepage_rull%60%2C+%60content_morepage_rull%60%2C+%60charset_type%60%29+VALUES+%28%27%27%2C%27article%27%2C%27article%27%2C%27%27%2C%27%D5%E3%BD%AD%D4%DA%CF%DF--%B8%F7%B5%D8%D0%C2%CE%C5%C1%AA%B2%A5%27%2C%27http%3A%2F%2Fzjnews.zjol.com.cn%2F05zjnews%2Fsystem%2Fmore%2F3060000%2F0001%2F%5Bpage%5D%27%2C%27http%3A%2F%2Fzjnews.zjol.com.cn%2F05zjnews%2Forigin_news%2F%27%2C%271%27%2C%273%27%2C%271%27%2C%270%27%2C%27%27%2C%27%27%2C%27%3C%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%270%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%27%27%2C%270%27%2C%271220758784%27%2C%271220759180%27%2C%270%27%2C%270%27%2C%27%27%2C%270%27%2C%270%27%2C%27%27%2C%27%27%2C%27%27%2C%270%27%2C%27%27%2C%27%3CA+class%3Ddefault2+%5Cr%5Cn++++++++++++++++++href%3D%5C%22%7Burl%3DNO%5C%22%7D%5C%22%5Cr%5Cn++++++++++++++++++target%3D_blank%3E%7Btitle%3DNO%3C%7D%3C%2FA%3E%27%2C%27%3Cdiv+id%3D%5C%22oZoom%5C%22+style%3D%5C%22zoom%3A100%25%3B%5C%22%3E%5Cr%5Cn%3C%21--function+content%28%
29+parse+begin--%3E%5Cr%5Cn%7Bcontent%3D%2A%7D%5Cr%5Cn%3C%21--function%3A+content%28%29+parse+end++0ms+cost%21+--%3E%5Cr%5Cn%3C%2Fdiv%3E%27%2C%27%27%2C%27%27%2C%270%27%29%3B
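The exported rule above is URL-encoded, which makes it hard to read. Here is a minimal Python sketch for decoding it back into a readable SQL statement. It assumes the export uses form-style encoding (`+` for spaces) and that the Chinese text is GBK-encoded; the function name `decode_rule` is hypothetical. Because the export is wrapped across two lines in the middle of a `%29` escape, the lines are joined before decoding.

```python
from urllib.parse import unquote_plus

def decode_rule(encoded: str, charset: str = "gbk") -> str:
    """Decode a URL-encoded rule export into readable SQL.

    Assumption: '+' stands for a space and non-ASCII bytes are in `charset`.
    """
    # Join wrapped lines first so an escape split across a line break is restored.
    joined = "".join(encoded.splitlines())
    return unquote_plus(joined, encoding=charset, errors="replace")

# Short fragment taken from the start of the rule above.
fragment = "INSERT+INTO+%60p8_gather_rule%60+%28%60id%60%2C+%60type%60"
print(decode_rule(fragment))
```

Decoding the full rule this way makes it easier to compare fields such as `listurl` and `content_rule` against the actual list page and content page URLs.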
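I don't know whether the gather rule is written incorrectly, or whether the hidden comment code makes the page impossible to gather.
I hope an expert can clear this up. Many thanks.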