• dushu.tw 读书网、小说免费阅读网站

笔趣阁(bqg44.net) 关关3.5版本采集规则

软件 举个栗子 76次浏览 未收录 0个评论 扫描二维码

经常用爱发电。不用则断

有空就搞几个规则来发布。因为免费规则,木有维护

自行学习。这个笔趣阁估计是最简单的/

关关采集器 适用于Windows 系统。

另外就是linux+远程桌面+wine(32)+宝塔+jieqi 1.7 可以搞。 不过Windows 最简单,安装好就可以弄
杰奇小说的专用采集器就是关关采集器
关关采集器3.5版本可以发布到很多版本。具体可以自己找来看看

<?xml version="1.0"?>
<RuleConfigInfo xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <RuleVersion>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>RuleVersion</RegexName>
  </RuleVersion>
  <RuleID>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>101</Pattern>
    <RegexName>RuleID</RegexName>
  </RuleID>
  <GetSiteName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>bqg44.net</Pattern>
    <RegexName>GetSiteName</RegexName>
  </GetSiteName>
  <GetSiteCharset>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern>utf-8</Pattern>
    <RegexName>GetSiteCharset</RegexName>
  </GetSiteCharset>
  <GetSiteUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase Singleline</Options>
    <Pattern>https://www.bqg44.net</Pattern>
    <RegexName>GetSiteUrl</RegexName>
  </GetSiteUrl>
  <NovelSearchUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelSearchUrl</RegexName>
  </NovelSearchUrl>
  <NovelSearchData>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelSearchData</RegexName>
  </NovelSearchData>
  <NovelSearch_GetNovelKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelSearch_GetNovelKey</RegexName>
  </NovelSearch_GetNovelKey>
  <NovelListUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern>https://www.bqg44.net</Pattern>
    <RegexName>NovelListUrl</RegexName>
  </NovelListUrl>
  <NovelListFilter>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelListFilter</RegexName>
  </NovelListFilter>
  <NovelList_GetNovelKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><a href="/book/(\d*)/" target="xs">(.+?)</a></em><em></Pattern>
    <RegexName>NovelList_GetNovelKey</RegexName>
  </NovelList_GetNovelKey>
  <NovelUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern>https://www.bqg44.net/book/{NovelKey}/</Pattern>
    <RegexName>NovelUrl</RegexName>
  </NovelUrl>
  <NovelName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><meta property="og:novel:book_name" content="(.+?)"></Pattern>
    <RegexName>NovelName</RegexName>
  </NovelName>
  <NovelErr>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase Singleline</Options>
    <Pattern>对不起,该文章不存在</Pattern>
    <RegexName>NovelErr</RegexName>
  </NovelErr>
  <NovelAuthor>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><meta property="og:novel:author" content="(.+?)"></Pattern>
    <RegexName>NovelAuthor</RegexName>
  </NovelAuthor>
  <Isboy>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>Isboy</RegexName>
  </Isboy>
  <LagerSort>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><meta property="og:novel:category" content="(.+?)"></Pattern>
    <RegexName>LagerSort</RegexName>
  </LagerSort>
  <SmallSort>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><meta property="og:novel:category" content="(.+?)"></Pattern>
    <RegexName>SmallSort</RegexName>
  </SmallSort>
  <NovelIntro>
    <FilterPattern>&nbsp;
</div>
<div>
</p>
<p>
作者:.+?<br>
最新章节 :.+?<br>
最新章节预览:</FilterPattern>
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><meta property="og:description" content="((.|\n)+?)"></Pattern>
    <RegexName>NovelIntro</RegexName>
  </NovelIntro>
  <NovelKeyword>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelKeyword</RegexName>
  </NovelKeyword>
  <NovelDegree>
    <FilterPattern>b♂连载中
a♂完本</FilterPattern>
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><meta property="og:novel:status" content="(.+?)"></Pattern>
    <RegexName>NovelDegree</RegexName>
  </NovelDegree>
  <NovelCover>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><meta property="og:image" content="(.+?)"></Pattern>
    <RegexName>NovelCover</RegexName>
  </NovelCover>
  <NovelDefaultCoverUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelDefaultCoverUrl</RegexName>
  </NovelDefaultCoverUrl>
  <NovelInfo_GetNovelPubKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelInfo_GetNovelPubKey</RegexName>
  </NovelInfo_GetNovelPubKey>
  <PubCookies>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubCookies</RegexName>
  </PubCookies>
  <PubIndexUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern>https://www.bqg44.net/book/{NovelKey}/</Pattern>
    <RegexName>PubIndexUrl</RegexName>
  </PubIndexUrl>
  <PubIndexErr>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern>无法找到该页</Pattern>
    <RegexName>PubIndexErr</RegexName>
  </PubIndexErr>
  <PubVolumeContent>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>章节目录</strong></div>((.|\n)+?)</div>\s*</div>\s*<div class="container"></Pattern>
    <RegexName>PubVolumeContent</RegexName>
  </PubVolumeContent>
  <PubVolumeSplit>
    <FilterPattern />
    <Method>Spilt</Method>
    <Options>IgnoreCase</Options>
    <Pattern>章节目录</strong></div></Pattern>
    <RegexName>PubVolumeSplit</RegexName>
  </PubVolumeSplit>
  <PubVolumeName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><div class="info-chapters-title"><strong>《.+?》(.+?)</strong></div></Pattern>
    <RegexName>PubVolumeName</RegexName>
  </PubVolumeName>
  <PubChapterName>
    <FilterPattern>xinbqg.com
www.xinbqg.com
m.xinbqg.com
http://
https://
新笔趣阁
&nbsp;
</div></FilterPattern>
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><a href="/book/\d*/\d*.html" title="(.+?)" target="zj"></Pattern>
    <RegexName>PubChapterName</RegexName>
  </PubChapterName>
  <PubChapter_GetChapterKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><a href="(/book/\d*/\d*.html)" title=".+?" target="zj"></Pattern>
    <RegexName>PubChapter_GetChapterKey</RegexName>
  </PubChapter_GetChapterKey>
  <PubContentUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern>https://www.bqg44.net{ChapterKey}</Pattern>
    <RegexName>PubContentUrl</RegexName>
  </PubContentUrl>
  <PubContentErr>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>您访问的页面可能暂时未更新、已更名或已经删除,请稍后访问或马上点此举报:</Pattern>
    <RegexName>PubContentErr</RegexName>
  </PubContentErr>
  <PubContent_GetTextKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContent_GetTextKey</RegexName>
  </PubContent_GetTextKey>
  <PubTextUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubTextUrl</RegexName>
  </PubTextUrl>
  <PubContentText>
    <FilterPattern><script>.+?</script>
xinbqg.com
www.xinbqg.com
m.xinbqg.com
https://
新笔趣阁
&nbsp;
</div>
<br /><br /><p>(.+?)</p>|</FilterPattern>
    <Method>Match</Method>
    <Options>IgnoreCase</Options>
    <Pattern><article id="article" class="content">((.|\n)+?)<div class="reader-hr"></div></Pattern>
    <RegexName>PubContentText</RegexName>
  </PubContentText>
  <PubContentPageArea>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentPageArea</RegexName>
  </PubContentPageArea>
  <PubContentPage>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentPage</RegexName>
  </PubContentPage>
  <PubContentPageUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentPageUrl</RegexName>
  </PubContentPageUrl>
  <PubContentPageKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentPageKey</RegexName>
  </PubContentPageKey>
  <PubContentChapterName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentChapterName</RegexName>
  </PubContentChapterName>
  <PubContentChapterNum>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentChapterNum</RegexName>
  </PubContentChapterNum>
  <PubContentImages>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentImages</RegexName>
  </PubContentImages>
  <PubContentReplace>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentReplace</RegexName>
  </PubContentReplace>
</RuleConfigInfo>

 

 

bqg44.net 关关采集器3.5版本规则


举个栗子 , 版权所有丨如未注明 , 均为原创丨本网站采用BY-NC-SA协议进行授权
转载请注明原文链接:笔趣阁(bqg44.net) 关关3.5版本采集规则
喜欢 (0)
举个栗子
关于作者:
建筑工地上施工员,闲暇时弄个博客打发时间,
发表我的评论
取消评论
表情 贴图 加粗 删除线 居中 斜体 签到