• xsz.tw 不带广告的小说站
  • down.tw 资源、下载站
  • dushu.tw 读书网、小说免费阅读网站

XXbiquge关关采集规则分享20200122

程序 举个栗子 5个月前 (01-22) 319次浏览 2个评论 扫描二维码

XXbiquge 关关采集规则分享 20200122

XXbiquge 关关采集规则分享 20200122老规矩,先介绍下几个正则符号用法。简单方便。可以自己写

介绍一下关关采规则当中需要用到的一些标签
\d* 表示数字 \s* 表示空格+换行 .+? 表示字符(不能为空) .* 表示字符(可以为空)
() 表示我们需要的部分 ((.|\n)*) 章节的内容部分,包括了换行。
=====与杰奇后台标签的对应关系=====
!!!! 相当于 ([^><]*)   ~~~~ 相当于 ([^><‘”]*)   ^^^^ 相当于 ([^><\d]*) $$$$ 相当于 ([\d]*) **** 相当于 (.*) 如果不行。就根据相关提示调整 复制代码保存为 xml 文件。放在关关规则文件夹里。在关关里面选择即可,规则适用于 V1.20.7.9 版本,关关文件夹日期:2016.4.28 这个版本的关关

<?xml version="1.0"?>
<RuleConfigInfo xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema">
  <RuleVersion>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>xsz.tw
233.tw </Pattern>
    <RegexName>RuleVersion</RegexName>
  </RuleVersion>
  <RuleID>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>91</Pattern>
    <RegexName>RuleID</RegexName>
  </RuleID>
  <GetSiteName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>XXbiquge.com</Pattern>
    <RegexName>GetSiteName</RegexName>
  </GetSiteName>
  <GetSiteCharset>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>utf-8</Pattern>
    <RegexName>GetSiteCharset</RegexName>
  </GetSiteCharset>
  <GetSiteUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>https://www.xxbiquge.com/</Pattern>
    <RegexName>GetSiteUrl</RegexName>
  </GetSiteUrl>
  <NovelSearchUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelSearchUrl</RegexName>
  </NovelSearchUrl>
  <NovelSearchData>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelSearchData</RegexName>
  </NovelSearchData>
  <NovelSearch_GetNovelKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelSearch_GetNovelKey</RegexName>
  </NovelSearch_GetNovelKey>
  <NovelListUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>https://www.xxbiquge.com/</Pattern>
    <RegexName>NovelListUrl</RegexName>
  </NovelListUrl>
  <NovelListFilter>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>NovelListFilter</RegexName>
  </NovelListFilter>
  <NovelList_GetNovelKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern><a href="/\d+_(\d+)/" target="_blank">(.+?)</a></Pattern>
    <RegexName>NovelList_GetNovelKey</RegexName>
  </NovelList_GetNovelKey>
  <NovelUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>https://www.xxbiquge.com/{NovelKey/1000}_{NovelKey}/</Pattern>
    <RegexName>NovelUrl</RegexName>
  </NovelUrl>
  <NovelErr>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>对不起,该文章不存在!</Pattern>
    <RegexName>NovelErr</RegexName>
  </NovelErr>
  <NovelName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:novel:book_name" content="(.+?)"</Pattern>
    <RegexName>NovelName</RegexName>
  </NovelName>
  <NovelAuthor>
    <FilterPattern><a.+?>
</a>
&nbsp;</FilterPattern>
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:novel:author" content="(.+?)"</Pattern>
    <RegexName>NovelAuthor</RegexName>
  </NovelAuthor>
  <LagerSort>
    <FilterPattern><a.+?>
</a>
&nbsp;</FilterPattern>
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:novel:category" content="(.+?)"</Pattern>
    <RegexName>LagerSort</RegexName>
  </LagerSort>
  <SmallSort>
    <FilterPattern><a.+?>
</a>
&nbsp;</FilterPattern>
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:novel:category" content="(.+?)"</Pattern>
    <RegexName>SmallSort</RegexName>
  </SmallSort>
  <NovelIntro>
    <FilterPattern><script((.|\n)*?)</script>
&lt;♂<
&gt;♂>
<a.+?</a>
</div>
</p></FilterPattern>
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:description" content="((.|\n)*?)"</Pattern>
    <RegexName>NovelIntro</RegexName>
  </NovelIntro>
  <NovelKeyword>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:novel:book_name" content="(.+?)"</Pattern>
    <RegexName>NovelKeyword</RegexName>
  </NovelKeyword>
  <NovelDegree>
    <FilterPattern>a♂已完结
b♂连载中</FilterPattern>
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:novel:status" content="(.+?)"</Pattern>
    <RegexName>NovelDegree</RegexName>
  </NovelDegree>
  <NovelCover>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:image" content="(.+?)"</Pattern>
    <RegexName>NovelCover</RegexName>
  </NovelCover>
  <NovelDefaultCoverUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>noimg.jpg</Pattern>
    <RegexName>NovelDefaultCoverUrl</RegexName>
  </NovelDefaultCoverUrl>
  <NovelInfo_GetNovelPubKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>og:novel:read_url" content="(.+?)"/></Pattern>
    <RegexName>NovelInfo_GetNovelPubKey</RegexName>
  </NovelInfo_GetNovelPubKey>
  <PubCookies>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubCookies</RegexName>
  </PubCookies>
  <PubIndexUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>{NovelPubKey}</Pattern>
    <RegexName>PubIndexUrl</RegexName>
  </PubIndexUrl>
  <PubIndexErr>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>获得目录页错误</Pattern>
    <RegexName>PubIndexErr</RegexName>
  </PubIndexErr>
  <PubVolumeContent>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubVolumeContent</RegexName>
  </PubVolumeContent>
  <PubVolumeSplit>
    <FilterPattern />
    <Method>Spilt</Method>
    <Options>None</Options>
    <Pattern><h3</Pattern>
    <RegexName>PubVolumeSplit</RegexName>
  </PubVolumeSplit>
  <PubVolumeName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>>(.+?)</h3></Pattern>
    <RegexName>PubVolumeName</RegexName>
  </PubVolumeName>
  <PubChapterName>
    <FilterPattern>~伪后记~|伪后记</FilterPattern>
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern><dd><a href="/\d+_\d+/\d+.html">(.+?)</a></dd></Pattern>
    <RegexName>PubChapterName</RegexName>
  </PubChapterName>
  <PubChapter_GetChapterKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern><dd><a href="(/\d+_\d+/\d+.html)">.+?</a></dd></Pattern>
    <RegexName>PubChapter_GetChapterKey</RegexName>
  </PubChapter_GetChapterKey>
  <PubContentUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>{ChapterKey}</Pattern>
    <RegexName>PubContentUrl</RegexName>
  </PubContentUrl>
  <PubContentErr>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>获得章节内容页错误</Pattern>
    <RegexName>PubContentErr</RegexName>
  </PubContentErr>
  <PubTextUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubTextUrl</RegexName>
  </PubTextUrl>
  <PubContentText>
    <FilterPattern><span.+?>|<font.+?>|<[Ss][Cc][Rr][Ii][Pp][Tt](.|\n)+?</[Ss][Cc][Rr][Ii][Pp][Tt]>|<[Ff][Oo][Nn][Tt](.|\n)*?</[Ff][Oo][Nn][Tt]>|<[Ii][Ff][Rr][Aa][Mm][Ee](.|\n)+?</[Ii][Ff][Rr][Aa][Mm][Ee]>|<[Aa].+?</[Aa]>|<[Dd][Ii][Vv].+?>|</[Dd][Ii][Vv]>|<!--.+?-->|<[Ss>][Pp][Aa][Nn](.|\n)*?</[Ss>][Pp][Aa][Nn]>|0.{0,10}0.{0,10}小.{0,10}说|</br>|<br>|本書首发于看書罔|未完待续|</span>|</>|</font>|\[\$|妙\]|\[笔|\$|i\]|\[-阁\]|com|\(。\)|U8\?小说|\?.\?|U\?8\?X\?S|\?U\?|8\?小说|U\?8\?X|S\?|\?U8|小说|U|8|\?X\s*\?|\?\?U|8 小|说\?|X|S`|[WwMm]+\.[0-9a-zA-Z]*\.[CcOoMmIiNnEeTtLlAa]|手机用户|请浏览|m.114zw.la|阅读|更优质的阅读体验|天才壹秒記住|114|中文网|』|ф|①|④ω|z|la|呅網|為您|提供精彩|小說閱讀|『|起点读书|最快更新|无弹窗请|&nbsp;&nbsp;&nbsp;&nbsp;
m.biquge.vip♂自适应小说站 xsz.tw
笔趣阁♂小说站</FilterPattern>
    <Method>Match</Method>
    <Options>Singleline</Options>
    <Pattern><div id="content">((.|\n)+?)<div class="bottem2"></Pattern>
    <RegexName>PubContentText</RegexName>
  </PubContentText>
  <PubContentPageUrl>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentPageUrl</RegexName>
  </PubContentPageUrl>
  <PubContentPageKey>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentPageKey</RegexName>
  </PubContentPageKey>
  <PubContentReplace>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern>[WwWwωщщψшШ].{0,3}[WwWwωщщψшШ].{0,3}[WwWwωщщψшШ].{0,3}[00OoOoο].{0,3}[00OoOoο].{0,3}[XxXxχ].{0,3}[SsSs].{0,7}[CcCcСΓ].{0,3}[00OoOoοó].{0,3}[MmMmМ]|[00OoOoο].{0,3}[00OoOoο].{0,3}[XxXxχ].{0,3}[SsSs].{0,7}[CcCcСΓ].{0,3}[00OoOoοó].{0,3}[MmMmМ]|[HhHΗh].{0,3}[TtTt].{0,3}[TtTt].{0,3}[PpPpρр]://|[WwWwωщщψ].{0,3}[WwWwωщщψ].{0,3}[WwWwωщщψ]|[WwWwωщщψ].{0,3}[AaàAaαа].{0,3}[PpPpρр]|[CcCcС].{0,3}[00OoOoο].{0,3}[MmMmМ]|[NnNnΠ∩η].{0,3}[EeEeε].{0,3}[TtTt]|[00OoOoο].{0,3}[RrRr].{0,3}[GgGg]|[CcCcС].{0,3}[NnNnΠ∩η]</Pattern>
    <RegexName>PubContentReplace</RegexName>
  </PubContentReplace>
  <PubContentChapterName>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentChapterName</RegexName>
  </PubContentChapterName>
  <PubContentChapterNum>
    <FilterPattern />
    <Method>Match</Method>
    <Options>None</Options>
    <Pattern />
    <RegexName>PubContentChapterNum</RegexName>
  </PubContentChapterNum>
</RuleConfigInfo>

举个栗子 , 版权所有丨如未注明 , 均为原创丨本网站采用BY-NC-SA协议进行授权
转载请注明原文链接:XXbiquge 关关采集规则分享 20200122
喜欢 (0)
举个栗子
关于作者:
建筑工地上施工员,闲暇时弄个博客打发时间,
发表我的评论
取消评论
表情 贴图 加粗 删除线 居中 斜体 签到

Hi,您需要填写昵称和邮箱!

  • 昵称 (必填)
  • 邮箱 (必填)
  • 网址
(2)个小伙伴在吐槽
  1. 能分享下这个关关版本么 谢谢 邮箱qqxufeng@gmail.com
    Donic2020-04-03 15:55 回复
    • 举个栗子
      可以在其他地方下载。百度下。很多。如果你加入小说群。很多也有
      举个栗子2020-04-03 20:56 回复