1回答

0收藏

如何用IIS限制屏蔽恶意搜索引擎爬虫禁止抓取网站?

系统运维 系统运维 51 人阅读 | 1 人回复 | 2024-05-05

iis环境下,如何限制屏蔽恶意搜索引擎爬虫禁止抓取网站?
"打赏他人,曝光自己,利他利我"
还没有人打赏,支持一下

回答|共 1 个

大神

发表于 2024-5-5 18:32:04 | 显示全部楼层

IIS7/IIS8/IIS10及以上web服务请在网站根目录下创建web.config文件,并写入如下代码即可:

  1. <?xml version="1.0" encoding="UTF-8"?>
  2. <configuration>
  3.   <system.webServer>
  4.    <rewrite>
  5.     <rules>
  6.      <rule name="Block spider">
  7.       <match url="(^robots.txt$)" ignoreCase="false" negate="true" />
  8.       <conditions>
  9.       <add input="{HTTP_USER_AGENT}" pattern="SemrushBot|Webdup|AcoonBot|AhrefsBot|Ezooms|EdisterBot|EC2LinkFinder|jikespider|Purebot|MJ12bot|WangIDSpider|WBSearchBot|Wotbox|xbfMozilla|Yottaa|YandexBot|Jorgee|SWEBot|spbot|TurnitinBot-Agent|mail.RU|curl|perl|Python|Wget|Xenu|ZmEu" ignoreCase="true" />
  10.       </conditions>
  11.        <action type="AbortRequest"/>
  12.      </rule>
  13.     </rules>
  14.    </rewrite>
  15.   </system.webServer>
  16. </configuration>
复制代码

IIS6请在isapi重写组件中添加以下规则:

  1. #Block spider
  2. RewriteCond %{HTTP_USER_AGENT} (MegaIndex|MegaIndex.ru|BLEXBot|Qwantify|qwantify|semrush|Semrush|serpstatbot|hubspot|python|Bytespider|Go-http-client|Java|PhantomJS|SemrushBot|Scrapy|Webdup|AcoonBot|AhrefsBot|Ezooms|EdisterBot|EC2LinkFinder|jikespider|Purebot|MJ12bot|WangIDSpider|WBSearchBot|Wotbox|xbfMozilla|Yottaa|YandexBot|Jorgee|SWEBot|spbot|TurnitinBot-Agent|mail.RU|perl|Python|Wget|Xenu|ZmEu|^$) [NC]
  3. RewriteRule !(^/robots.txt$) - [F]
复制代码



还没有人打赏,支持一下
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则