
Cloudflare automatically generates a robots.txt for websites

  •   VforVendetta · Jul 3, 2025 · 1958 views

    By default it blocks some LLM crawlers.

    # site through automated means, including any device, tool,
    # or process designed to data mine or scrape content, is
    # prohibited except (1) for the purpose of search engine indexing or
    # artificial intelligence retrieval augmented generation or (2) with express
    # written permission from this site’s operator.
    
    # To request permission to license our intellectual
    # property and/or other materials, please contact this
    # site’s operator directly.
    
    # BEGIN Cloudflare Managed content
    
    User-agent: Amazonbot
    Disallow: /
    
    User-agent: Applebot-Extended
    Disallow: /
    
    User-agent: Bytespider
    Disallow: /
    
    User-agent: CCBot
    Disallow: /
    
    User-agent: ClaudeBot
    Disallow: /
    
    User-agent: Google-Extended
    Disallow: /
    
    User-agent: GPTBot
    Disallow: /
    
    User-agent: meta-externalagent
    Disallow: /
    
    # END Cloudflare Managed Content
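
A quick way to see what these rules mean for a given crawler is to feed them to Python's standard `urllib.robotparser`. This is only a sketch: the `blocked_agents` helper and the `https://example.com/` URL are made up for illustration, and the result reflects how Python's parser reads the rules, not how any particular crawler actually behaves (robots.txt is advisory).

```python
from urllib.robotparser import RobotFileParser

# The managed rules quoted above, copied verbatim.
ROBOTS_TXT = """\
User-agent: Amazonbot
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: meta-externalagent
Disallow: /
"""


def blocked_agents(robots_txt, agents, url="https://example.com/"):
    """Return the agents that the given rules forbid from fetching url.

    Hypothetical helper; the default url is a placeholder, not a real site.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [agent for agent in agents if not parser.can_fetch(agent, url)]


if __name__ == "__main__":
    # Googlebot is not listed in the rules, so it stays allowed.
    print(blocked_agents(ROBOTS_TXT, ["GPTBot", "ClaudeBot", "Googlebot"]))
```

Because the file has no `User-agent: *` entry, any crawler not named in it, such as a regular search engine bot, is still allowed.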
    1 reply    2025-07-03 17:51:59 +08:00
    #1  laobaiguolai · Jul 3, 2025
    Take a look at your Cloudflare analytics: these crawlers hit the site very heavily. Blocking them is a good thing.