Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
GraceDB Server
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Iterations
Requirements
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Locked files
Build
Pipelines
Jobs
Pipeline schedules
Test cases
Artifacts
Deploy
Releases
Container Registry
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Code review analytics
Issue analytics
Insights
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
IGWN Computing and Software
GraceDB
GraceDB Server
Commits
50a1561f
Commit
50a1561f
authored
3 years ago
by
Alexander Pace
Browse files
Options
Downloads
Patches
Plain Diff
clean up and robots changes
parent
37e08957
No related branches found
Branches containing commit
No related tags found
Tags containing commit
No related merge requests found
Pipeline
#350117
passed
3 years ago
Stage: test
Stage: branch
Changes
2
Pipelines
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
config/gunicorn_config.py
+0
-2
0 additions, 2 deletions
config/gunicorn_config.py
gracedb/static/robots.txt
+129
-0
129 additions, 0 deletions
gracedb/static/robots.txt
with
129 additions
and
2 deletions
config/gunicorn_config.py
+
0
−
2
View file @
50a1561f
...
...
@@ -35,8 +35,6 @@ timeout = 300
#max_requests = 0
#max_requests_jitter = 0
# Enable connection keepalive:
#keepalive = 60
# Logging ---------------------------------------------------------------------
# Access log
...
...
This diff is collapsed.
Click to expand it.
gracedb/static/robots.txt
+
129
−
0
View file @
50a1561f
...
...
@@ -10,6 +10,7 @@
# visible too.
User-agent: *
Crawl-delay: 1024
Disallow: /api/
Disallow: /apiweb/
Disallow: /superevents/
...
...
@@ -21,3 +22,131 @@ Disallow: /alerts/
Disallow: /Shibboleth.sso/
Disallow: /shibboleth-sp/
Disallow: /login/
#Turn off bad bots.
User-agent: Slurp
Disallow: /
#Block bots
User-agent: A6-Indexer
Disallow: /
User-agent: AlphaSeoBot
Disallow: /
User-agent: AlphaSeoBot-SA
Disallow: /
# RDH, 08.19.19: I really don't want to block Applebot, but for now, I am. It is crawling us too much
User-agent: Applebot
Disallow: /
User-agent: AspiegelBot
Disallow: /
User-agent: barkrowler
Disallow: /
# RDH, 05.13.20: I really don't want to block bing, but for now, I am. It is also already in htaccess rules
User-agent: bingbot/2.0
Disallow: /
User-agent: Blackboard Safeassign
Disallow: /
User-agent: BLEXBot
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: crawler4j
Disallow: /
User-agent: DotBot
Disallow: /
# RDH, 06.30.21: Very temporary to get some relief.
User-Agent: Googlebot
Disallow: /
User-agent: Gigabot
Disallow: /
User-agent: LieBaoFast
Disallow: /
User-agent: MauiBot
Disallow:/
User-agent: MauiBot (crawler.feedback+wc@gmail.com)
Disallow: /
User-agent: MegaIndex.ru/2.0
Disallow: /
User-agent: MqqBrowser
Disallow:/
User-agent: Nimbostratus-Bot/v1.3.2
Disallow: /
User-agent: Qwant-news
Disallow: /
User-agent: Qwantify
Disallow: /
User-agent: Seekport Crawler
Disallow: /
User-agent: SemrushBot
Disallow: /
User-agent: SemrushBot-SA
Disallow: /
User-agent: SeznamBot
Disallow: /
User-agent: SputnikBot/2.3
Disallow:/
User-agent: The Knowledge AI
Disallow:/
User-agent: TinyTestBot
Disallow: /
User-agent: TurnitinBot
Disallow: /
User-agent: UCBrowser
Disallow: /
User-agent: yacybot
Disallow: /
User-agent: YandexBot
Disallow: /
User-agent: Yeti
Disallow: /
User-agent: YisouSpider
Disallow: /
User-agent: Facebot Twitterbot
Disallow: /
User-agent: FacebookExternalHit
Disallow: /
User-agent: Mattermost-Bot
Disallow: /
User-agent: AhrefsBot
Disallow: /
User-agent: ZumBot
Disallow: /
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment