[ https://issues.apache.org/jira/browse/SOLR-2787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13109308#comment-13109308 ]
Mark Dickensob commented on SOLR-2787: -------------------------------------- Also bad IP addresses # Harvester Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)- Russia deny from 31.184.238. # Discobot deny from 38.101.148.126 # Harvester Washington, United States deny from 38.127.197.104 # Harvester Ukraine deny from 46.211.205.71 # Harvester Seattle, United States deny from 50.17.81.237 # Harvester Xiamen, China deny from 58.23.252.136 # Harvester Great Britain deny from 62.128.150.15 # Hacker New York, United States deny from 66.114.72.9 # Google!!! # deny from 66.249.71 # Harvester Massapequa, United States deny from 68.194.246.194 # Harvester Lake Orion, United States deny from 71.238.32.52 # Harvester San Marcos, United States deny from 72.199.108.105 # Hacker Russia deny from 77.221.130.4 # Harvester Germany deny from 79.143.182.232 # Harvester Germany deny from 79.143.182.232 # Sheffield, Great Britain deny from 81.105.137.203 # Harvester Israel deny from 82.166.235. # Hacker Höst, Germany deny from 83.169.6.156] # Harvester Netherlands deny from 85.17.147.193 # Harvester Netherlands deny from 85.201.16.158 # Harvester France deny from 87.98.187.40 # Harvester Spain deny from 87.98.228.22 # Hacker Bulgaria deny from 87.120.106.5 # Harvester Zdar Nad Sazavou, Czech Republic deny from 90.180.139.29 # Harvester London, Great Britain deny from 90.194.19. # Harvester London, Great Britain deny from 90.214.146.214 # Hacker Russian Federation deny from 91.195.124.8 # Harvester Netherlands deny from 93.190.136.5 # Harvester Italy deny from 94.23.65.72 # Hacker Bulgaria deny from 94.26.53.6 # Harvester Valencia, Spain deny from 95.19.216.61 # Harvester Germany deny from 95.169.160. # Amsterdam, Netherlands deny from 95.211.73.195 deny from trygoclio.com # Hacker El Segundo, United States deny from 96.46.227.5 # Harvester United States deny from 98.174.196.217 # Harvester United States deny from 108.27.42.190 # Fake Googlebot - Russia deny from 109.86.225.205 # Harvester Tel Aviv, Israel deny from 109.64.34.186 # Harvester Great Britain deny from 109.104.92.118 # Harvester China deny from 111.162.201.111 # Harvester China deny from 113.104.242.61 # Hacker Chinanet deny from 122.225.0.170 # Hacker Chinanet deny from 124.115.1. # Hacker Englewood, United States deny from 130.94.69.217 # Harvester Scranton, United States deny from 173.212.244.106 # Spectrum Adaptive Spider deny from 174.127.132 # Harvester China deny from 175.44.8.36 # Harvester Netherlands deny from 178.239.58.144 # Harvester São Paulo, Brazil deny from 201.95.81.134 # Atlanta, United States deny from 205.251.153.164 # Hacker USA deny from 208.79.212.174 # Ezooms deny from 208.115.111.67 # Harvester USA deny from 209.18.124.32 # Harvester Columbus, United States deny from 209.190.28.178 # Sitebot deny from 212.113.35.162 # Harvester United States, Kill subdomain deny from 212.124.113 # Hacker Great Britain deny from 213.40.79.217 # Harvester Spain deny from 213.149.247.102 # Beijing Harvester deny from 222.187.199.37 > add external http: include file reference for .htaccess processing > ------------------------------------------------------------------ > > Key: SOLR-2787 > URL: https://issues.apache.org/jira/browse/SOLR-2787 > Project: Solr > Issue Type: Improvement > Components: update > Affects Versions: 3.4 > Environment: All operating systems > Reporter: Mark Dickensob > Labels: Spam, killer > Original Estimate: 504h > Remaining Estimate: 504h > > Include an external link directive to an external http: file that supplies a > (.htaccess compatible) list of known bad bot sites. > ie common resource for spam kill list site(s) > Personally, I run a portal and I think that this feature is important to kill > spam! > I will supply the files for testing if you need them. > Mark goan.com -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org