magibney commented on PR #846:
URL: https://github.com/apache/solr/pull/846#issuecomment-1125007411

   I realize this PR is merged (and thanks!), but I have a couple of questions 
that follow logically on the conversation here, so:
   1. Noticing that we don't have (and haven't historically had, I think) an 
old-style `robots.txt`, I wonder: is old-style `robots.txt` completely 
obviated? i.e., should we not bother having one?
   2. Sitemap.xml is being generated I think, and is [present in 
nightlies](https://nightlies.apache.org/solr/draft-guides/solr-reference-guide-nightly/sitemap.xml).
 But it doesn't appear to be accessible on the main site. I think it was 
previously present. Is there still a purpose served by sitemap.xml? Currently 
the nightlies version looks like it points to all (antora) versions -- I'm not 
sure whether we'd want to pare down the referenced pages to make sitemap.xml a 
proper complement to "canonical, no-index/no-follow/no-archive" approach taken 
by this PR?
   3. If sitemap.xml is still relevant and we want to make it accessible, I 
think the sitemap spec calls for sitemap.xml to be referenced from an old-style 
robots.txt ... I'm not aware of other/newer approaches to referencing 
sitemap.xml.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org

Reply via email to