Here are my quick findings: Memento (works):
$ curl -I https://swap.stanford.edu/20170628171729/https://www.waseda.jp/fcom/soc/ HTTP/1.1 200 OK TimeGate (works): $ curl -I https://swap.stanford.edu/https://www.waseda.jp/fcom/soc/ HTTP/1.1 302 Found Location: https://swap.stanford.edu/20170628171729/https://www.waseda.jp/fcom/soc/ TimeMap (fails): $ curl -I https://swap.stanford.edu/timemap/link/https://www.waseda.jp/fcom/soc/ HTTP/1.1 404 Not Found Calendar (fails): $ curl -I https://swap.stanford.edu/*/https://www.waseda.jp/fcom/soc/ HTTP/1.1 404 Not Found With this experiment in hand, I would think there might be some issue in how binary search is being performed in loaded CDX files when searching for a specific line vs. a range of lines. I would perhaps guess there might be some issue in how CDX files are sorted. It might be worth trying to sort them again with "LC_ALL=C" environment variable set and see if any of the CDX files are different from what they should be. Alternatively, I would manually locate the WARC file(s) with the help of the CDX file(s) that contain record of this URI. Then I would index those WARC files again and run a test instance of the replay system (with as much of the settings replicated from the production as possible) just on that subset to see how it behaves. Best, -- Sawood Alam Department of Computer Science Old Dominion University Norfolk VA 23529 On Thu, May 24, 2018 at 3:12 PM Nicholas Taylor <[email protected]> wrote: > Hello All, > > I've noticed that for some resources indexed in our OpenWayback instance, > individual mementos can be accessed, e.g., > https://swap.stanford.edu/20170628171729/https://www.waseda.jp/fcom/soc/ > but trying to access the timemap for all versions of the same memento gives > "Resource not in archive": > https://swap.stanford.edu/*/https://www.waseda.jp/fcom/soc/. > > Any suggestions on where to troubleshoot would be welcome. > > Thanks! > > ~Nicholas > > -- > You received this message because you are subscribed to the Google Groups > "openwayback-dev" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > Visit this group at https://groups.google.com/group/openwayback-dev. > To view this discussion on the web visit > https://groups.google.com/d/msgid/openwayback-dev/39c0633c-ed6e-43b9-96c6-b20a12202284%40googlegroups.com > <https://groups.google.com/d/msgid/openwayback-dev/39c0633c-ed6e-43b9-96c6-b20a12202284%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "openwayback-dev" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. Visit this group at https://groups.google.com/group/openwayback-dev. To view this discussion on the web visit https://groups.google.com/d/msgid/openwayback-dev/CALOnmf91NCwYCgSnd7u-irxP-eCwhVPOPLSL8YpZQnjFyU%2BPbA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.
