Here are my quick findings:

Memento (works):

$ curl -I
https://swap.stanford.edu/20170628171729/https://www.waseda.jp/fcom/soc/
HTTP/1.1 200 OK


TimeGate (works):

$ curl -I https://swap.stanford.edu/https://www.waseda.jp/fcom/soc/
HTTP/1.1 302 Found
Location:
https://swap.stanford.edu/20170628171729/https://www.waseda.jp/fcom/soc/


TimeMap (fails):

$ curl -I
https://swap.stanford.edu/timemap/link/https://www.waseda.jp/fcom/soc/
HTTP/1.1 404 Not Found


Calendar (fails):

$ curl -I https://swap.stanford.edu/*/https://www.waseda.jp/fcom/soc/
HTTP/1.1 404 Not Found


With this experiment in hand, I would think there might be some issue in
how binary search is being performed in loaded CDX files when searching for
a specific line vs. a range of lines. I would perhaps guess there might be
some issue in how CDX files are sorted. It might be worth trying to sort
them again with "LC_ALL=C" environment variable set and see if any of the
CDX files are different from what they should be. Alternatively, I would
manually locate the WARC file(s) with the help of the CDX file(s) that
contain record of this URI. Then I would index those WARC files again and
run a test instance of the replay system (with as much of the settings
replicated from the production as possible) just on that subset to see how
it behaves.

Best,

--
Sawood Alam
Department of Computer Science
Old Dominion University
Norfolk VA 23529



On Thu, May 24, 2018 at 3:12 PM Nicholas Taylor <[email protected]> wrote:

> Hello All,
>
> I've noticed that for some resources indexed in our OpenWayback instance,
> individual mementos can be accessed, e.g.,
> https://swap.stanford.edu/20170628171729/https://www.waseda.jp/fcom/soc/
> but trying to access the timemap for all versions of the same memento gives
> "Resource not in archive":
> https://swap.stanford.edu/*/https://www.waseda.jp/fcom/soc/.
>
> Any suggestions on where to troubleshoot would be welcome.
>
> Thanks!
>
> ~Nicholas
>
> --
> You received this message because you are subscribed to the Google Groups
> "openwayback-dev" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> Visit this group at https://groups.google.com/group/openwayback-dev.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/openwayback-dev/39c0633c-ed6e-43b9-96c6-b20a12202284%40googlegroups.com
> <https://groups.google.com/d/msgid/openwayback-dev/39c0633c-ed6e-43b9-96c6-b20a12202284%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"openwayback-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
Visit this group at https://groups.google.com/group/openwayback-dev.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/openwayback-dev/CALOnmf91NCwYCgSnd7u-irxP-eCwhVPOPLSL8YpZQnjFyU%2BPbA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to