Hi Mohammed,

That hasn't been determined. Suggestions are welcome!

Best,
Kris

> -----Original Message-----
> From: [email protected] [mailto:openwayback-
> [email protected]] On Behalf Of Mohamed Elsayed
> Sent: 6. september 2016 11:41
> To: openwayback-dev
> Subject: [openwayback-dev] Re: Summary of OpenWayback call 17/08/2016
>
> Where are we going to share WARC dataset for testing OWB 3?
>
> Thank you.
>
> On Friday, August 19, 2016 at 12:06:30 PM UTC+2, Kristinn Sigurðsson wrote:
>
>       Dear all,
>
>       An OWB call was held on 17/08/16 @ 15:00 UTC. No agenda was sent
> out.
>
>       The following topics were discussed.
>
>
>       John Erik expects the new CDX server tools to be ready for testing very
> soon. This includes tools to generate the new CDXJ files as well as the CDX
> server itself. The CDX server will not be feature complete but should support
> the main use cases.
>
>
>       John Erik raised the question whether the CDX server should be packed
> as a servlet (WAR) that is deployed into a web server (e.g. Tomcat) or if it
> should be published as a stand-alone utility (effectively embedding the web
> server). Doing so may reduce the complexity of setup and allow us to choose
> the most appropriate server. Currently John Erik is considering Grizzly for 
> this
> (https://grizzly.java.net/dependencies.html
> <https://grizzly.java.net/dependencies.html>  ). Comments on this are most
> welcome!
>
>
>       With a major new piece needing testing Sawood raised the idea of a
> standard WARC dataset for testing. This has been discussed before and usually
> is well received in principle but (so far) no one has volunteered to put 
> together
> a suitable dataset. We'd very much welcome such volunteers!
>
>
>       There was some discussion about the practical differences between the
> Memento API and the CDX server API.
>
>
>       Mohammed raised a question about input sanitization on URLs
> searched for in OWB. The general consensus was that the search JSP pages
> might benefit from preventing some obvious data entry errors (repeated
> protocol for example) but that any API level interfaces should leave this to 
> the
> caller.
>
>
>       Sawood advocated that issue
> https://github.com/iipc/openwayback/issues/285
> <https://github.com/iipc/openwayback/issues/285>  be considered for the CDX
> server. I.e. that the cdx server advertise its version number in HTTP response
> headers.
>
>
>       There was a brief discussion on URI canonicalization. Existing
> canonicalizers can be over aggressive (e.g. down casing the entire URL). OWB 3
> will include a new canonicalizer.
>
>
>       The next OWB call will be on September 7 @ 15:00 UTC.
>
>
>       Best,
>       Kristinn
>       
> -------------------------------------------------------------------------
>       Landsbókasafn Íslands - Háskólabókasafn | Arngrímsgötu 3 - 107
> Reykjavík
>       Sími/Tel: +354 5255600 | www.landsbokasafn.is
>       
> -------------------------------------------------------------------------
>       fyrirvari/disclaimer - http://fyrirvari.landsbokasafn.is
> <http://fyrirvari.landsbokasafn.is>
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "openwayback-dev" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"openwayback-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to