Hello, I am pleased to let you know that fourth notebook is ready for review at [0]. The notebook uses both WARC and WET format of data. Warcbase library has been extensively used. The notebook contains seven sections ending with a search engine built using Apache Lucene.
Documentation and blog for the notebook is ready at [1]. For ease of viewing of the notebook, a sample demo run of the notebook and demonstration of all features such as finding context of locations or running the search engine have been posted in a video at [2]. Meanwhile, I will start work on the fifth notebook on Stanford datasets. Thanks Alex for all your help. [0]. https://github.com/anish18sun/Zeppelin-Notebooks/tree/master/2BSD5NUW1 [1]. http://zeppelinnotes.blogspot.in/ [2]. https://drive.google.com/open?id=0ByXTtaL2yHBuU2tSTU1WeW1IRnc Thanks, Anish
