Re: [Xen-devel] Updates on the project

2017-04-17 Thread Gayathri Menakath
Hello Jesus,

I would like to thank you for the comments. I will look into the part where
it uploads the data to the Elasticsearch index and the jwzthreading.py. I
believe that I had mentioned in one of the IRC chats that I would be
reusing the jwzthreading.py. I am sorry if I hadn't mentioned it. However,
should I be making any changes to it?
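
For the upload step, here is a minimal sketch of one way the intermediate
JSON file could be skipped: build the request body for Elasticsearch's bulk
API in memory. The function name, the index name and the "Message-ID" and
"thread_id" keys are assumptions made for this example, not the code in my
repository, and older Elasticsearch versions also expect a "_type" in the
action line.

```python
import json
from typing import Iterable


def bulk_payload(messages: Iterable[dict], index: str) -> str:
    """Build a newline-delimited bulk body: one action line plus one
    source line per message, each terminated by a newline."""
    lines = []
    for msg in messages:
        # Action line: index this document, using the Message-ID as its id
        # (an assumption; any unique value would do).
        action = {"index": {"_index": index, "_id": msg["Message-ID"]}}
        lines.append(json.dumps(action))
        # Source line: the message document itself.
        lines.append(json.dumps(msg))
    return "".join(line + "\n" for line in lines)


# Hypothetical sample message and index name, for illustration only.
sample = [{"Message-ID": "<a@example.org>", "thread_id": "<a@example.org>"}]
payload = bulk_payload(sample, "xen-devel-2017-03")
print(payload, end="")
```

The resulting string could then be sent to the bulk endpoint in a single
request, without writing a file first.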

On Mon, Apr 17, 2017 at 4:48 AM, Jesus M. Gonzalez-Barahona <
j...@bitergia.com> wrote:

> On Sat, 2017-04-15 at 20:08 +0530, Gayathri Menakath wrote:
> > Hello Jesus,
> >
> > As my periodical exams were going on I could not spend much time on
> > writing the tests (2nd microtask). I will resume the work soon and
> > will send the updates. Along with my proposal, I have uploaded an
> > official letter from my university which states that I would not be
> > having any academic commitments for at least 8 weeks during the
> > coding period. I hope with that I would be able to meet the
> > eligibility criteria for Outreachy. I had sent a copy of the letter
> > to the Outreachy coordinators and Lars too.
>
> Thanks a lot for the update.
>
> > Meanwhile, may I know if you had reviewed the first microtask's code?
>
> Yes, I did. Some comments:
>
> * I've tested it with some mboxes, and it seems to work pretty well. A bit
> weird that you have to produce a JSON file, and then upload it to ES,
> instead of just uploading it to ES directly. But otherwise, it seems to
> work with the tests I did.
>
> * However, you had hardwired a path in jwzthreading.py, with (I
> presume) the directory where you store the mboxes. After changing it to
> mine, it worked like a charm.
>
> * BTW, I don't remember that you commented that you were using
> jwzthreading.py. That's not bad (reusing code which works is always a
> good option to consider), but makes the exercise different, since the
> implementation of the threading algorithm is in it.
>
> * The readme.md explains well how to run the scripts.
>
> Saludos,
>
> Jesus.
>
> > --
> > Yours Sincerely,
> > Gayathri.P.Menakath
> > B-Tech 3rd year,
> > Amrita University
> > blog | Github
> --
> Bitergia: http://bitergia.com
> /me at Twitter: https://twitter.com/jgbarah
>
>


-- 
Yours Sincerely,
Gayathri.P.Menakath
B-Tech 3rd year,
Amrita University <http://amrita.edu>
blog <https://gayathrimenakath96.wordpress.com/> | Github
<https://github.com/Gayathrimenakath>
___
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel


Re: [Xen-devel] [GSoC] Progress in the microtask

2017-03-26 Thread Gayathri Menakath
Hello Jesus and Lars,

I have completed the micro task and am now working on writing tests for my
code. The micro task was to write a Python script that uses the Perceval
email backend to retrieve data from the xen-devel mailing list, group the
messages into threads, and feed the result to an Elasticsearch database. To
identify the threads, the JWZ threading algorithm was applied to the data
retrieved from the xen-devel mailing list’s archive.

The result of the script is an Elasticsearch index with one JSON document
per message. Each document carries a value that is the same for all the
messages in the same thread; this value could be the Message-ID of the first
message in the thread. Each month’s archive will have a separate index.
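
As a rough illustration of the per-thread value described above, here is a
minimal sketch. It is not the JWZ algorithm and not the code in the
repository: it only follows References/In-Reply-To links up to a root
message. The header-style field names, the "thread_id" key and both helper
functions are assumptions made for this example.

```python
from typing import Dict, List, Optional


def thread_root(msg_id: str, parent_of: Dict[str, Optional[str]]) -> str:
    """Follow parent links upward until no further parent is known."""
    seen = {msg_id}
    current = msg_id
    while True:
        parent = parent_of.get(current)
        # Stop at a root, or when a reference loop is detected.
        if parent is None or parent in seen:
            return current
        seen.add(parent)
        current = parent


def add_thread_ids(messages: List[dict]) -> List[dict]:
    """Return copies of the messages with a shared 'thread_id' per thread."""
    parent_of: Dict[str, Optional[str]] = {}
    for msg in messages:
        refs = msg.get("References", "").split()
        in_reply_to = msg.get("In-Reply-To", "").strip()
        # Prefer the last entry in References, fall back to In-Reply-To.
        parent_of[msg["Message-ID"]] = refs[-1] if refs else (in_reply_to or None)
    return [
        dict(msg, thread_id=thread_root(msg["Message-ID"], parent_of))
        for msg in messages
    ]


# Hypothetical messages, for illustration only.
sample = [
    {"Message-ID": "<a@example.org>"},
    {"Message-ID": "<b@example.org>", "In-Reply-To": "<a@example.org>"},
    {"Message-ID": "<c@example.org>",
     "References": "<a@example.org> <b@example.org>"},
    {"Message-ID": "<d@example.org>"},
]
threaded = add_thread_ids(sample)
for doc in threaded:
    print(doc["Message-ID"], "->", doc["thread_id"])
```

If a parent message is missing from the archive, its Message-ID still
serves as the shared value, so the replies to it stay grouped together.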

A script to perform a search has also been written. It retrieves the data
from the Elasticsearch index and searches it by a specified field of the
message. The task has been completed and has helped me to gain knowledge
about Perceval and Elasticsearch. The code has been uploaded to my Github
account [1].
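
To give an idea of the search by field, here is a minimal sketch that only
builds a query body in Elasticsearch's query DSL (a "match" query on one
field). The function name, the default size and the sample field and value
are assumptions made for this example; the script in the repository may
build its query differently.

```python
import json


def build_match_query(field: str, value: str, size: int = 10) -> dict:
    """Build a query body that matches `value` against a single field."""
    return {
        "size": size,                        # maximum number of hits returned
        "query": {"match": {field: value}},  # full-text match on one field
    }


# Hypothetical field and value, for illustration only.
body = build_match_query("Subject", "microtask")
print(json.dumps(body, indent=2))
```

With the official Python client, a body like this would typically be passed
to the client's search call together with the index name; the exact
arguments depend on the client version.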


On Sun, Mar 26, 2017 at 12:07 PM, Gayathri Menakath <
gayathrimenak...@gmail.com> wrote:

> Hello Jesus and Lars,
>
> I have uploaded the data to the Elasticsearch index. As a part of testing
> it, I have uploaded only some of the fields in the JSON file (Sender, From
> and Message ID). I have also written a Python script to perform a search on
> the data in Elasticsearch index. The code has been uploaded to my Github
> account [1]. Please review it.
>
> If you are available this week, could we have an IRC meeting so that I
> can find out whether any changes need to be made to the current code and
> what is to be done next? Looking forward to your response.
>
> [1]: https://github.com/Gayathrimenakath/Xen-project
>
>
> On Thu, Mar 23, 2017 at 4:04 AM, Jesus M. Gonzalez-Barahona <
> j...@bitergia.com> wrote:
>
>> On Wed, 2017-03-22 at 23:24 +0530, Gayathri Menakath wrote:
>> > Hello Jesus and Lars,
>> >
>> > As I had mentioned earlier, I was facing an error in the threading
>> > algorithm. I have fixed it and now I am able to get the threaded
>> > output. I have uploaded the code to my github account[1]. Soon I will
>> > upload the data to the Elasticsearch index.
>>
>> Thanks a lot, I will review it asap.
>>
>> > Due to Outreachy's strict time constraints, I came to know that I
>> > will not be able to apply for this round. However I am eligible to
>> > apply for GSoC and will be applying for the same.
>> >
>> > [1]: https://github.com/Gayathrimenakath/Xen-project
>>
>> OK. Thanks for reporting.
>>
>> Jesus.
>>
>> --
>> Bitergia: http://bitergia.com
>> /me at Twitter: https://twitter.com/jgbarah
>>
>>
>
>
> --
> Yours Sincerely,
> Gayathri.P.Menakath
> B-Tech 3rd year,
> Amrita University <http://amrita.edu>
> blog <https://gayathrimenakath96.wordpress.com/> | Github
> <https://github.com/Gayathrimenakath>
>



-- 
Yours Sincerely,
Gayathri.P.Menakath
B-Tech 3rd year,
Amrita University <http://amrita.edu>
blog <https://gayathrimenakath96.wordpress.com/> | Github
<https://github.com/Gayathrimenakath>