Re: GSoC 2015 with Apache any23
Hi all, Here with I have attached a detailed analysis of diference between microfrmats 1 and 2. see this link [1]. As a sample I have implemented extractors HAdr and HGeo with relavant unit test cases. According to the Microformats 2 spec HAdr can have HGeo as a nested property. I have includes the sample test cases to cover this scenario. See my fork under gsoc_2015 branch [2] please see the recent commit added HAdr and HGeo Extractors support. I can basically cover the rest of the using this approach. I am expecting a feedback so that I can move forward. Also note I have added week 2 and week 3 reports to the wiki [1] https://docs.google.com/document/d/17DXcr8jv7D41FOuR1Rba0Ex-4VVheMNInmBSygkih5E/edit?usp=sharing [2] https://github.com/nisalanirmana/any23/commits/gsoc_2015 regards,nisala On Monday, June 15, 2015, Michele Mostarda michele.mosta...@gmail.com wrote: Great job Nisala! Best Michele On 13 June 2015 at 20:28, Lewis John Mcgibbney lewis.mcgibb...@gmail.com javascript:_e(%7B%7D,'cvml','lewis.mcgibb...@gmail.com'); wrote: Excellent Nisala. I committed your patch and your report looks good for now I am making some comments. Lewis On Mon, Jun 8, 2015 at 10:36 AM, Nisala Mendis nisal...@gmail.com javascript:_e(%7B%7D,'cvml','nisal...@gmail.com'); wrote: Hi Lewis, Please note that I have added week 1 report and added a comment attached patch to the jira[1]. [1]https://issues.apache.org/jira/browse/ANY23-185 Regards Nisala On Mon, Jun 1, 2015 at 12:45 AM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com javascript:_e(%7B%7D,'cvml','lewis.mcgibb...@gmail.com'); wrote: https://cwiki.apache.org/confluence/display/ANY23/Google+Summer+of+Code#GoogleSummerofCode-ProjectReports On Sun, May 31, 2015 at 12:09 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com javascript:_e(%7B%7D,'cvml','lewis.mcgibb...@gmail.com'); wrote: Hi Nisala, On Sun, May 31, 2015 at 11:10 AM, Nisala Mendis nisal...@gmail.com javascript:_e(%7B%7D,'cvml','nisal...@gmail.com'); wrote: Hi Lewis, I really apologize for not sending any updates via private mails, I had uploaded my proposal already even before the GSoC selection period, Once again I apologize for not informing you by mail this since I assumed you noticed it. Thanks for the update and no problems at all. It was just a bit concerning that around a month had gone by and we hadn't heard from you :) I started working things bit out waited to give you an update some concrete work. You can find my proposal in [1]. [1] https://cwiki.apache.org/confluence/display/ANY23/Google+Summer+of+Code I've added a section for your reports. Please begin to populate the page based upon anything you've been doing. Thank you for writing to us, it's very much appreciated. Lewis -- *Lewis* -- *Lewis* -- Michele Mostarda Senior Software Engineer skype: michele.mostarda phone: +39 327 01 86 259 twitter: micmos email: m...@michelemostarda.it javascript:_e(%7B%7D,'cvml','m...@michelemostarda.it'); website: http://michelemostarda.it
Re: GSoC 2015 with Apache any23
Hi Lewis, Please note that I have added week 1 report and added a comment attached patch to the jira[1]. [1]https://issues.apache.org/jira/browse/ANY23-185 Regards Nisala On Mon, Jun 1, 2015 at 12:45 AM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: https://cwiki.apache.org/confluence/display/ANY23/Google+Summer+of+Code#GoogleSummerofCode-ProjectReports On Sun, May 31, 2015 at 12:09 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Nisala, On Sun, May 31, 2015 at 11:10 AM, Nisala Mendis nisal...@gmail.com wrote: Hi Lewis, I really apologize for not sending any updates via private mails, I had uploaded my proposal already even before the GSoC selection period, Once again I apologize for not informing you by mail this since I assumed you noticed it. Thanks for the update and no problems at all. It was just a bit concerning that around a month had gone by and we hadn't heard from you :) I started working things bit out waited to give you an update some concrete work. You can find my proposal in [1]. [1] https://cwiki.apache.org/confluence/display/ANY23/Google+Summer+of+Code I've added a section for your reports. Please begin to populate the page based upon anything you've been doing. Thank you for writing to us, it's very much appreciated. Lewis -- *Lewis*
[jira] [Commented] (ANY23-185) Add missing meta element attributes to HTMLMetaExtractor
[ https://issues.apache.org/jira/browse/ANY23-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14569593#comment-14569593 ] Nisala Mendis commented on ANY23-185: - Hi Lewis John McGibbney, One thing I have notice in the spec[1] itemprop is stated as unimplemented, and I cant really find a reason behind this labeling. WDYT ? --- if the itemprop is set, a user-defined metadata, transparent for the user-agent as the semantics of the metadata is user-specific. Living Standard Unimplemented --- Also for the cases other cases name and http-quiv and charset ( attribute type), this labeling has not been used and more information is given for these case where as for the itemprop case no any other information or sample case is provided. ( see examples in the bottom of the the spec ) So my patch is written considering these 3 cases except itemprop. I am more than happy to include this case of itemprop if more information is available. :) [1]https://developer.mozilla.org/en-US/docs/Web/HTML/Element/meta Regards Nisala Add missing meta element attributes to HTMLMetaExtractor -- Key: ANY23-185 URL: https://issues.apache.org/jira/browse/ANY23-185 Project: Apache Any23 Issue Type: Improvement Components: core Affects Versions: 0.9.0 Reporter: Lewis John McGibbney Assignee: Nisala Mendis Fix For: 1.2 Attachments: ANY23-185_patch.patch Right now we only pick up 'name' and 'content' attributes within HTMLMetaExtractor. We should add missing attributes as per the Mozilla specification https://developer.mozilla.org/en-US/docs/Web/HTML/Element/meta -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Re: GSoC 2015 with Apache any23
Hi Lewis, I really apologize for not sending any updates via private mails, I had uploaded my proposal already even before the GSoC selection period, Once again I apologize for not informing you by mail this since I assumed you noticed it. I started working things bit out waited to give you an update some concrete work. You can find my proposal in [1]. [1]https://cwiki.apache.org/confluence/display/ANY23/Google+Summer+of+Code Regards Nisala On Sun, May 31, 2015 at 11:03 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Nisala, Thanks for getting in touch I was starting to get a bit worried as I hadn't heard from you and had emailed you a several times. I t is really encouraging to see that you are still working on Google Summer of Code as I thought that you had maybe changed your mind or that your plans had changed. It would have been nice to see you on the user@ and/or dev@ list during the community bonding period, however this time frame is now past and we have missed the opportunity for that one. I would like you to send me your wiki username for the Any23 wiki, we need you to put your proposal on there and begin writing up your intentions moving forward. This is a much better way for us to work things as oppose to strictly over email. Please keep working on Jira issues as you see fit it. Thanks Lewis [0] https://cwiki.apache.org/confluence/display/ANY23/Any23+Wiki+Home+Index On Sun, May 31, 2015 at 4:17 AM, Nisala Mendis nisal...@gmail.com wrote: Hi all, Just to give you an update about the work which I carried through out the past few weeks. I have been working with ANY23-185[1] issue related to HTMLMetaExtractor which is somewhat related to my project related extractors and already attached a patch to to JIRA with added more JUnit test cases. As I noticed still there exist a unstable build for any23 and I have disabled some test cases locally already related to core component to pass the test cases thus I am not exactly sure if a further test fails, but I have made changes to failed some of test cases of Any23Test because of the changes of HTMLMetaExtractor. I have already looked into the current extractors of microformats and details to the specification. But I kindly request from my mentors, could you please provide me some feedback regarding my proposal, so that I can make some improvements along the way of implementation. Further I want to mention that I will be working on the following fork for the time being. I will update my work more frequently now onwards. [1]https://issues.apache.org/jira/browse/ANY23-185 [2]https://github.com/nisalanirmana/any23 Regards Nisala On Wed, May 6, 2015 at 11:21 PM, Michele Mostarda michele.mosta...@gmail.com wrote: Hi Nisala, On 29 April 2015 at 22:35, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Nisala, On Wed, Apr 29, 2015 at 12:19 PM, dev-digest-h...@any23.apache.org wrote: I must really thankful for Apache Any23 community for accepting my proposal titled Microformats2 support for Apache any23 and also Chris and Lewis who volunteered to mentor my project for Google summer of code 2015. I am very excited and honored to be selected and I am looking forward to begin working with both of you as well as the Apache any23 community. I'm happy too to read your enthusiasm! I'm also really happy to find out that the project was accepted. I think that Michele Mostarda will be involved in your mentoring as well. Yes I confirm I will support as co-mentor together with Lewis. Please give us some time to synchronize on the mentoring side and we can begin to get working ;) Lewis, do you manage to have a chat let say tomorrow (07 of May)? In the meantime, we have an unstable build. During the next week, you can get to work by picking up some issues on the JIRA https://issues.apache.org/jira/browse/ANY23 Thanks Nisala Thanks Lewis Best Michele -- Michele Mostarda Senior Software Engineer skype: michele.mostarda twitter: micmos mail: m...@michelemostarda.it site: http://michelemostarda.it -- *Lewis*
[jira] [Commented] (ANY23-185) Add missing meta element attributes to HTMLMetaExtractor
[ https://issues.apache.org/jira/browse/ANY23-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14566507#comment-14566507 ] Nisala Mendis commented on ANY23-185: - Hi Lewis, I have attached patch for the following issue. Please review the patch. Regards Nisala Add missing meta element attributes to HTMLMetaExtractor -- Key: ANY23-185 URL: https://issues.apache.org/jira/browse/ANY23-185 Project: Apache Any23 Issue Type: Improvement Components: core Affects Versions: 0.9.0 Reporter: Lewis John McGibbney Fix For: 1.3 Attachments: ANY23-185_patch.patch Right now we only pick up 'name' and 'content' attributes within HTMLMetaExtractor. We should add missing attributes as per the Mozilla specification https://developer.mozilla.org/en-US/docs/Web/HTML/Element/meta -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Re: GSoC 2015 with Apache any23
Hi all, Just to give you an update about the work which I carried through out the past few weeks. I have been working with ANY23-185[1] issue related to HTMLMetaExtractor which is somewhat related to my project related extractors and already attached a patch to to JIRA with added more JUnit test cases. As I noticed still there exist a unstable build for any23 and I have disabled some test cases locally already related to core component to pass the test cases thus I am not exactly sure if a further test fails, but I have made changes to failed some of test cases of Any23Test because of the changes of HTMLMetaExtractor. I have already looked into the current extractors of microformats and details to the specification. But I kindly request from my mentors, could you please provide me some feedback regarding my proposal, so that I can make some improvements along the way of implementation. Further I want to mention that I will be working on the following fork for the time being. I will update my work more frequently now onwards. [1]https://issues.apache.org/jira/browse/ANY23-185 [2]https://github.com/nisalanirmana/any23 Regards Nisala On Wed, May 6, 2015 at 11:21 PM, Michele Mostarda michele.mosta...@gmail.com wrote: Hi Nisala, On 29 April 2015 at 22:35, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Nisala, On Wed, Apr 29, 2015 at 12:19 PM, dev-digest-h...@any23.apache.org wrote: I must really thankful for Apache Any23 community for accepting my proposal titled Microformats2 support for Apache any23 and also Chris and Lewis who volunteered to mentor my project for Google summer of code 2015. I am very excited and honored to be selected and I am looking forward to begin working with both of you as well as the Apache any23 community. I'm happy too to read your enthusiasm! I'm also really happy to find out that the project was accepted. I think that Michele Mostarda will be involved in your mentoring as well. Yes I confirm I will support as co-mentor together with Lewis. Please give us some time to synchronize on the mentoring side and we can begin to get working ;) Lewis, do you manage to have a chat let say tomorrow (07 of May)? In the meantime, we have an unstable build. During the next week, you can get to work by picking up some issues on the JIRA https://issues.apache.org/jira/browse/ANY23 Thanks Nisala Thanks Lewis Best Michele -- Michele Mostarda Senior Software Engineer skype: michele.mostarda twitter: micmos mail: m...@michelemostarda.it site: http://michelemostarda.it
[jira] [Updated] (ANY23-185) Add missing meta element attributes to HTMLMetaExtractor
[ https://issues.apache.org/jira/browse/ANY23-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nisala Mendis updated ANY23-185: Attachment: ANY23-185_patch.patch attached patch for the issue Add missing meta element attributes to HTMLMetaExtractor -- Key: ANY23-185 URL: https://issues.apache.org/jira/browse/ANY23-185 Project: Apache Any23 Issue Type: Improvement Components: core Affects Versions: 0.9.0 Reporter: Lewis John McGibbney Fix For: 1.3 Attachments: ANY23-185_patch.patch Right now we only pick up 'name' and 'content' attributes within HTMLMetaExtractor. We should add missing attributes as per the Mozilla specification https://developer.mozilla.org/en-US/docs/Web/HTML/Element/meta -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Re: GSoC 2015 with Apache any23
Hi all, I must really thankful for Apache Any23 community for accepting my proposal titled Microformats2 support for Apache any23 and also Chris and Lewis who volunteered to mentor my project for Google summer of code 2015. I am very excited and honored to be selected and I am looking forward to begin working with both of you as well as the Apache any23 community. Regards Nisala On Fri, Mar 27, 2015 at 9:01 PM, Nisala Mendis nisal...@gmail.com wrote: Hi all, I have updated the any23 wiki and posted my proposal in Google melange. [1] http://www.google-melange.com/gsoc/proposal/public/google/gsoc2015/nisala12/5629499534213120 regards Nisala On Thu, Mar 26, 2015 at 7:13 PM, Nisala Mendis nisal...@gmail.com wrote: Hi Lewis, Thanks for adding me to the wiki. Can you please give some comments to my previous mail on this thread regarding the microformat parser? Regards Nisala On Thu, Mar 26, 2015 at 7:07 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Nisala, On Thu, Mar 26, 2015 at 12:23 AM, dev-digest-h...@any23.apache.org wrote: Hi all, The test failures I came across have been reported recently with https://issues.apache.org/jira/browse/ANY23-256. Can I have access to the ANY23 wiki, my user name is: nisala12. Regards Nisala DONE. Apologies for the delay. Best. Lewis
Re: GSoC 2015 with Apache any23
Hi all, I have updated the any23 wiki and posted my proposal in Google melange. [1] http://www.google-melange.com/gsoc/proposal/public/google/gsoc2015/nisala12/5629499534213120 regards Nisala On Thu, Mar 26, 2015 at 7:13 PM, Nisala Mendis nisal...@gmail.com wrote: Hi Lewis, Thanks for adding me to the wiki. Can you please give some comments to my previous mail on this thread regarding the microformat parser? Regards Nisala On Thu, Mar 26, 2015 at 7:07 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Nisala, On Thu, Mar 26, 2015 at 12:23 AM, dev-digest-h...@any23.apache.org wrote: Hi all, The test failures I came across have been reported recently with https://issues.apache.org/jira/browse/ANY23-256. Can I have access to the ANY23 wiki, my user name is: nisala12. Regards Nisala DONE. Apologies for the delay. Best. Lewis
Re: GSoC 2015 with Apache any23
Hi Lewis, Thanks for adding me to the wiki. Can you please give some comments to my previous mail on this thread regarding the microformat parser? Regards Nisala On Thu, Mar 26, 2015 at 7:07 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Nisala, On Thu, Mar 26, 2015 at 12:23 AM, dev-digest-h...@any23.apache.org wrote: Hi all, The test failures I came across have been reported recently with https://issues.apache.org/jira/browse/ANY23-256. Can I have access to the ANY23 wiki, my user name is: nisala12. Regards Nisala DONE. Apologies for the delay. Best. Lewis
Re: GSoC 2015 with Apache any23
Hi all, The test failures I came across have been reported recently with https://issues.apache.org/jira/browse/ANY23-256. Can I have access to the ANY23 wiki, my user name is: nisala12. Regards Nisala On Thu, Mar 19, 2015 at 9:52 PM, Nisala Mendis nisal...@gmail.com wrote: Hi all, I missed this response completely, sorry for the rate reply, I bit researched on microformats 2 as Any23 currently support only microformats parser support. This is area something I would like to work on. There are couple of questions I need clarify, if we want to retain original microformat support, then we should implement microfromat2 parsers from the scratch as there no native java libraries available for this. Actually there are good implementations available for .net and python. Is there any way we could reuse this libraries. Or can this GSoC be involved porting such library to java and integration to the any23 ? Regards Nisala On Wed, Mar 11, 2015 at 6:49 AM, Peter Ansell ansell.pe...@gmail.com wrote: Hi Nisala, As Lewis mentions, the most important thing when reporting issues with open source code is to include a small stack trace or error message in the initial report. If we need more we will request it, but we need some indication of the specific issue before we can help. Lewis, can I have edit access to the ANY23 wiki. My username is: ansell Thanks, Peter On 10 March 2015 at 05:23, Nisala Mendis nisal...@gmail.com wrote: Hi all, I checked out the code from git and was able to successfully build it. But there were some core component test failures as well as plugin component integration-test failures. But was able to remove some test dependencies from pom files and was able to build without tests. I am trying to be familiarize with the product while using parsers and extractors. The idea of issue [Any23-249] is so broad It will need some pointers of guidance in order grasp the issue and heading in the correct drection. Anyway I have good knowledge some the data formats any23 currently support etc JSON, XML etc. I will try to review these parsers and extractors over those data by going through samples over them using tool whether these are upto date to the standards. I will come with my findings when I got through those back in this mailing list. Thanks Nisala
GSoC 2015 with Apache any23
Hi all, I checked out the code from git and was able to successfully build it. But there were some core component test failures as well as plugin component integration-test failures. But was able to remove some test dependencies from pom files and was able to build without tests. I am trying to be familiarize with the product while using parsers and extractors. The idea of issue [Any23-249] is so broad It will need some pointers of guidance in order grasp the issue and heading in the correct drection. Anyway I have good knowledge some the data formats any23 currently support etc JSON, XML etc. I will try to review these parsers and extractors over those data by going through samples over them using tool whether these are upto date to the standards. I will come with my findings when I got through those back in this mailing list. Thanks Nisala