[Wikitech-l] Call for participation in OpenSym 2015, Aug 19-20, San Francisco!

2015-07-04 Thread Dirk Riehle
Call for participation in OpenSym 2015! Aug 19-20, 2015, San Francisco, http://opensym.org FOUR FANTASTIC KEYNOTES: Richard Gabriel (IBM) on Using Machines to Manage Public Sentiment on Social Media; Peter Norvig (GOOGLE) on Applying Machine Learning to Programs; Robert Glushko (UC

[Wikitech-l] WikiSym + OpenSym 2013: Less than 2 weeks for Community Track Submissions

2013-05-07 Thread Dirk Riehle
for regular submissions: May 31, 2013 * Camera-ready for both rounds: June 9, 2013. As long as it is May 17 somewhere on earth, your submission will be accepted. COMMUNITY TRACK PROGRAM COMMITTEE Chairs: Regis Barondeau (Université du Québec à Montréal), Dirk Riehle (Friedrich-Alexander

Re: [Wikitech-l] programmatically extracting lists from list pages on Wikipedia

2011-11-22 Thread Dirk Riehle
Try the Sweble parser for extracting structured data from Wikitext: http://sweble.org. http://dirkriehle.com, +49 157 8153 4150, +1 650 450 8550. On Nov 22, 2011 9:35 PM, Fred Zimmerman <zimzaz@gmail.com> wrote: hi, I want to programmatically extract lists from list pages on Wikipedia. That

Re: [Wikitech-l] Announcing Wikihadoop: using Hadoop to analyze Wikipedia dump files

2011-09-14 Thread Dirk Riehle
Hello everyone! Wikihadoop sounds like a great project! I wanted to point out that you can make it even more powerful for many research applications by combining it with the Sweble Wikitext parser. Doing so, you could enable Wikipedia dump processing not only on the rough XML dump level, but

Re: [Wikitech-l] WYSIWYG and parser plans (was What is wrong with Wikia's WYSIWYG?)

2011-05-03 Thread Dirk Riehle
On 05/03/2011 08:28 PM, Neil Harris wrote: On 03/05/11 19:44, MZMcBride wrote: ... The point is that the wikitext and its parsing should be completely separate from MediaWiki/PHP/HipHop/Zend. I think some of the bigger picture is getting lost here. Wikimedia produces XML dumps that contain

Re: [Wikitech-l] Announcing the Open Source Sweble Wikitext Parser v1.0

2011-05-01 Thread Dirk Riehle
You should identify whether you mean MediaWikitext, or some other dialect -- MediaWiki Is Not The Only Wiki... and you should post to wikitext-l as well. The real parser maniacs hang out over there, even though traffic is low. It is MediaWiki's Wikitext; elsewhere it is usually called wiki
