CfP: Entity Extraction and Linking Challenge (#Microposts2014 @ WWW2014)
===
Entity Extraction and Linking Challenge
at the 4th Making Sense of Microposts Workshop (#Microposts2014) @ WWW 2014
http://www.scc.lancs.ac.uk/microposts2014/challenge/index.html
7 April 2014, Seoul, Republic of Korea
===

Microposts are a highly popular medium for sharing facts, opinions and emotions. They are an invaluable wealth of data, ready to be mined for training predictive models.

This year the #Microposts2014 Workshop will host an Entity Extraction and Linking Challenge. The overall task of the challenge is to automatically extract entities from English microposts and link them to the corresponding English DBpedia v3.9 resources (if the linkage exists). In the linking stage we aim to disambiguate expressions formed by discrete (and typically short) sequences of words. Existing entity linking tools are intended for use over news corpora and similar corpora of relatively long documents. We organise this challenge to foster research into novel, more accurate solutions for automatic entity linking in (much shorter) micropost data.

We will ask the participants to automatically extract entities (e.g., Obama, London, Rakuten) belonging to all entity types (e.g., Person, Location, Organisation) from a collection of microposts. Participants will have to automatically provide context-relevant DBpedia resources for each entity in a micropost.

DATASET
---
The dataset comprises 3.5K tweets extracted from a much larger collection of over 18 million tweets. This collection, provided by the Redites project (http://demeter.inf.ed.ac.uk/redites/), covers event-annotated tweets collected from 15th July 2011 to 15th August 2011 (31 days). It spans multiple noteworthy events, including the death of Amy Winehouse, the London Riots and the Oslo bombing. Since the task of this challenge is to automatically extract and link entities, we have built our dataset from both event and non-event tweets.
While event tweets are more likely to contain entities, non-event tweets enable us to evaluate how well a system avoids false positives in the entity extraction phase. The dataset has been split into a training set (70%) and a test set (30%). Following the Twitter TOS, we will only provide tweet IDs and annotations for the training set, and tweet IDs for the test set. We will also provide a common framework to mine these datasets from Twitter.

The training set will be released as a TSV file in which each line consists of:
- tweet_id
- entity_mention_1
- entity_uri_1
…
- entity_mention_n
- entity_uri_n
Fields are separated by TABs. Entity mentions and URIs are listed in their order of appearance in the tweet.

We will announce the release of the data sets promptly on the workshop mailing list. Please subscribe to https://groups.google.com/d/forum/microposts2014. More information about dates is available on the Challenge website.

EVALUATION
--
The evaluation consists of two separate stages:
1.- Paper peer review: a community of domain experts will judge the quality and applicability of the approaches taken, to provide useful insights on your research;
2.- Precision and Recall: F1 (F-measure with beta = 1) will be computed on a gold standard manually created from the test set. Both the automatically extracted entities and the links will be matched against this ground truth.
Submissions will be ranked solely by F1, using each participant's best submission.

SUBMISSIONS
---
Submissions should be provided as a zip file using your system name as the file name (e.g. 'awesome.zip'), containing:
1. a TSV file with your system name (e.g. 'awesome.tsv'). We accept up to 3 different submissions, and we will consider *only* the best. If you submit more than one, you must specify clearly in your paper the modifications applied to each labelled submission.
In this case the submission should contain up to 3 TSV files, each named with the tool/system name with _n appended (e.g. awesome_1.tsv, awesome_2.tsv, awesome_3.tsv). In order to evaluate your submissions we require each TSV file to follow the format in which the training set is provided.
2. a paper of 6 pages describing your approach and how you tuned/tested it using the training split.

All submissions must be in English. Submissions must be PDF files formatted in the Springer Lecture Notes in Computer Science (LNCS) style [http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0]. For details on the LNCS style, see Springer's Author Instructions. Submissions are not anonymous. Please send us your submission before the deadline through Easychair.
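For participants checking their own output, the TSV layout and the F1 measure described in this call can be sketched as follows. This is only an illustrative sketch, not the official scorer: the call does not specify the matching rules, so the code assumes an annotation counts as correct only when the (tweet ID, mention, URI) triple matches the gold standard exactly, and that scores are pooled over all tweets. The function names are hypothetical.

```python
from typing import Iterable, Set, Tuple

# Assumption: one annotation is a (tweet_id, mention, uri) triple.
Annotation = Tuple[str, str, str]


def parse_annotations(lines: Iterable[str]) -> Set[Annotation]:
    """Read lines of the form: tweet_id TAB mention_1 TAB uri_1 ... TAB mention_n TAB uri_n."""
    annotations: Set[Annotation] = set()
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if not fields[0]:
            continue  # skip blank lines
        tweet_id, rest = fields[0], fields[1:]
        # Mentions and URIs alternate; a trailing mention without a URI is ignored.
        for mention, uri in zip(rest[0::2], rest[1::2]):
            annotations.add((tweet_id, mention, uri))
    return annotations


def precision_recall_f1(gold: Set[Annotation], predicted: Set[Annotation]):
    """Exact-match precision, recall and F1 (beta = 1) over annotation triples."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Made-up tweet IDs and annotations, for illustration only.
    gold = parse_annotations([
        "1\tObama\thttp://dbpedia.org/resource/Barack_Obama"
        "\tLondon\thttp://dbpedia.org/resource/London\n",
        "2\n",  # a tweet with no entities
    ])
    predicted = parse_annotations([
        "1\tObama\thttp://dbpedia.org/resource/Barack_Obama\n",
        "2\tRakuten\thttp://dbpedia.org/resource/Rakuten\n",
    ])
    print(precision_recall_f1(gold, predicted))
```

Because annotations are stored in a set, a mention repeated within one tweet with the same URI is counted once; a scorer that counts each occurrence would need offsets, which the format above does not include.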
Open Research Assistant Position in Semantic Web/Life Sciences at INSIGHT Galway (formerly DERI Galway) Deadline Jan 10, 2014
[Apologies for cross-posting]

*Open Position: Research Assistant – Life Sciences and the Semantic Web, INSIGHT @ NUI Galway (formerly DERI Galway)*

In collaboration with our industrial partners, the successful applicant will play a key role working on projects in the Bioinformatics and Systems Biology research unit. Bioinformatics and Systems Biology research at INSIGHT @ NUI Galway focuses on applying Semantic Web technologies to improve and enable discovery in the Life Sciences. Candidates must have a passion for solving real-world problems in the Health Care, Life Sciences and Bioinformatics domain(s). The position will involve developing and applying technologies relating to the Semantic Web, Linked Open Data, SPARQL Querying, Query Federation and Data Integration.

*Research Role*

The successful candidate will contribute to research and commercialisation activities within the Life Sciences research domain of INSIGHT @ NUI Galway (formerly DERI). The position will involve:
- Working within an enthusiastic team to apply Semantic Web technologies to the Life Sciences domain.
- Interacting with Industry and Academic partners working on projects that aim to change the field.
- Active involvement in scientific dissemination of research and project results via conference and journal publications.
- Support for further project acquisition activities.

The position is full-time, located at INSIGHT @ NUI Galway. The position is for 12 months in the first instance, with possible extensions. The position is available immediately and we expect the successful candidate to start at the earliest possible date. The salary will be in the range 25,425 to 28,938 euro, depending on qualifications and experience.

*Candidates sought*

Applicants need significant programming experience, competent research skills and excellent English.
The requirements are as follows:
- Master's degree or equivalent in Computer Science, Bioinformatics or a relevant subject.
- Strong programming skills with a minimum of three years' software development experience and proven knowledge of software design, development and maintenance processes in languages such as Java.
- Demonstrable experience in Semantic Web or Linked Data technologies such as RDF and SPARQL.
- Creativity and enthusiasm, with excellent communication and scientific writing skills and the ability to collaborate with industrial and academic partners.
- Ability to work independently as well as in a team environment.

Other desirable criteria for applicants include:
- Experience of databases, data stores, ontologies, RDFS, OWL, and SPARQL Federation.
- Working experience in Health Care and Life Science/Bioinformatics projects, with knowledge of Bio-Ontologies, SPARQL endpoints, Life Science Datasets and Federated Query Processing. Experience working with biomedical Linked Datasets (including Bio2RDF, LinkedCT and the FU Berlin Linked datasets) is particularly welcome.

*Application*

Applicants should send a cover letter, curriculum vitae, a list of accepted publications (if any) and the names and contact details of at least three referees (in English), via email (Word or PDF only) to hr...@deri.org and ali.hasn...@deri.org with the subject "Research Assistant – Life Sciences and the Semantic Web" by Friday, January 10, 2014.

*Contact*

Enquiries about the position may be made to Prof. Stefan Decker, Director of INSIGHT Galway: stefan.dec...@deri.org. For informal discussion about this post please contact: ali.hasn...@deri.org.

See the following page for further information: https://deri.ie/content/research-assistant-%E2%80%93-life-sciences-and-semantic-web-insight-nui-galway

___
deri.ie-all mailing list
deri.ie-...@lists.deri.org
http://lists.deri.org/mailman/listinfo/deri.ie-all
[Final CfP]: Uncertainty and Imprecision on the Web of Data @ IPMU 2014
Apologies for cross-postings

--- CALL FOR PAPERS ---

Uncertainty and Imprecision on the Web of Data
July 15-19, 2014
Montpellier, France

Special Session *at the* 15th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems
http://www.ipmu2014.univ-montp2.fr

Submission deadline: December 31, 2013

***Short description***

Phenomena related to uncertainty and imprecision are common on the Web of Data. On the one hand, data published as linked open data are often incomplete and of variable quality; in many real-world applications we frequently have to deal with missing, imperfect, vague and imprecise data. On the other hand, the meta-models of these data are often inherently uncertain or imprecise, and dealing with these resources requires a suitable framework. Although active research addressing these issues has been conducted recently, handling uncertainty and imprecision is still an open problem in the context of the Web of Data.

The goal of this special session is to bring together researchers working in the field of imprecise/uncertain knowledge and data management who are interested in linked open data technologies. The session will address problems related to handling imprecision and/or uncertainty of data and ontologies in the processes of publishing, interconnecting and querying data following the Linked Data principles. Two major (partly intersecting) communities are targeted: (1) the community dealing with reasoning under uncertainty and (2) the community focusing on knowledge discovery, data mining, data integration and information retrieval when data are fuzzy, imperfect or imprecise.
The topics of interest can be articulated along the following axes (the list being non-exhaustive):
* Fuzzy ontological languages
* Linking of imperfect/imprecise/vague data
* Representation of uncertain links
* Fuzzy/probabilistic/approximate ontology matching
* Reasoning techniques under uncertainty and fuzziness
* Imprecision and uncertainty in specific domains, e.g.:
  - the biological and bio-medical domains
  - the geo-spatial domain
  - trust, provenance and security
  - multimedia
  - multilingualism
* Quality of open data
* Fuzzy data mining and knowledge extraction
* Querying warehouses opened on the web: imprecise queries and approximate answers

***Submissions***

Contributions to the special session take the form of papers, which will undergo the standard reviewing process of the IPMU 2014 conference. Complete information regarding the submission process can be found at the conference website, http://www.ipmu2014.univ-montp2.fr, in the section Program - Special Sessions. During submission, note that the name of the special session will appear (and has to be selected) in the list of conference tracks on the Easychair site. The accepted papers will be published in the proceedings of IPMU 2014.
***Organizers***

Zohra Bellahsene
Anne Laurent
François Scharffe
Konstantin Todorov (Main contact)
LIRMM / University of Montpellier 2
contact: {firstname.lastname}@lirmm.fr

***Program Committee***

Jamal Atif / Université Paris 11, France
Zohra Bellahsene / University of Montpellier 2, France
Isabelle Bloch / Télécom ParisTech – LTCI, France
Fernando Bobillo / University of Zaragoza, Spain
Silvia Calegari / University of Milano, Italy
Nicola Fanizzi / University of Bari, Italy
Peter Geibel / Charité Berlin, Germany
Celine Hudelot / MAS - ECP, France
Souhila Kaci / University of Montpellier 2, France
Anne Laurent / University of Montpellier 2, France
Olivier Pivert / IRISA, France
François Scharffe / University of Montpellier 2, France
Umberto Straccia / ISTI - CNR, Italy
Matthias Thimm / Universität Koblenz, Germany
Konstantin Todorov / University of Montpellier 2, France
Serena Villata / INRIA, France
Re: [Final CfP]: Uncertainty and Imprecision on the Web of Data @ IPMU 2014
Hi Konstantin,

As an irony fan, I note that the conference on Uncertainty and Imprecision begins the day after Bastille Day. Interesting comment on the revolutionary character of the Web of Data :-)

Uncertainty and imprecision cannot be solved by disapproval. If you took a stack of 10 Trillion Dollar Bills (10^14) (representing Water Molecules) then a Chemist would tell you that one bill in the pile has no picture of George Washington and one bill has two pictures of George Washington. Quantum Mechanics is weird and it makes economists' heads blow up.

Anyway ... I doubt I would be able to attend but can offer some organized, if not exactly structured (the American Public Domain is not structured, just free) data for visualizations.

http://lists.w3.org/Archives/Public/public-egovernance/2013Dec/.html (the problem)
http://www.rustprivacy.org/2013/education/fednet.html (the why's of the broad strokes model)
http://www.rustprivacy.org/2013/education/ (applicability to local Education issues)

--Gannon