CfP: Entity Extraction and Linking Challenge (#Microposts2014 @ WWW2014)

2013-12-20 Thread Giuseppe Rizzo

===
  Entity Extraction and Linking Challenge
   at the 4th Making Sense of Microposts Workshop
   (#Microposts2014) @ WWW 2014
   http://www.scc.lancs.ac.uk/microposts2014/challenge/index.html
7 April 2014, Seoul, Republic of Korea
===

Microposts are a highly popular medium to share facts, opinions or 
emotions. They are an invaluable wealth of data, ready to be mined for 
training predictive models. This year the #Microposts2014 Workshop 
will host an Entity Extraction and Linking Challenge.
The overall task of the challenge is to automatically extract entities 
from English microposts, and link them to the corresponding English 
DBpedia v3.9 resources (if the linkage exists). In the linking stage we 
aim to disambiguate expressions that are formed by discrete (and 
typically short) sequences of words.
Existing entity linking tools are designed for news corpora and similar 
collections of relatively long documents. We organise this challenge to 
foster research into novel, more accurate solutions for automatic 
entity linking in (much shorter) micropost data.
We will ask the participants to automatically extract entities (e.g., 
Obama, London, Rakuten) belonging to all entity types (e.g., Person, 
Location, Organisation) from a collection of microposts. Participants 
will have to automatically provide context-relevant DBpedia resources 
for each entity in a micropost.


DATASET
---
The dataset comprises 3.5K tweets extracted from a much larger 
collection of over 18 million tweets. This collection, provided by the 
Redites project (http://demeter.inf.ed.ac.uk/redites/), covers 
event-annotated tweets collected for the period of 15th July 2011 to 
15th August 2011 (31 days). It extends over multiple noteworthy events 
including the death of Amy Winehouse, the London Riots and the Oslo 
bombing. Since the task of this challenge is to automatically extract 
and link entities, we have built our dataset considering both event and 
non-event tweets. While event tweets are more likely to contain 
entities, non-event tweets enable us to evaluate the performance of the 
system in avoiding false positives in the entity extraction phase.


The dataset has been split into training (70%) and test (30%) sets. 
Following the Twitter TOS we will only provide tweet IDs and annotations 
for the training set; and tweet IDs for the test set. We will also 
provide a common framework to mine these datasets from Twitter.


The training set will be released as a TSV file in which each line consists of:
- tweet_id
- entity_mention_1
- entity_uri_1
…
- entity_mention_n
- entity_uri_n
Fields are separated by TABs. Entity mentions and URIs are listed in 
the order in which they appear in the tweet.
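
As an illustration of the line format described above, here is a minimal Python sketch of a reader for one line. The function name `parse_annotation_line`, the sample tweet ID and the sample line are hypothetical and not part of the challenge framework. The sketch also assumes that a tweet without entities is a line holding only the tweet ID; how unlinked entities are encoded is not stated in this call, so that case is not handled.

```python
from typing import List, Tuple


def parse_annotation_line(line: str) -> Tuple[str, List[Tuple[str, str]]]:
    """Split one TAB-separated line into a tweet ID and (mention, URI) pairs.

    Assumed layout (from the call): tweet_id, then alternating
    entity_mention_i / entity_uri_i fields in order of appearance.
    """
    fields = line.rstrip("\r\n").split("\t")
    tweet_id, rest = fields[0], fields[1:]
    if len(rest) % 2 != 0:
        # Every mention is expected to be followed by exactly one URI field.
        raise ValueError(f"unpaired mention/URI field in line for tweet {tweet_id}")
    # Pair field 0 with field 1, field 2 with field 3, and so on.
    pairs = list(zip(rest[0::2], rest[1::2]))
    return tweet_id, pairs


if __name__ == "__main__":
    # Hypothetical sample line; the tweet ID is made up for illustration.
    sample = (
        "123\tObama\thttp://dbpedia.org/resource/Barack_Obama"
        "\tLondon\thttp://dbpedia.org/resource/London\n"
    )
    tweet_id, pairs = parse_annotation_line(sample)
    print(tweet_id)
    for mention, uri in pairs:
        print(mention, uri)
```

Keeping the pairs in a list preserves the appearance order required by the format.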


We will announce the release of the data sets in good time on the 
workshop mailing list. Please subscribe to 
https://groups.google.com/d/forum/microposts2014. More information about 
dates is available on the Challenge website.


EVALUATION
--
The evaluation consists of two separate stages:
1.- Paper peer review: A community of domain experts will judge 
the quality and applicability of the approaches taken, to provide useful 
insights on your research;
2.- Precision and Recall: F1 (F-measure with beta = 1) will be computed 
on a gold standard manually created from the test set. Both the automatically 
extracted entities and the links will be matched against this ground truth.


Submissions will be ranked solely according to the F1 of each 
participant's best submission.
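
For reference, the F1 score named above is the harmonic mean of precision and recall. The following Python sketch shows one way to compute it. It assumes, purely for illustration, that annotations are compared as exact (tweet_id, mention, uri) triples pooled over the whole test set; the actual matching criteria used by the organisers are not specified in this call, and the function name `precision_recall_f1` is hypothetical.

```python
from typing import Set, Tuple

# Assumption for illustration: one annotation is a (tweet_id, mention, uri) triple.
Annotation = Tuple[str, str, str]


def precision_recall_f1(
    predicted: Set[Annotation], gold: Set[Annotation]
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for exact-match annotation sets."""
    true_positives = len(predicted & gold)
    # Guard against empty sets to avoid division by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F-measure with beta = 1: harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical data: three gold annotations, two predictions, one correct.
    gold = {
        ("1", "Obama", "http://dbpedia.org/resource/Barack_Obama"),
        ("1", "London", "http://dbpedia.org/resource/London"),
        ("2", "Rakuten", "http://dbpedia.org/resource/Rakuten"),
    }
    predicted = {
        ("1", "Obama", "http://dbpedia.org/resource/Barack_Obama"),
        ("2", "Rakuten", "http://dbpedia.org/resource/Rakuten_Eagles"),
    }
    print(precision_recall_f1(predicted, gold))
```

Under this exact-match assumption, a correct mention with a wrong link counts as both a false positive and a false negative, which is why non-event tweets in the test set matter for precision.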


SUBMISSIONS
---
Submissions should be provided as a zip file using your system name as 
the file name (e.g. 'awesome.zip'), containing:


1. a TSV file with your system name (e.g. 'awesome.tsv'). We accept up 
to 3 different submissions, and we will consider *only* the best. If you 
submit more than one, you must specify clearly in your paper the 
modifications applied to each labelled submission. In this case the zip 
file should contain up to 3 TSV files, each named after the tool/system 
with _n appended (e.g. awesome_1.tsv, awesome_2.tsv, awesome_3.tsv).
Each TSV file must follow the format in which the training set is 
provided, so that we can evaluate your submissions.


2. a paper of 6 pages describing your approach and how you tuned/tested 
it using the training split. All submissions must be in English and 
in PDF, formatted in the Springer Lecture Notes in Computer Science 
(LNCS) style 
[http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0]. For 
details on the LNCS style, see Springer’s Author Instructions. 
Submissions are not anonymous. Please send us your submission before the 
deadline through Easychair 

Open Research Assistant Position in Semantic Web/Life Sciences at INSIGHT Galway (formerly DERI Galway) Deadline Jan 10, 2014

2013-12-20 Thread Stephane
[Apologies for cross-posting]

*Open Position: Research Assistant – Life Sciences and the Semantic Web 
INSIGHT @ NUI Galway (formerly DERI Galway)*

In collaboration with our industrial partners, the successful applicant 
will play a key role working on projects in the Bioinformatics and 
Systems Biology research unit. Bioinformatics and Systems Biology 
research in INSIGHT@NUI Galway focuses on applying Semantic Web 
technologies for improving and enabling discovery in Life Sciences.

Candidates must have a passion for solving real-world problems in the 
Health Care, Life Sciences and Bioinformatics domain(s). The position 
will involve developing and applying technologies relating to the 
Semantic Web, Linked Open Data, SPARQL Querying, Query Federation and 
Data Integration.


*Research Role*

The successful candidate will contribute to research and 
commercialisation activities within the Life Sciences research domain of 
INSIGHT@NUI Galway (formerly DERI). The position will involve:

- Working within an enthusiastic team to apply Semantic Web technologies 
to the Life Sciences domain.
- Interacting with Industry and Academic partners working on projects 
that aim to change the field.
- Active involvement in scientific dissemination of research and project 
results via conference and journal publications.
- Support for further project acquisition activities.

The position is full-time, located at INSIGHT @ NUI Galway. The position 
is for 12 months in the first instance, with possible extensions.

The position is available immediately and we expect the successful 
candidate to start at the earliest possible date.

The salary will be in the range 25,425 to 28,938 euro, depending on 
qualifications and experience.


*Candidates sought*

Applicants need significant programming experience, competent research 
skills and excellent English.

The requirements are as follows:

- Master's degree or equivalent in Computer Science/ Bioinformatics or 
relevant subjects
- Strong programming skills with a minimum of three years' software 
development experience and proven knowledge of software design, 
development and maintenance processes in languages such as Java.
- Demonstrable experience in Semantic Web or Linked Data technologies 
such as RDF and SPARQL.
- The applicant should be creative, enthusiastic, with excellent 
communication and scientific writing skills and the ability to 
collaborate with industrial and academic partners.
- Ability to work independently as well as in a team environment.

Other desirable criteria for applicants include:

- Experience of databases, data stores, ontologies, RDFS, OWL, and 
SPARQL Federation.
- Working experience on Health Care and Life Sciences/Bioinformatics 
projects with knowledge of Bio-Ontologies, SPARQL endpoints, Life 
Science Datasets and Federated Query Processing. Experience working with 
biomedical Linked Datasets, including Bio2RDF, LinkedCT and the FU Berlin 
Linked datasets, is particularly welcome.


*Application*

Applicants should send a cover letter, curriculum vitae, a list of 
accepted publications (if any) and the names and contact details of at 
least three referees (in English) via email (Word or PDF only) 
to hr...@deri.org and ali.hasn...@deri.org with the subject 
"Research Assistant – Life Sciences and the Semantic Web" 
by Friday, January 10, 2014.


*Contact*

Enquiries about the position may be made to Prof. Stefan Decker, 
Director of INSIGHT Galway: stefan.dec...@deri.org.

For informal discussion about this post please contact: 
ali.hasn...@deri.org.

See the following page for further information: 
https://deri.ie/content/research-assistant-%E2%80%93-life-sciences-and-semantic-web-insight-nui-galway





[Final CfP]: Uncertainty and Imprecision on the Web of Data @ IPMU 2014

2013-12-20 Thread Konstantin Todorov
Apologies for cross-postings
---
CALL FOR PAPERS
---

Uncertainty and Imprecision on the Web of Data

July 15-19, 2014
Montpellier, France
--
Special Session
*at the*
15th International Conference on Information Processing and Management of
Uncertainty in Knowledge-Based Systems
http://www.ipmu2014.univ-montp2.fr

Submission deadline December 31, 2013

-

***Short description***

Phenomena related to uncertainty and imprecision are common on the Web of
Data. On the one hand, data published as linked open data are often
incomplete and of variable quality; in many real-world applications we
frequently have to deal with missing, imperfect, vague and imprecise data.
On the other hand, the meta-models of these data are often inherently
uncertain or imprecise, and dealing with these resources
requires a suitable framework. Although active research addressing these
issues has been conducted recently, handling uncertainty and imprecision is
still an open problem in the context of the Web of Data.


The goal of this special session is to bring together researchers working
in the field of imprecise/uncertain knowledge and data management and
interested in linked open data technologies. The session will address
problems related to handling imprecision and/or uncertainty of data and
ontologies in the processes of publishing, interconnecting and querying
data by following the Linked Data principles. Two major (partly
intersecting) communities are targeted: (1) the community dealing with
reasoning under uncertainty and (2) the community focusing on knowledge
discovery, data mining, data integration and information retrieval when
data are fuzzy, imperfect or imprecise.


The topics of interest can be articulated along the following axes (the
list being non-exhaustive):


* Fuzzy ontological languages

* Linking of imperfect/imprecise/vague data

* Representation of uncertain links

* Fuzzy/probabilistic/approximate ontology matching

* Reasoning techniques under uncertainty and fuzziness

* Imprecision and uncertainty in specific domains, e.g.:

  - the biological and bio-medical domains

  - the geo-spatial domain

  - trust, provenance and security

  - multimedia

  - multilingualism

* Quality of open data

* Fuzzy data mining and knowledge extraction

* Querying warehouses opened on the web: imprecise queries and approximate
answers


***Submissions***

Contributions to the special session can be made in the form of papers, which
will undergo the standard reviewing process of the IPMU 2014 conference.
Complete information regarding the submission process can be found at the
conference website: http://www.ipmu2014.univ-montp2.fr, more precisely in
the section Program - Special Sessions. In the submission process, note
that the name of the special session will appear (and has to be selected)
in the list of conference tracks on the Easychair site. The accepted papers
will be published in the proceedings of IPMU 2014.


***Organizers***

Zohra Bellahsene

Anne Laurent

François Scharffe

Konstantin Todorov (Main contact)


LIRMM / University of Montpellier 2

contact: {firstname.lastname}@lirmm.fr


***Program Committee***

Jamal Atif / Université Paris 11, France

Zohra Bellahsene / University of Montpellier 2, France

Isabelle Bloch / Télécom ParisTech – LTCI, France

Fernando Bobillo / University of Zaragoza, Spain

Silvia Calegari / University of Milano, Italy

Nicola Fanizzi / University of Bari, Italy

Peter Geibel / Charité Berlin, Germany

Celine Hudelot / MAS - ECP, France

Souhila Kaci / University of Montpellier 2, France

Anne Laurent / University of Montpellier 2, France

Olivier Pivert / IRISA, France

François Scharffe / University of Montpellier 2, France

Umberto Straccia / ISTI - CNR, Italy

Matthias Thimm / Universität Koblenz, Germany

Konstantin Todorov / University of Montpellier 2, France

Serena Villata / INRIA, France


Re: [Final CfP]: Uncertainty and Imprecision on the Web of Data @ IPMU 2014

2013-12-20 Thread Gannon Dick
Hi Konstantin,

As an irony fan, I note that the conference on Uncertainty and Imprecision 
begins the day after Bastille Day.  Interesting comment on the revolutionary 
character of the Web of Data :-)

Uncertainty and imprecision cannot be solved by disapproval.  If you took a 
stack of 10 Trillion Dollar Bills (10^14) (representing Water Molecules) then a 
Chemist would tell you that one bill in the pile has no picture of George 
Washington and one bill has two pictures of George Washington.  Quantum 
Mechanics is weird and it makes Economists' heads blow up.  Anyway ...  

I doubt I would be able to attend but can offer some organized, if not exactly 
structured (the American Public Domain is not structured, just free) data for 
visualizations.  

http://lists.w3.org/Archives/Public/public-egovernance/2013Dec/.html
(the problem)

http://www.rustprivacy.org/2013/education/fednet.html
(the why's of the broad strokes model)

http://www.rustprivacy.org/2013/education/
(applicability to local Education issues)

--Gannon

On Fri, 12/20/13, Konstantin Todorov konstantin@gmail.com wrote:

 Subject: [Final CfP]: Uncertainty and Imprecision on the Web of Data @ IPMU 2014
 To: public-lod@w3.org
 Date: Friday, December 20, 2013, 7:00 AM
 