Release of BabelNet 5.3 https://babelnet.org
We are proud to announce the release of a new version of BabelNet <https://babelnet.org/> and its programmatic *Java and Python API*, developed jointly by the Sapienza NLP Group <http://nlp.uniroma1.it> of *Sapienza University of Rome*, under the supervision of Prof. Roberto Navigli <https://www.diag.uniroma1.it/navigli/>, and by Babelscape <http://babelscape.com/>, *a deep-tech multilingual NLP company* providing innovative solutions for natural language understanding.

BabelNet -- winner of the *prominent paper award 2017* from the Artificial Intelligence Journal and the META prize 2015, and covered in media such as The Guardian <https://www.theguardian.com/news/2018/feb/23/oxford-english-dictionary-can-worlds-biggest-dictionary-survive-internet> and Time Magazine <http://wwwusers.di.uniroma1.it/~navigli/img/Redefining_the_modern_dictionary.png> -- is today's *most far-reaching multilingual lexical-semantic knowledge graph*. Depending on need, it can be used as an *encyclopedic dictionary*, a *semantic network*, or a huge *knowledge base/ontology*, e.g., to be integrated into *deep learning solutions*. It has been used by more than *1000 universities and research institutions*, enabling multilinguality in several fields of AI and NLP, such as multilingual semantic search, Word Sense Disambiguation and Entity Linking, Semantic Role Labeling, image tagging, and semantically-enhanced multimodality.

BabelNet was created by means of the seamless integration and interlinking of the largest multilingual Web encyclopedia, i.e., Wikipedia, with the most popular computational lexicon of English, i.e., WordNet, and other lexical-semantic resources such as Wikidata, Wiktionary, OmegaWiki, dozens of wordnets (including Open English WordNet), GeoNames, and ImageNet. The BabelNet model is centered around *multilingual synsets*, i.e., concepts and named entities lexicalized in many languages and connected by a large number of semantic relations.
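As a purely illustrative sketch (this is not the BabelNet Java or Python API), the synset-centered model described above can be pictured with a toy data structure. The class `ToySynset`, the helper `related`, the `toy:` identifiers, and the relation name `hypernym` are all assumptions made up for this example; real BabelNet identifiers and relation types differ.

```python
from dataclasses import dataclass, field


@dataclass
class ToySynset:
    """Toy stand-in for a multilingual synset (hypothetical, not the BabelNet API)."""
    synset_id: str   # made-up identifier, not a real BabelNet ID
    kind: str        # "concept" or "named entity"
    # language code -> lexicalizations of this synset in that language
    lemmas: dict = field(default_factory=dict)
    # outgoing edges as (relation name, target synset id) pairs
    relations: list = field(default_factory=list)

    def lexicalizations(self, lang):
        """Return the lemmas for one language, or an empty list if none exist."""
        return self.lemmas.get(lang, [])


def related(graph, synset_id, relation):
    """Follow edges with the given relation name from one synset in the graph."""
    source = graph[synset_id]
    return [graph[target] for name, target in source.relations if name == relation]


# A two-node toy graph: "dog" is linked to "animal" by a hypernym edge.
animal = ToySynset("toy:animal", "concept", {"en": ["animal"], "it": ["animale"]})
dog = ToySynset(
    "toy:dog",
    "concept",
    {"en": ["dog"], "it": ["cane"], "fr": ["chien"], "de": ["Hund"]},
    [("hypernym", "toy:animal")],
)
graph = {s.synset_id: s for s in (animal, dog)}

# The same concept is reachable from any of its languages, and its edges
# are language-independent.
italian_lemmas = dog.lexicalizations("it")
parents = [s.synset_id for s in related(graph, "toy:dog", "hypernym")]
```

The point of the sketch is only that one node carries the lexicalizations for every language, so relations are stated once per concept rather than once per word.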

*Version 5.3* ships with the following features:

- *80 new languages* for a grand total of *600 languages*;
- *23 million synsets* covered;
- *Lemma casing updated in 24 languages*;
- *Wikipedia and Wikidata updated* thanks to BabelNet live (November 2023 dump);
- *Open English WordNet* has been updated to version 2023;
- *Images* associated with synsets have been updated;
- *Wiktionary* has been *updated* and *20k new concepts* have been integrated (November 2023 dump);
- *Significantly improved cross-lingual resource mapping*, ensuring more accurate and contextually relevant lexicalizations and translations;
- *General data cleanup* (glosses, senses, Named Entity vs. Concept labels);
- *Wikipedia multilingual labels updated.*

More statistics are available at: babelnet.org/statistics.

Kind regards,
The BabelNet group

--
==============================================
Roberto Navigli - Professor
Department of Computer, Control and Management Engineering
Sapienza University of Rome
Via Ariosto, 25
00185 Roma Italy
Phone: +39 06 77274109
Home Page: https://www.diag.uniroma1.it/navigli/
Sapienza NLP Group: http://nlp.uniroma1.it
Co-founder of Babelscape <https://babelscape.com>
==============================================