[Corpora-List]Call for Participation: Shared Task on "Survey Variable Identification in Social Science Publications"

2022-06-20 Thread Simone Paolo Ponzetto
Dear colleagues, we hope that this is interesting for some of you working at the intersection of NLP, CSS and Scholarly Data Mining — we just released the training data and look forward to your participation, thanks! Best - Simone _ Call for

[Corpora-List]Re: [EXTERNAL] Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
On Tue, Jun 21, 2022 at 12:14 AM Flor, Michael wrote: > The notion of 'word' has difficulties in linguistics. > But not enough for abandoning it. > > Except we don't need it at all --- for both human or machine processing. > The argument from the paper "Fairness in Representation for

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
Hi Linas, As also a native EN speaker myself, I know "w*rds" is a very colloquial term that gets used often. It got "cemented" via computational implementation* and hence reinforced people's idea of what grammar is or ought to be. I am not "undoing" EN writing (that'd be nonsense --- writing is a

[Corpora-List]Re: [EXTERNAL] Re: Complex Word Identification in French

2022-06-20 Thread Flor, Michael
The notion of 'word' has difficulties in linguistics. But not enough for abandoning it. The argument from the paper "Fairness in Representation for Multilingual NLP" is not convincing at all. Even if the early findings are correct for transformers , applicability to human language faculty is not

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Linas Vepstas
Hi Ada, In the English language, "words" are a thing. Children are taught to place spaces between "words". You're not going to undo a millennium-worth of English writing by discouraging the use of words. Much of Latin was written without blank spaces to denote word boundaries. In Chinese

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
I used "term" bc it makes room for a little bit of (mental) shifting for some ppl... Everyone (non-specialists included) uses "w*rd". Nothing is 100% --- when it comes to "language" or abstract concepts (or everything in the empirical world?), but 99% is better than 98 or 60%. (E.g. we may have

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
WAR IS PEACE FREEDOM IS SLAVERY IGNORANCE IS STRENGTH You are a flaw in the pattern, Winston. You are a stain that must be wiped out. Did I not tell you just now that we are different from the persecutors of the past? We are not content with negative obedience, nor even with the most abject

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
Yeah... I really don't know what to do with "the resistance", "the ignorance" (as in, both the practice of intentionally ignoring my results, and otherwise)... etc.. Many of us are so used to both naming and processing at such granularity... it'd take the whole world for change to happen and I

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
Just to clarify my position, I don't actually think that the En. lexeme “w*rd” is easy to define, precise or theoretically well-founded (I prefer “lexeme” here, as Ada's previous use of “term” is improper from a wusterian point of view, given that “w*rd” lacks distinctive traits due to its

[Corpora-List]The New Directions in Analyzing Text as Data (TADA) Conference

2022-06-20 Thread heather froehlich
*The New Directions in Analyzing Text as Data (TADA) Conference* October 6-7, 2022 at Cornell Tech, Roosevelt Island, New York City Deadline for abstract submission: July 18, 2022 Form for abstract submission: https://forms.gle/XSarfYQGAXArACEn9 The New Directions in Analyzing Text as Data

[Corpora-List] [CFP] 2022 AAAI Fall Symposium on “Artificial Intelligence for Human-Robot Interaction” (AI-HRI)

2022-06-20 Thread Ross Mead
[CFP] 2022 AAAI Fall Symposium on “Artificial Intelligence for Human-Robot Interaction” (AI-HRI) The ninth annual AAAI Fall Symposium on “Artificial Intelligence for Human-Robot Interaction” (AI-HRI) will take place on November 17-19, 2022, at the Westin Arlington Gateway in Arlington, VA, USA.

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
Not to mention all these shamefully unscientific posts on Corporalist: /12th International Global W*rdnet Conference Donostia / San Sebastian, Basque Country 23-27, 2023 Global W*rdnet Association: www.globalw*rdnet.org// //Conference website: https://hitz.eus/gwc2023/ /18th Workshop on

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
@Daniel: Yeah, I think our whole field could benefit from a curriculum update... On Mon, Jun 20, 2022 at 9:47 PM Daniel HENKEL wrote: > Looks as if Linguistlist is in need of some scientific enlightenment as > well : > > http://linguistlist.org/issues/33/33-2063.html > > *In the new, thoroughly

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
Looks as if Linguistlist is in need of some scientific enlightenment as well : http://linguistlist.org/issues/33/33-2063.html /In the new, thoroughly revised second edition of W*rds of Wonder: Endangered Languages and What They Tell Us, Second Edition (formerly called Dying W*rds: Endangered

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
(I just expounded on a point as a twitter reply today re the granularity of one's thinking/processing. Pls feel free to read that also.) One can think of it in a less binary manner --- not "good" vs "bad", not "words" then "sentences", but to think of an utterance/sequence with all the finer

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Sylvain Kahane
“We’re destroying words–scores of them, hundreds of them, every day. We’re cutting the language down to the bone.” […] “It’s a beautiful thing, the destruction of words. Of course the great advantage is in the verbs and adjectives, but there are hundreds of nouns that can be got rid of as

[Corpora-List]unsubscribe me please from this list

2022-06-20 Thread I G
___ UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora Corpora mailing list -- corpora@list.elra.info To unsubscribe send an email to corpora-le...@list.elra.info

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
Hi Christopher, It is of the best interest of the community to discontinue the usage of "word". The term is not only very shaky in its foundation (if any), but it can also effect disparity in performance in computational processing and robustness when human evaluation is involved. Despite the

[Corpora-List]Complex Word Identification in French

2022-06-20 Thread Christopher Collins
Hello, I'm looking for any open source or cloud-hosted solution for complex word identification or word difficulty rating in French for a reading application. As a backup plan we can use measures like corpus frequency, length, number of senses, but we're hoping someone has already made a tool

[Corpora-List]Post doc position: Semantics and Pragmatics of Emojis in Digital Communication

2022-06-20 Thread Tatjana Scheffler via Corpora
The project EmDiCom "Semantics and Pragmatics of Emojis in Digital Communication" will develop a formal semantics for emojis as a prime example of visual communication within the newly established DFG priority program ViCom (“Visual Communication. Theoretical, Empirical, and Applied

[Corpora-List]First Call for Papers Global WordNet Conference 2023

2022-06-20 Thread Itziar Gonzalez Dios
[Apologies for cross posting] 1st Call for Papers 12th International Global Wordnet Conference Donostia / San Sebastian, Basque Country 23-27, 2023 Global Wordnet Association: www.globalwordnet.org Conference website: https://hitz.eus/gwc2023 The Global Wordnet Association is pleased to

[Corpora-List]Two postdoc positions in Transfer Learning for Demographic Factors from September 2022

2022-06-20 Thread mail
2-year Postdoc position in Natural Language Processing on Incorporating Demographic Factors into Natural Language Processing Models Funded by ERC Starting grant INTEGRATOR Start: from September 2022 Dirk Hovy, Bocconi University and MilanLP

[Corpora-List] Fully-funded PhD position “Computer Science: Computational Linguistics & Corpus Annotation” (m/f/d), TIB – Leibniz Information Centre for Science and Technology, Germany

2022-06-20 Thread Jennifer D'Souza
**Deadline fast-approaching** Dear colleagues and friends, The Data Science and Digital Libraries at the TIB – Leibniz Information Centre for Science and Technology and University Library

[Corpora-List]Call for Participation: Shared Task on Indian Language Summarization (ILSUM 2022)

2022-06-20 Thread Parth Mehta
Apologies for the multiple postings. *Indian Language Summarization (ILSUM 2022)* Website: https://ilsum.github.io/ To be organized in conjunction with FIRE 2022 (fire.irsi.res.in) 9th-13th December 2022 (Hybrid Event, hosted in Kolkata) Registration Deadline*: 22nd July 2022*