[Corpora-List]Re: Complex Word Identification in French

2022-06-23 Thread Ada Wan
Hi Milos, Pardon my late reply. I actually took the time and found some joy in reading Vít's dissertation (thanks for pointing me to it) --- a "kindred spirit" to my paper in various versions (FaiR short , R ,

[Corpora-List]Re: Complex Word Identification in French

2022-06-22 Thread Miloš Jakubíček
Hi Ada, a very good paper (and lot of work done - congrats!) and a very interesting thread. Clearly linguistics as a field is terribly lacking some unified taxonomy (compared to biology, chemistry, whatever -- the difference is rather striking) and yes, this is causing serious trouble in NLP and

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
Hi Linas, As also a native EN speaker myself, I know "w*rds" is a very colloquial term that gets used often. It got "cemented" via computational implementation* and hence reinforced people's idea of what grammar is or ought to be. I am not "undoing" EN writing (that'd be nonsense --- writing is a

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Linas Vepstas
Hi Ada, In the English language, "words" are a thing. Children are taught to place spaces between "words". You're not going to undo a millennium-worth of English writing by discouraging the use of words. Much of Latin was written without blank spaces to denote word boundaries. In Chinese

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
I used "term" bc it makes room for a little bit of (mental) shifting for some ppl... Everyone (non-specialists included) uses "w*rd". Nothing is 100% --- when it comes to "language" or abstract concepts (or everything in the empirical world?), but 99% is better than 98 or 60%. (E.g. we may have

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
WAR IS PEACE FREEDOM IS SLAVERY IGNORANCE IS STRENGTH You are a flaw in the pattern, Winston. You are a stain that must be wiped out. Did I not tell you just now that we are different from the persecutors of the past? We are not content with negative obedience, nor even with the most abject

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
Yeah... I really don't know what to do with "the resistance", "the ignorance" (as in, both the practice of intentionally ignoring my results, and otherwise)... etc.. Many of us are so used to both naming and processing at such granularity... it'd take the whole world for change to happen and I

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
Just to clarify my position, I don't actually think that the En. lexeme “w*rd” is easy to define, precise or theoretically well-founded (I prefer “lexeme” here, as Ada's previous use of “term” is improper from a wusterian point of view, given that “w*rd” lacks distinctive traits due to its

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
Not to mention all these shamefully unscientific posts on Corporalist: /12th International Global W*rdnet Conference Donostia / San Sebastian, Basque Country 23-27, 2023 Global W*rdnet Association: www.globalw*rdnet.org// //Conference website: https://hitz.eus/gwc2023/ /18th Workshop on

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
@Daniel: Yeah, I think our whole field could benefit from a curriculum update... On Mon, Jun 20, 2022 at 9:47 PM Daniel HENKEL wrote: > Looks as if Linguistlist is in need of some scientific enlightenment as > well : > > http://linguistlist.org/issues/33/33-2063.html > > *In the new, thoroughly

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Daniel HENKEL
Looks as if Linguistlist is in need of some scientific enlightenment as well : http://linguistlist.org/issues/33/33-2063.html /In the new, thoroughly revised second edition of W*rds of Wonder: Endangered Languages and What They Tell Us, Second Edition (formerly called Dying W*rds: Endangered

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
(I just expounded on a point as a twitter reply today re the granularity of one's thinking/processing. Pls feel free to read that also.) One can think of it in a less binary manner --- not "good" vs "bad", not "words" then "sentences", but to think of an utterance/sequence with all the finer

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Sylvain Kahane
“We’re destroying words–scores of them, hundreds of them, every day. We’re cutting the language down to the bone.” […] “It’s a beautiful thing, the destruction of words. Of course the great advantage is in the verbs and adjectives, but there are hundreds of nouns that can be got rid of as

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
Hi Christopher, It is of the best interest of the community to discontinue the usage of "word". The term is not only very shaky in its foundation (if any), but it can also effect disparity in performance in computational processing and robustness when human evaluation is involved. Despite the